1. 27 Sep, 2020 2 commits
    • add support to float64 input of warpctc op. (#27399) · 1501a80f
      Li Fuchen committed
      * add float64 input to ctc_loss
      
      * modified error message of warpctc
      
      * update repo and tag of warpctc
      
      * add test for warpctc with float64 input
      
      * modified warpctc.cmake to make sure build always
      
      * resolved sample code bug of warpctc
      
      * add core.ops in warpctc dygraph
      
      * fix a bug of test
    • support elementwise add, activation, matmul on Baidu Kunlun (#27143) · 6b727e08
      QingshuChen committed
      * support elementwise add, activation, matmul on Baidu Kunlun
      * test=kunlun
      
      * minor
      * test=kunlun
      
      * reconstruct the xpu directory
      * test=kunlun
      
      * minor
      * test=kunlun
      
      * minor
      * test=kunlun
      
      * minor
      * test=kunlun
      
      * minor
      * test=kunlun
      
      * minor
      * test=kunlun
  2. 26 Sep, 2020 1 commit
  3. 25 Sep, 2020 1 commit
  4. 24 Sep, 2020 3 commits
    • fix tensorrt 6 build error. test=develop (#27511) · 8f7bb52b
      Shibo Tao committed
      * fix tensorrt 6 build error. test=develop
      
      * fix. test=develop
      
      * bug fix
      
      * test=develop
    • use iwyu clean include (#27267) · df43905f
      wanghuancoder committed
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
    • Add GPU Kernels of Segment Ops, support, sum, max, min, mean · 4a9d21de
      Zhong Hui committed
      Add GPU Kernels of Segment Ops, support, sum, max, min, mean
  5. 23 Sep, 2020 2 commits
  6. 21 Sep, 2020 1 commit
  7. 18 Sep, 2020 1 commit
  8. 17 Sep, 2020 1 commit
  9. 15 Sep, 2020 1 commit
  10. 14 Sep, 2020 1 commit
  11. 07 Sep, 2020 1 commit
  12. 03 Sep, 2020 2 commits
  13. 28 Aug, 2020 1 commit
  14. 27 Aug, 2020 1 commit
  15. 26 Aug, 2020 1 commit
  16. 21 Aug, 2020 2 commits
  17. 19 Aug, 2020 1 commit
  18. 17 Aug, 2020 1 commit
  19. 16 Aug, 2020 1 commit
  20. 08 Aug, 2020 1 commit
  21. 07 Aug, 2020 2 commits
  22. 06 Aug, 2020 1 commit
    • Add oneDNN fusion_gru kernel (#25594) · 68c6160e
      Adam committed
      * Add oneDNN fusion_gru kernel and fix fc+gru pass
      test=develop
      
      * Formatting changes
      test=develop
      
      * Lint fixes
      test=develop
      
      * Add memory::format_tag::any to GRU weights
      test=develop
      
      * Fix build with CUDA
      
      * Fix build with CUDA v2
  23. 05 Aug, 2020 2 commits
    • [CUDNN8 support] : support CUDNN8 (#25664) · 358bc06c
      Zhaolong Xing committed
      * cudnn8 support
      test=develop
      
      * fix ci error
      test=develop
    • Fix registering trt plugin (#25744) · b717895f
      Pei Yang committed
      * develop dynamic shape serialization
      
      * add test param for gelu
      
      * fix bugs
      
      * delete redundant comments
      
      * debug
      
      * fix conflict. test=develop
      
      * fix bug. test=develop
      
      * add trt dynamic shape serialized support
      
      * fix ernie serialized bug
      test=develop
      
      * fix codestyle
      test=develop
      
      * fix bug
      test=develop
      
      * fix bug.test=develop
      
      * modify cmakelist test=develop
      
      * fix bug
      test=develop
      
      * fix error message.  test=develop
      
      * fix trt register plugin based on pr#25003
      
      * add trt dynload
      
      * fix deserialization bug of not finding plugin registration
      
      * refine code style
      
      * recover engine key in tensorrt_subgraph_pass
      
      * for ci coverage
      
      * add unittest for deserialization
      Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
  24. 04 Aug, 2020 1 commit
  25. 30 Jul, 2020 1 commit
  26. 29 Jul, 2020 2 commits
  27. 28 Jul, 2020 1 commit
    • Added DNNL cache management for DyGraph (#25624) · e52df3b1
      arlesniak committed
      * Added DNNL cache management for DyGraph
      
      * move FLAGS_use_mkldnn to more general CMakeLists, get use of the flag in ClearGradients
      
      * missing file
      
      * Fixes after review
      
      * Bringing back original idea of place for 'use_mkldnn' flag to be accessible from platform and imperative.
      
      * Removed duplicate and added docs
      
      * Fixes for CI
  28. 23 Jul, 2020 1 commit
  29. 20 Jul, 2020 1 commit
  30. 17 Jul, 2020 1 commit
  31. 15 Jul, 2020 1 commit