1. 08 Nov, 2021 1 commit
  2. 26 Oct, 2021 1 commit
    • [CPU-PSLIB] Fix bug for consistency inspection of op's embedding name and sparse table name in config_fleet.py (#36215) · 15cb05c8
      Committed by Fan Zhang
      
      * [CPU-PSLIB] Add consistency inspection of use_var_list and data_generator data
      
      * [CPU-PSLIB] Fix bug for consistency inspection of op's embedding name and sparse table name in config_fleet.py
  3. 07 Sep, 2021 2 commits
  4. 17 Aug, 2021 1 commit
  5. 12 Aug, 2021 1 commit
  6. 26 Jul, 2021 1 commit
  7. 16 Jul, 2021 1 commit
  8. 18 May, 2021 2 commits
  9. 09 Apr, 2021 1 commit
  10. 30 Mar, 2021 1 commit
    • Scale 1.8 (#31940) · c36c22fe
      Committed by Shang Zhizhou
      * add n-d input support for trt scale converter (#31316)
      
      * add n-d input support for trt scale converter
      
      * add flatten for ut
      
      * fix dims
      
      * fix batchnorm when input dims < 3 (#31933)
      
      * fix batchnorm when input dims < 3
      
      * add unittest for batchnorm dims = 2
      
      * fix unittest
      Co-authored-by: Pei Yang <peiyang@baidu.com>
  11. 24 Mar, 2021 1 commit
  12. 11 Mar, 2021 1 commit
  13. 04 Jan, 2021 1 commit
  14. 31 Dec, 2020 1 commit
  15. 09 Dec, 2020 1 commit
  16. 07 Dec, 2020 2 commits
  17. 03 Dec, 2020 2 commits
  18. 01 Dec, 2020 2 commits
  19. 25 Nov, 2020 1 commit
  20. 24 Nov, 2020 1 commit
  21. 20 Nov, 2020 1 commit
  22. 13 Nov, 2020 1 commit
    • Skip layernorm to 1.8 (#28583) · ec672e88
      Committed by Shang Zhizhou
      * Add TRT support for the pruned transformer model; fix the bug where TensorRT does not support DeletePass (#28517)
      
      * skip_layernorm_op done
      
      * add unittest
      
      * slice op convertor support trt < 6
      
      * skip_layernorm only works in ernie
      
      * fix unittest
      
      * fix unittest
  23. 09 Nov, 2020 1 commit
  24. 05 Nov, 2020 1 commit
    • Ernie varlen to 1.8 (#28400) · 78d68d59
      Committed by Shang Zhizhou
      * Fix TRT plugin registry without TRT lib (#25982)
      
      * fix trt plugin registry without trt lib
      
      * support trt4
      
      * refine code style
      
      * pick ea851796 from develop
      
      * cherry-pick develop PR #26273 && #27796
      
      * fix unittest error
      
      * fix unittest error
      
      * remove const_cast
      Co-authored-by: Pei Yang <peiyang@baidu.com>
  25. 13 Oct, 2020 1 commit
  26. 12 Oct, 2020 1 commit
  27. 10 Oct, 2020 3 commits
  28. 28 Sep, 2020 1 commit
  29. 27 Sep, 2020 2 commits
  30. 23 Sep, 2020 1 commit
    • Optimize slice trt plugin (#26970) (#27456) · 8e1712a7
      Committed by Pei Yang
      * optimize slice TRT plugin
      
      This patch removes an unnecessary barrier on the data transfer of the needed offset,
      so the data transfer can overlap with GPU kernel execution.
      
      This patch also fixes the incorrect name of the slice plugin, replacing
      "layernorm" with "slice".
      
      test=develop
      
      * add serialize/deserialize to slice plugin
      
      * add static shape slice trt plugin
      
      * fix slice trt op convertor dynamic shape bug
      
      * fix format by clang-format
      
      * fix pylint format error
      
      * fix problems commented by peiyang
      Co-authored-by: Ryan Jeng <rjeng@nvidia.com>
      Co-authored-by: Shang Zhizhou <shangzhizhou@baidu.com>
  31. 22 Sep, 2020 2 commits