1. 13 11月, 2020 1 次提交
    • S
      Skip layernorm to 1.8 (#28583) · ec672e88
      Shang Zhizhou 提交于
      * 裁剪transformer模型trt支持;修复tensorRT不支持DeletePass的bug (#28517)
      
      * skip_layernorm_op done
      
      * add unittest
      
      * slice op convertor support trt < 6
      
      * skip_layernorm only work in ernie
      
      * fix unittest
      
      * fix unittest
      ec672e88
  2. 09 11月, 2020 1 次提交
  3. 05 11月, 2020 1 次提交
    • S
      Ernie varlen to 1.8 (#28400) · 78d68d59
      Shang Zhizhou 提交于
      * Fix TRT plugin registry without TRT lib (#25982)
      
      * fix trt plugin registry without trt lib
      
      * support trt4
      
      * refine code style
      
      * pick ea851796 from develop
      
      * cherry-pick develop PR  #26273 && #27796
      
      * fix unittest error
      
      * fix unittest error
      
      * remove const_cast
      Co-authored-by: NPei Yang <peiyang@baidu.com>
      78d68d59
  4. 21 9月, 2020 1 次提交
  5. 18 9月, 2020 1 次提交
    • P
      [cherry-pick][Paddle-TRT] Stack op plugin (#25605) (#27365) · 4283be52
      Pei Yang 提交于
      * [Paddle-TRT] Stack op plugin (#25605)
      
      * add stack_op to CMakeLists
      
      * add dim=3 support for scale op
      
      * add trt stack op, test=develop
      
      * remove debug message
      
      * add stack plugin serialize
      
      * remove slice, scale op, will add later
      
      * enhence error message
      
      * revise trt ernie test to conver the stack op CI testi, test=develop
      
      * add stack op serialization
      
      * fix test shape after adding stack op
      
      * remove slice op, will add after implementing serialization
      
      * roll back to min_graph=5 to avoid using slice op
      
      * fix scale op output layer
      
      * implement stack op createPlugin
      
      * use workspace and move the defination to .cu
      
      * move stack plugin creator definition to .cu, test=develop
      
      * sync ut with develop
      Co-authored-by: Nzlsh80826 <zlsh80826@gmail.com>
      4283be52
  6. 06 8月, 2020 2 次提交
  7. 27 7月, 2020 1 次提交
  8. 01 7月, 2020 1 次提交
  9. 15 5月, 2020 1 次提交
  10. 30 4月, 2020 1 次提交
  11. 24 4月, 2020 1 次提交
  12. 23 4月, 2020 2 次提交
    • L
      1b45847e
    • Z
      [Cherry-pick]: 23974, 23723, 23984 (#24084) · 26a1def9
      Zhaolong Xing 提交于
      * Chery_pick:[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
      
      * refine ernie trt dynamic shape support
      1. add slice op converter
      2. add emb eltwise layernorm fp16 support
      test=develop
      
      * fix dynamic shape test ut
      test=develop
      
      * fix comments.
      test=develop
      
      * fix comments
      test=develop
      
      * cherry-pick [BUG]: Head number can only be > 1 on multihead op (#23974)
      
      * support the head number == 1
      test=develop
      
      * fix slice op error.
      test=develop
      
      * cherry-pick :disable trt test, test=develop (#23984)
      
      test=release/2.0-beta
      26a1def9
  13. 12 4月, 2020 1 次提交
  14. 09 4月, 2020 1 次提交
    • M
      Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a
      mozga-intel 提交于
      * Remove the NGraph engine from PDPD repository
      1. Each operator was removed from the operator's directory
      2. Each test was removed from the unittest directory
      3. The parallel executor support was removed from the PDPD
      4. The CMake file was removed from the PDPD
      5. The NG flags were removed from the repository
      test=develop
      
      * Remove ngraph from:
      1. Cmake file
      2. Python file
      test=develop
      3baaee9a
  15. 01 4月, 2020 1 次提交
  16. 26 3月, 2020 1 次提交
    • Z
      [Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099
      Zhaolong Xing 提交于
      * add dynamic plugin support.
      test=develop
      
      * change emb eltwise layernorm to math function
      test=develop
      
      * add emb eltwise layernorm
      test=develop
      
      * can run dynamic shape ernie
      test=develop
      
      * fix ci
      test=develop
      
      * add ut for trt ernie dynamic
      
      test=develop
      
      * refine dynamic shape c++ interface.
      test=develop
      
      * fix comments
      test=develop
      
      * fix comments
      test=develop
      430b0099
  17. 09 3月, 2020 1 次提交
  18. 06 3月, 2020 1 次提交
  19. 02 3月, 2020 1 次提交
  20. 26 2月, 2020 1 次提交
  21. 24 2月, 2020 1 次提交
  22. 14 2月, 2020 1 次提交
  23. 12 2月, 2020 1 次提交
  24. 05 2月, 2020 1 次提交
  25. 04 2月, 2020 1 次提交
  26. 25 1月, 2020 1 次提交
  27. 13 1月, 2020 2 次提交
  28. 11 1月, 2020 1 次提交
  29. 09 1月, 2020 1 次提交
  30. 07 1月, 2020 1 次提交
  31. 06 1月, 2020 1 次提交
  32. 24 12月, 2019 1 次提交
  33. 16 12月, 2019 1 次提交
  34. 12 12月, 2019 1 次提交
  35. 10 12月, 2019 2 次提交
    • A
      MKL-DNN 1.0 Update (#20162) · e81f0228
      Adam 提交于
      * MKLDNN v1.0 rebase to Paddle 1.6
      test=develop
      
      * Add hacky paddle::string::to_string() implementation
      
      * vectorize<int64-t>() -> vectorize() cleanup
      test=develop
      
      * PADDLE_ENFORCE and void_cast fixes
      test=develop
      
      * Rebase changes
      test=develop
      
      * Cosmetics
      test=develop
      
      * Delete MKL from mkldnn.cmake
      test=develop
      
      * CMake debug commands
      test=develop
      
      * Delete MKLDNN_VERBOSE and rebase fixes
      test=develop
      
      * Rebase fixes
      test=develop
      
      * Temporarily disable int8 resnet101 vgg16 and vgg19 tests
      test=develop
      
      * Add libmkldnn.so.1 to python setup
      test=develop
      
      * Add libmkldnn.so.1 to inference_lib cmake after rebase
      test=develop
      
      * Post rebase fixes + FC int8 changes
      test=develop
      
      * Fix LRN NHWC
      test=develop
      
      * Fix NHWC conv3d
      test=develop
      
      * Windows build fix + next conv3d fix
      test=develop
      
      * Fix conv2d on AVX2 machines
      test=develop
      e81f0228
    • R
      fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461) · 7f5d532a
      rensilin 提交于
      * ZeroCopyTensor::mutable_data in the right device, test=develop
      
      * add unittest for zerocopy, test=develop
      7f5d532a
  36. 09 12月, 2019 1 次提交
    • L
      QAT Int8 document (#21360) · fbf9eca0
      lidanqing 提交于
      * update benchmark for int8v2, QAT1, QAT2 accuracy and performance
      test=document_fix
      
      * change according to reviews
      test=develop test=document_fix
      
      * improve some descriptions and some models
      test=develop test=document_fix
      
      * update models benchmark data
      test=develop test=document_fix
      
      * update int8v2 and qat2 performance
      test=develop test=document_fix
      fbf9eca0