1. 12 9月, 2020 1 次提交
    • L
      Fix GRU mkldnn kernel fail on look_table_v2 (#27198) · 5c4eed66
      lidanqing 提交于
      * Fix the lookup_table_v2 failed on GRU mkldnn kernel issue
      test=develop
      
      * fix according to reviews, removed x_num_col_dims
      test=develop
      
      * update gru model. change according to reviews
      test=develop
      
      * change according to reviews
      test=develop
      5c4eed66
  2. 11 9月, 2020 2 次提交
  3. 10 9月, 2020 1 次提交
  4. 09 9月, 2020 1 次提交
  5. 08 9月, 2020 1 次提交
  6. 07 9月, 2020 3 次提交
  7. 03 9月, 2020 1 次提交
  8. 02 9月, 2020 4 次提交
  9. 01 9月, 2020 2 次提交
    • Z
      [Paddle-TRT] Stack op plugin (#25605) · ad6e3dd6
      zlsh80826 提交于
      * add stack_op to CMakeLists
      
      * add dim=3 support for scale op
      
      * add trt stack op, test=develop
      
      * remove debug message
      
      * add stack plugin serialize
      
      * remove slice, scale op, will add later
      
      * enhence error message
      
      * revise trt ernie test to conver the stack op CI testi, test=develop
      
      * add stack op serialization
      
      * fix test shape after adding stack op
      
      * remove slice op, will add after implementing serialization
      
      * roll back to min_graph=5 to avoid using slice op
      
      * fix scale op output layer
      
      * implement stack op createPlugin
      
      * use workspace and move the defination to .cu
      
      * move stack plugin creator definition to .cu, test=develop
      ad6e3dd6
    • Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854) · ced6e87e
      石晓伟 提交于
      This reverts commit 02083bda.
      ced6e87e
  10. 31 8月, 2020 1 次提交
  11. 30 8月, 2020 1 次提交
  12. 29 8月, 2020 1 次提交
  13. 28 8月, 2020 3 次提交
  14. 27 8月, 2020 1 次提交
  15. 25 8月, 2020 2 次提交
  16. 21 8月, 2020 1 次提交
  17. 19 8月, 2020 2 次提交
  18. 14 8月, 2020 1 次提交
  19. 12 8月, 2020 1 次提交
  20. 11 8月, 2020 1 次提交
    • L
      GRU model xnli dataset C++ tester (#25534) · 65b97d62
      lidanqing 提交于
      * Add laxical GRU unit test
      
      performance works
      
      * Get model accuracy
      
      * model and data name to be confirmed
      test=develop
      
      * update model name and output format
      test=develop
      
      * update according to reviews
      test=develop
      
      * add accuracy check
      
      * accuracy check between native and analysis
      test=develop
      
      * fix a reading bug, fix gru passes sequence
      test=develop
      
      * fix passes sequence
      test=develop
      65b97d62
  21. 08 8月, 2020 1 次提交
  22. 07 8月, 2020 1 次提交
  23. 05 8月, 2020 1 次提交
    • P
      Fix registering trt plugin (#25744) · b717895f
      Pei Yang 提交于
      * develop dynamic shape serilization
      
      * add test param for gelu
      
      * fix bugs
      
      * delete redundant comments
      
      * debug
      
      * fix conflict. test=develop
      
      * fix bug. test=develop
      
      * add trt dynamic shape serialized support
      
      * fix ernie serialized bug
      test=develop
      
      * fix codestyle
      test=develop
      
      * fix bug
      test=develop
      
      * fix bug.test=develop
      
      * modify cmakelist test=develop
      
      * fix bug
      test=develop
      
      * fix error message.  test=develop
      
      * fix trt register plugin based on pr#25003
      
      * add trt dynload
      
      * fix deserialization bug of not finding plugin registration
      
      * refine code style
      
      * recover engine key in tensorrt_subgraph_pass
      
      * for ci coverage
      
      * add unittest for deserialization
      Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
      b717895f
  24. 03 8月, 2020 1 次提交
  25. 30 7月, 2020 1 次提交
  26. 28 7月, 2020 2 次提交
  27. 24 7月, 2020 1 次提交
  28. 22 7月, 2020 1 次提交
    • supports xpu runtime, test=develop (#25554) · 72064172
      石晓伟 提交于
      * update ResetHolder, test=develop
      
      * add TensorShare for lite engine, test=develop
      
      * tensor data changed from copying to sharing, test=develop
      
      * supports xpu runtime, test=develop
      
      * fix code styles, test=develop
      72064172