1. 11 12月, 2020 2 次提交
  2. 07 12月, 2020 1 次提交
    • S
      Fix unittest (#29412) · 225a9c4e
      Shang Zhizhou 提交于
      * fix tensorrt unittest precision error
      
      * fix unittest precision error. test_trt_subgraph_pass && test_trt_dynamic_shape_transformer_prune
      225a9c4e
  3. 02 12月, 2020 1 次提交
  4. 30 11月, 2020 1 次提交
  5. 27 11月, 2020 1 次提交
    • S
      detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01
      Shang Zhizhou 提交于
      * remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
      
      * comile with cuda9
      
      * add some unittest
      
      * notest;test=coverage
      
      * add unittest for trt plugin swish && split
      
      * update ernie unittest
      
      * fix some error message
      
      * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
      
      * fix comile errror when CUDA_ARCH_NAME < Pascal"
      
      * fix comile error
      
      * update unittest timeout
      
      * compile with cuda9
      
      * update error msg
      
      * fix code style
      
      * add some comments
      
      * add define IF_CUDA_ARCH_SUPPORT_FP16
      
      * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
      b9e76a01
  6. 26 11月, 2020 1 次提交
  7. 13 11月, 2020 1 次提交
  8. 12 11月, 2020 1 次提交
  9. 08 11月, 2020 1 次提交
    • Y
      exec ut no more than 15s 1 (#28439) · ba075632
      YUNSHEN XIE 提交于
      * disable ut test_parallel_executor_fetch_isolated_var,test=document_fix
      
      * test for limiting ut exec time as 15S
      
      * fix an error caused by cannot find ut
      
      * fix some error
      
      * can not find test_transformer
      
      * fix error caused by ut not run in windows
      
      * fix error caused by Compiler Options
      
      * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt
      
      * setting timeout value to 120s for old ut
      
      * add the timeout value setting
      
      * fix error caused by ut only run in coverage_ci
      
      * add analyzer_transformer_profile_tester
      
      * fix some error
      
      * fix some error
      
      * fix error with inference option
      
      * fix error with inference option setting as ON_INFER
      
      * add some ut to set timeout
      
      * modified some option
      
      * fix error
      
      * fix some timeout error
      
      * fix error
      
      * fix error
      
      * fix timeout for test_analyzer_bfloat16_resnet50
      
      * fix error
      
      * setting timeout properity for some ut
      
      * first pr for new ut timeout as 15S
      ba075632
  10. 03 11月, 2020 1 次提交
    • S
      TensorRT中ernie模型推理性能优化,支持变长输入 (#28367) · ea851796
      Shang Zhizhou 提交于
      * fp16 result ok
      
      * change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS
      
      * auto detect special slice op converter for ernie with trt oss
      
      * ernie oss only support fp16
      
      * fix special_slice_plugin serialize bug
      
      * matmul in tensorrt ok
      
      * ernie unittest ok
      
      * add matmul tensorrt unittest
      
      * remove demo code
      ea851796
  11. 29 10月, 2020 2 次提交
  12. 21 10月, 2020 1 次提交
  13. 15 10月, 2020 1 次提交
  14. 13 10月, 2020 1 次提交
  15. 01 10月, 2020 1 次提交
  16. 28 9月, 2020 1 次提交
  17. 25 9月, 2020 1 次提交
  18. 23 9月, 2020 1 次提交
  19. 18 9月, 2020 1 次提交
  20. 17 9月, 2020 1 次提交
  21. 16 9月, 2020 1 次提交
  22. 15 9月, 2020 2 次提交
  23. 12 9月, 2020 1 次提交
    • L
      Fix GRU mkldnn kernel fail on look_table_v2 (#27198) · 5c4eed66
      lidanqing 提交于
      * Fix the lookup_table_v2 failed on GRU mkldnn kernel issue
      test=develop
      
      * fix according to reviews, removed x_num_col_dims
      test=develop
      
      * update gru model. change according to reviews
      test=develop
      
      * change according to reviews
      test=develop
      5c4eed66
  24. 11 9月, 2020 1 次提交
  25. 08 9月, 2020 1 次提交
  26. 07 9月, 2020 2 次提交
  27. 02 9月, 2020 3 次提交
  28. 01 9月, 2020 2 次提交
    • Z
      [Paddle-TRT] Stack op plugin (#25605) · ad6e3dd6
      zlsh80826 提交于
      * add stack_op to CMakeLists
      
      * add dim=3 support for scale op
      
      * add trt stack op, test=develop
      
      * remove debug message
      
      * add stack plugin serialize
      
      * remove slice, scale op, will add later
      
      * enhence error message
      
      * revise trt ernie test to conver the stack op CI testi, test=develop
      
      * add stack op serialization
      
      * fix test shape after adding stack op
      
      * remove slice op, will add after implementing serialization
      
      * roll back to min_graph=5 to avoid using slice op
      
      * fix scale op output layer
      
      * implement stack op createPlugin
      
      * use workspace and move the defination to .cu
      
      * move stack plugin creator definition to .cu, test=develop
      ad6e3dd6
    • Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854) · ced6e87e
      石晓伟 提交于
      This reverts commit 02083bda.
      ced6e87e
  29. 31 8月, 2020 1 次提交
  30. 28 8月, 2020 2 次提交
  31. 27 8月, 2020 1 次提交
  32. 25 8月, 2020 1 次提交