1. 01 4月, 2021 5 次提交
    • H
      remove useless code (#32001) · 9c5d0286
      hutuxian 提交于
      9c5d0286
    • Z
      [Paddle-TRT] add anchor generator op plugin (#31730) · b807e408
      zlsh80826 提交于
      * add anchor generator op plugin
      
      * add anchor generator unit_test
      
      * remove dbg info
      
      * remove redundant line
      
      * replace assertion with paddle enforce
      
      * dynamic plugin replaces assertion with paddle enforce
      
      * anchor generator support dynamic shape on spatial axis
      
      * anchor generator test with fp16, dynamic shape
      
      * add anchor generator test all
      
      * add back main
      
      * reduce test input size to not exceed the timelimit of ci
      
      * change super to InferencePassTest for python2 compatibility
      
      * reuse paddle operator anchor generator
      
      * move creator construct to header with default
      
      * add cuda ifdef
      
      * reduce line
      
      * change super to InferencePassTest for python2 compatibility
      
      * fix anchor generator fp16 serialize setting
      
      * split unittest from test_all
      
      * restrict anchor generator input format before version 7234
      
      * anchor generator only support greater than trt7.1
      
      * change min_graph_size to 2
      
      * min_graph size to 3 if dynamic shape
      
      * reduce dynamic shape size to avoid trt search tactic too long to exceed time limit
      
      * remove anchor from fetch list
      
      * anchor generator support all trt version
      
      * fix memory not allocated but if serialized
      b807e408
    • Z
      Optimize the perf of SameDimsAdd CUDA Kernel (#31872) · 4acc87be
      Zhang Zheng 提交于
      4acc87be
    • Z
      Support uint8_t for fill_constant_op (#31911) · 980227f9
      Zhang Zheng 提交于
      980227f9
    • K
      new group (#31682) · 07741593
      kuizhiqing 提交于
      * new group
      
      * ci compatible fix
      
      * assert nccl
      07741593
  2. 31 3月, 2021 7 次提交
    • K
      fix one error massage (#31904) · 6f85e241
      Kqnonrime 提交于
      * fix one error massage
      
      * fix a error message
      
      * new fix three error messages
      
      * new fix three error messages
      
      * new fix some error
      
      * new fix one error message
      6f85e241
    • T
      delete cuda9 code (#31883) · ea738dda
      tianshuo78520a 提交于
      ea738dda
    • W
      Update eigen version to f612df27 (#31832) · 495e7f9c
      wuhuanzhou 提交于
      * update eigen version to f612df27, test=develop
      
      * fix compilation error, test=develop
      
      * remove patch command in eigen, test=develop
      
      * fix compilation error caused by call Eigen function with float16 and bfloat16, test=develop
      
      * fix unittest error, test=develop
      
      * fix unittest error caused by precision, test=develop
      
      * remove patch files used by old version eigen, test=develop
      495e7f9c
    • W
      update compilation with C++14 (#31815) · 587d99ae
      wuhuanzhou 提交于
      * update compilation with C++14, test=develop
      
      * fix compilation error in eigen, test=develop
      587d99ae
    • T
      fix split core (#31892) · 393b3bd6
      Thunderbrook 提交于
      * fix split core
      
      * format
      393b3bd6
    • T
      fix some bug in transformer training in xpu (#31918) · 52b05bac
      taixiurong 提交于
      52b05bac
    • F
      [ROCM] Add ROCm support for warpctc op (#31817) · ef8323d4
      furnace 提交于
      * bugfix for warpctc
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix WARPCTC_WITH_HIP invalid
      
      * Add logs to find out why can not dlopen libwarpctc.so
      
      * fix warpctc commit id
      
      * fix unit test test_warpctc_op
      
      * Optime failed log for dlopen
      
      * Optime failed log for dlopen
      
      * Delete extra changes
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * Add is_compiled_with_rocm for test_warpctc_op
      
      * fix warpctc commit id
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * fix code style problems
      ef8323d4
  3. 30 3月, 2021 2 次提交
  4. 29 3月, 2021 3 次提交
  5. 26 3月, 2021 2 次提交
  6. 25 3月, 2021 2 次提交
  7. 24 3月, 2021 3 次提交
  8. 23 3月, 2021 2 次提交
  9. 22 3月, 2021 1 次提交
  10. 21 3月, 2021 2 次提交
  11. 19 3月, 2021 7 次提交
  12. 17 3月, 2021 1 次提交
  13. 16 3月, 2021 1 次提交
  14. 15 3月, 2021 2 次提交