1. 22 8月, 2020 1 次提交
  2. 21 8月, 2020 1 次提交
    • Q
      support Baidu Kunlun AI Accelerator (#25959) · 138ecf24
      QingshuChen 提交于
      * support Baidu AI Accelerator
        * test=kunlun
      
      * minor
       * test=kunlun
      
      * support xpu op in separate file
       * test=kunlun
      
      * update XPU error message and remove duplicated code
      
       * test=kunlun
      
      * minor
       * test=kunlun
      
      * minor
       * test=kunlun
      138ecf24
  3. 05 8月, 2020 1 次提交
    • P
      Fix registering trt plugin (#25744) · b717895f
      Pei Yang 提交于
      * develop dynamic shape serilization
      
      * add test param for gelu
      
      * fix bugs
      
      * delete redundant comments
      
      * debug
      
      * fix conflict. test=develop
      
      * fix bug. test=develop
      
      * add trt dynamic shape serialized support
      
      * fix ernie serialized bug
      test=develop
      
      * fix codestyle
      test=develop
      
      * fix bug
      test=develop
      
      * fix bug.test=develop
      
      * modify cmakelist test=develop
      
      * fix bug
      test=develop
      
      * fix error message.  test=develop
      
      * fix trt register plugin based on pr#25003
      
      * add trt dynload
      
      * fix deserialization bug of not finding plugin registration
      
      * refine code style
      
      * recover engine key in tensorrt_subgraph_pass
      
      * for ci coverage
      
      * add unittest for deserialization
      Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
      b717895f
  4. 03 8月, 2020 1 次提交
  5. 31 7月, 2020 1 次提交
    • S
      fix inverse bug (#25641) · bca30316
      ShenLiang 提交于
      * fix inverse bug, test=develop
      
      * fix the untest, test=develop
      
      * add singular checking, test=develop
      
      * fix the utest, test=develop
      
      * use memory::copy, test=develop
      
      * fix bost_get, test=develop
      
      * fix position, test=develop
      bca30316
  6. 23 7月, 2020 1 次提交
  7. 21 7月, 2020 1 次提交
  8. 11 7月, 2020 1 次提交
  9. 10 7月, 2020 1 次提交
  10. 07 7月, 2020 1 次提交
  11. 17 6月, 2020 2 次提交
  12. 12 6月, 2020 1 次提交
  13. 03 6月, 2020 2 次提交
  14. 11 5月, 2020 1 次提交
    • C
      Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f
      Chen Weihang 提交于
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
      aa0f254f
  15. 27 4月, 2020 1 次提交
  16. 24 4月, 2020 2 次提交
    • G
      Add cholesky_op (#23543) · a8c0fb4e
      Guo Sheng 提交于
      * Add cholesky_op forward part. test=develop
      
      * Complete cholesky_op forward part. test=develop
      
      * Add cholesky_op backward part. test=develop
      
      * Complete cholesky_op backward part. test=develop
      
      * Refine cholesky_op error check and docs. test=develop
      
      * Add grad_check unit test for cholesky_op. test=develop
      
      * Fix sample code in cholesky doc. test=develop
      
      * Refine some error messages of cholesky_op. test=develop
      
      * Refine some error messages of cholesky_op. test=develop
      
      * Remove unused input in cholesky_grad. test=develop
      
      * Remove unused input in cholesky_grad. test=develop
      
      * Fix stream for cusolverDnSetStream. test=develop
      
      * Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code.
      test=develop
      
      * Add CUSOLVER ERROR in enforce.h
      test=develop
      
      * Fix the missing return value in cholesky. test=develop
      a8c0fb4e
    • Z
      fix compilation failure (#24091) · ab2e2842
      Zeng Jinle 提交于
      ab2e2842
  17. 20 4月, 2020 1 次提交
    • Z
      Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei 提交于
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
      
      * remove build_ex_string,test=develop
      
      * merge conflict,test=develop
      78170037
  18. 17 4月, 2020 1 次提交
  19. 10 4月, 2020 1 次提交
  20. 26 3月, 2020 1 次提交
    • Z
      [Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099
      Zhaolong Xing 提交于
      * add dynamic plugin support.
      test=develop
      
      * change emb eltwise layernorm to math function
      test=develop
      
      * add emb eltwise layernorm
      test=develop
      
      * can run dynamic shape ernie
      test=develop
      
      * fix ci
      test=develop
      
      * add ut for trt ernie dynamic
      
      test=develop
      
      * refine dynamic shape c++ interface.
      test=develop
      
      * fix comments
      test=develop
      
      * fix comments
      test=develop
      430b0099
  21. 11 3月, 2020 1 次提交
  22. 28 2月, 2020 1 次提交
  23. 23 2月, 2020 1 次提交
  24. 10 2月, 2020 1 次提交
  25. 07 2月, 2020 1 次提交
  26. 06 2月, 2020 1 次提交
    • Y
      Correct the use of DeviceContext in unittest sequence_pooling_test and... · 44b45b9f
      Yiqun Liu 提交于
      Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)
      
      * Add log in memory::Copy for debug purpose.
      
      * Change to use context in DeviceContextPool directly in sequence_pooling_test, instead to new one.
      
      * Change to use context in DeviceContextPool directly in sequence_padding_test, instead to new one.
      test=develop
      
      * Change the type of second_dim from size_t to int64_t.
      test=develop
      44b45b9f
  27. 19 1月, 2020 1 次提交
  28. 07 1月, 2020 1 次提交
  29. 04 1月, 2020 1 次提交
  30. 23 12月, 2019 1 次提交
  31. 11 12月, 2019 1 次提交
  32. 02 12月, 2019 1 次提交
  33. 28 11月, 2019 1 次提交
  34. 27 11月, 2019 1 次提交
  35. 26 11月, 2019 1 次提交
    • G
      Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) · 234060f8
      GaoWei8 提交于
      * Add fc padding to solve mkl performance
      test=develop
      
      * fix gpu pass and error information
      test=develop
      
      * fix fc_fuse_pass_test
      test=develop
      
      * fix error information
      test=develop
      
      * fix error information
      test=develop
      
      * fix name and add fc op padding test
      test=develop
      
      * fix attributes
      test=develop
      
      * optimize fc padding
      test=develop
      
      * fix test
      test=develop
      234060f8
  36. 22 11月, 2019 1 次提交
    • L
      add dequantize_abs_max op and modify lookup_table op (#20899) · f0b15184
      Liufang Sang 提交于
      * add int8 kernel to lookup_table op and add dequantize op test=develop
      
      * change paddle_enforce to paddle_enforce_eq test=develop
      
      * change copyright and change some not suitable code test=develop
      
      * remove debug log test=develop
      
      * replace GetInputType with IndicateVarDataType test=develop
      
      * fix EmptyGradMaker test=develop
      
      * fix diff between cpu and gpu test=develop
      
      * use memcopy when int8_t test=develop
      f0b15184
  37. 14 11月, 2019 1 次提交