1. 20 1月, 2021 1 次提交
    • C
      make abs op support complex types (#30375) · 358106fc
      chentianyu03 提交于
      * rewrite abs op
      
      * rewrite abs op and remove abs in activation
      
      * remove abs register in old codes
      
      * fix abs_grad type error
      
      * fix abs double_grad output name error
      
      * modify abs_grad, abs_grad_grad functor for windows building
      
      * format code style
      
      * fix the bug of result is nan when the divisor is zero
      
      * add missing abs attr and add abs for float16
      358106fc
  2. 06 1月, 2021 1 次提交
  3. 31 12月, 2020 1 次提交
    • C
      complex gradient matmul (#29966) · e012930a
      chentianyu03 提交于
      * dot op support complex types
      
      * matmul support complex types
      
      * add test case
      
      * matmul broadcast gradient support complex
      
      * move conjFunctor to complex_functor.h
      e012930a
  4. 25 12月, 2020 1 次提交
  5. 17 12月, 2020 1 次提交
  6. 16 12月, 2020 1 次提交
    • Y
      添加rocm平台支持代码 (#29342) · 76738504
      Y_Xuan 提交于
      * 添加rocm平台支持代码
      
      * 修改一些问题
      
      * 修改一些歧义并添加备注
      
      * 修改代码格式
      
      * 解决冲突后的代码修改
      
      * 修改operators.cmake
      
      * 修改格式
      
      * 修正错误
      
      * 统一接口
      
      * 修改日期
      76738504
  7. 15 12月, 2020 1 次提交
  8. 11 12月, 2020 2 次提交
    • L
      remove duplicated macro (#29563) · 1e72e032
      Leo Chen 提交于
      1e72e032
    • L
      Add the strategy of skipping cc/cu test compilation and execution in CI (#29499) · b5d4a1f3
      LoveAn 提交于
      * Add the strategy of skipping cc/cu test compilation and execution in CI, test=develop
      
      * fix if error with CI_SKIP_TEST, test=develop
      
      * fix add properties to test error on Linux/MAC, test=develop
      
      * fix set test properties of test_code_generator error, test=develop
      
      * remove test codes and advance judgment of file modification on Linux, test=develop
      
      * rename CI_SKIP_TEST to CI_SKIP_CPP_TEST, test=document_fix
      
      * Add branch judgement on Linux, test=develop
      b5d4a1f3
  9. 08 12月, 2020 1 次提交
  10. 07 12月, 2020 1 次提交
    • L
      Compiling operator libraries with Unity build (#29130) · 671555ed
      LoveAn 提交于
      * Compiling operator libraries with Unity Build on Windows CPU.
      
      * Compiling operator libraries with Unity Build on Windows GPU, no_test, test=windows_ci
      
      * Add option in windows ci script, no_test, test=windows_ci
      
      * Optimize parallel compiling, test=develop
      
      * remove limit of parallel compile and skip some ops in UB, test=develop
      
      * remove changes of header file, test=develop
      
      * remove changes of header file, test=develop
      
      * fix test_eye_op unittest failed, test=develop
      
      * Compiling operator libraries with Unity Build on Linux, test=develop
      
      * set default WITH_UNITY_BUILD=OFF, test=develop
      
      * Move unity build rules into a single file and add comment, test=develop
      
      * optimize parallel compilation, test=develop
      
      * fix undefined reference error on coverage ci, test=develop
      671555ed
  11. 01 12月, 2020 1 次提交
  12. 30 11月, 2020 1 次提交
  13. 27 11月, 2020 2 次提交
    • J
      Add eigen gru and fix the dropout bug in the rnn · 085260f3
      Jack Zhou 提交于
      Add eigen gru and fix the dropout bug in the rnn 
      085260f3
    • S
      detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01
      Shang Zhizhou 提交于
      * remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
      
      * comile with cuda9
      
      * add some unittest
      
      * notest;test=coverage
      
      * add unittest for trt plugin swish && split
      
      * update ernie unittest
      
      * fix some error message
      
      * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
      
      * fix comile errror when CUDA_ARCH_NAME < Pascal"
      
      * fix comile error
      
      * update unittest timeout
      
      * compile with cuda9
      
      * update error msg
      
      * fix code style
      
      * add some comments
      
      * add define IF_CUDA_ARCH_SUPPORT_FP16
      
      * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
      b9e76a01
  14. 25 11月, 2020 1 次提交
  15. 20 11月, 2020 2 次提交
    • J
      Add LSTM, Simple RNN and GRU CPU kernel (#28577) · 9362d85e
      Jack Zhou 提交于
      * add lstm, simple rnn op kernel
      
      * fix the test_lstm for the rnn op
      
      * change func name
      
      * fix forward postprocess bug
      
      * add gru forward, backward code
      
      * remove unittest.skipIf; use a big rnn op instead of combination op
      
      * fix input doesn't have gradient bug
      
      * add eigen lstm forward, backward
      Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com>
      9362d85e
    • Q
      adjust kunlun header file (#28536) · 30ef3815
      QingshuChen 提交于
      * adjust kunlun header file
      *test=kunlun
      
      * update kunlun unittest
      *test=kunlun
      
      * update xpu unitest
      * test = kunlun
      
      * update xpu unittest
      * test=kunlun
      
      * update xpu unitest
      * test=kunlun
      30ef3815
  16. 08 11月, 2020 1 次提交
    • Y
      exec ut no more than 15s 1 (#28439) · ba075632
      YUNSHEN XIE 提交于
      * disable ut test_parallel_executor_fetch_isolated_var,test=document_fix
      
      * test for limiting ut exec time as 15S
      
      * fix an error caused by cannot find ut
      
      * fix some error
      
      * can not find test_transformer
      
      * fix error caused by ut not run in windows
      
      * fix error caused by Compiler Options
      
      * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt
      
      * setting timeout value to 120s for old ut
      
      * add the timeout value setting
      
      * fix error caused by ut only run in coverage_ci
      
      * add analyzer_transformer_profile_tester
      
      * fix some error
      
      * fix some error
      
      * fix error with inference option
      
      * fix error with inference option setting as ON_INFER
      
      * add some ut to set timeout
      
      * modified some option
      
      * fix error
      
      * fix some timeout error
      
      * fix error
      
      * fix error
      
      * fix timeout for test_analyzer_bfloat16_resnet50
      
      * fix error
      
      * setting timeout properity for some ut
      
      * first pr for new ut timeout as 15S
      ba075632
  17. 03 11月, 2020 2 次提交
  18. 21 10月, 2020 1 次提交
  19. 20 10月, 2020 1 次提交
  20. 14 10月, 2020 1 次提交
  21. 10 10月, 2020 1 次提交
  22. 27 9月, 2020 1 次提交
    • L
      add support to float64 input of warpctc op. (#27399) · 1501a80f
      Li Fuchen 提交于
      * add float64 input to ctc_loss
      
      * modified error message of  warpctc
      
      * update repo and tag of warpctc
      
      * add test for warpctc with float64 input
      
      * modified warpctc.cmake to make sure build always
      
      * resolved sample code bug of warpctc
      
      * add core.ops in warpctc dygraph
      
      * fix a bug of test
      1501a80f
  23. 26 9月, 2020 1 次提交
  24. 25 9月, 2020 1 次提交
  25. 24 9月, 2020 2 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
    • Z
      Add GPU Kernels of Segment Ops, support, sum, max, min, mean · 4a9d21de
      Zhong Hui 提交于
      Add GPU Kernels of Segment Ops,  support, sum, max, min, mean
      4a9d21de
  26. 22 9月, 2020 1 次提交
  27. 18 9月, 2020 1 次提交
  28. 17 9月, 2020 1 次提交
  29. 16 9月, 2020 1 次提交
  30. 14 9月, 2020 2 次提交
  31. 10 9月, 2020 1 次提交
  32. 09 9月, 2020 2 次提交
  33. 08 9月, 2020 1 次提交