1. 12 1月, 2021 1 次提交
    • C
      [Cherry-pick] Complex grad for matmul, kron and type promotion (#30304) · 7346edc2
      chentianyu03 提交于
      * complex gradient matmul  (#29966)
      
      * dot op support complex types
      
      * matmul support complex types
      
      * add test case
      
      * matmul broadcast gradient support complex
      
      * move conjFunctor to complex_functor.h
      
      * change the kron gradient when complex types (#29995)
      
      * type promotion for grad (#30177)
      
      * type promotion for grad
      
      * add type promotion for div op
      7346edc2
  2. 07 1月, 2021 1 次提交
  3. 29 12月, 2020 1 次提交
    • C
      [Cherry-pick] Complex network execute support (#29905) · 91ebc460
      Chen Weihang 提交于
      * [Complex] Add support for complex grad accumulated (#29889)
      
      * add support for complex grad accumulated
      
      * add unittest for coverage
      
      * update test dtype
      
      * remove useless blank line
      
      * [Complex] Handle complex to real after type promotion (#29855)
      
      * try to add fwd op input dtypes
      
      * refactor base impl
      
      * return tmp_ins after dygraph prepare data
      
      * fix typo found in debug
      
      * polish comment & add complex net test
      
      * revert detail change
      
      * fix unittest failed
      
      * add complex kernel condition control
      
      * fix xpu test failed & polish comment
      
      * polish details by review comments
      
      * Complex op test (#29753)
      
      * delete no need to calculate inputs in dygraph op_test
      
      * delete no need to calculate inputs in dygraph op_test
      
      * change grad elementwise_mul for complex types (#29757)
      
      * add conj op for complex types
      
      * add conj for complex types
      
      * add more test case
      
      * add conj_op test
      
      * modify conj api and impl
      
      * add complex type for fill_constant_op xpu
      
      * add setConstant for complex type
      
      * remove complex conj test file
      
      * user define grad for test_conj_op
      
      * add test case for static mode of conj api
      
      * modify conj doc
      
      * change input args name to x
      
      * remove useless codes
      
      * conj support real types
      
      * add conj test case for real number
      
      * delete no need to calculate inputs in dygraph op_test
      
      * delete no need to calculate inputs in dygraph op_test
      
      * modify grad of mul for complex types
      
      * fix the grads of inputs args order not match bug
      
      * change the grad of div when complex types (#29804)
      
      * change the grad of div when complex types
      
      * fix the grads of inputs args order not match bug
      Co-authored-by: Nchentianyu03 <chentianyu03@baidu.com>
      91ebc460
  4. 18 12月, 2020 1 次提交
    • C
      [Cherry-pick] Add complex api conj, real and imag (#29750) · ab5cc042
      Chen Weihang 提交于
      * Add complex dtype op (add) test example (#29603)
      
      
      * add op test case for complex
      
      * polish code details
      
      * add xpu set constant support
      
      * fix argument rror
      
      * remove useless pyc file
      
      * [Complex] Add real & imag op and api for complex tensor (#29672)
      
      * add complex real op & api & unittest
      
      * add imag op & api & unittest
      
      * refactor op impl
      
      * revert simplify writing due to complile failed
      
      * polish details
      
      * polish grad op code
      
      * add conj op for complex types (#29527)
      
      * add conj op for complex types
      
      * add conj for complex types
      
      * add more test case
      
      * add conj_op test
      
      * modify conj api and impl
      
      * add complex type for fill_constant_op xpu
      
      * add setConstant for complex type
      
      * remove complex conj test file
      
      * user define grad for test_conj_op
      
      * add test case for static mode of conj api
      
      * modify conj doc
      
      * change input args name to x
      
      * remove useless codes
      
      * conj support real types
      
      * add conj test case for real number
      Co-authored-by: Nchentianyu03 <chentianyu03@baidu.com>
      ab5cc042
  5. 01 12月, 2020 1 次提交
  6. 30 11月, 2020 1 次提交
  7. 27 11月, 2020 2 次提交
    • J
      Add eigen gru and fix the dropout bug in the rnn · 085260f3
      Jack Zhou 提交于
      Add eigen gru and fix the dropout bug in the rnn 
      085260f3
    • S
      detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01
      Shang Zhizhou 提交于
      * remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
      
      * comile with cuda9
      
      * add some unittest
      
      * notest;test=coverage
      
      * add unittest for trt plugin swish && split
      
      * update ernie unittest
      
      * fix some error message
      
      * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
      
      * fix comile errror when CUDA_ARCH_NAME < Pascal"
      
      * fix comile error
      
      * update unittest timeout
      
      * compile with cuda9
      
      * update error msg
      
      * fix code style
      
      * add some comments
      
      * add define IF_CUDA_ARCH_SUPPORT_FP16
      
      * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
      b9e76a01
  8. 25 11月, 2020 1 次提交
  9. 20 11月, 2020 2 次提交
    • J
      Add LSTM, Simple RNN and GRU CPU kernel (#28577) · 9362d85e
      Jack Zhou 提交于
      * add lstm, simple rnn op kernel
      
      * fix the test_lstm for the rnn op
      
      * change func name
      
      * fix forward postprocess bug
      
      * add gru forward, backward code
      
      * remove unittest.skipIf; use a big rnn op instead of combination op
      
      * fix input doesn't have gradient bug
      
      * add eigen lstm forward, backward
      Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com>
      9362d85e
    • Q
      adjust kunlun header file (#28536) · 30ef3815
      QingshuChen 提交于
      * adjust kunlun header file
      *test=kunlun
      
      * update kunlun unittest
      *test=kunlun
      
      * update xpu unitest
      * test = kunlun
      
      * update xpu unittest
      * test=kunlun
      
      * update xpu unitest
      * test=kunlun
      30ef3815
  10. 08 11月, 2020 1 次提交
    • Y
      exec ut no more than 15s 1 (#28439) · ba075632
      YUNSHEN XIE 提交于
      * disable ut test_parallel_executor_fetch_isolated_var,test=document_fix
      
      * test for limiting ut exec time as 15S
      
      * fix an error caused by cannot find ut
      
      * fix some error
      
      * can not find test_transformer
      
      * fix error caused by ut not run in windows
      
      * fix error caused by Compiler Options
      
      * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt
      
      * setting timeout value to 120s for old ut
      
      * add the timeout value setting
      
      * fix error caused by ut only run in coverage_ci
      
      * add analyzer_transformer_profile_tester
      
      * fix some error
      
      * fix some error
      
      * fix error with inference option
      
      * fix error with inference option setting as ON_INFER
      
      * add some ut to set timeout
      
      * modified some option
      
      * fix error
      
      * fix some timeout error
      
      * fix error
      
      * fix error
      
      * fix timeout for test_analyzer_bfloat16_resnet50
      
      * fix error
      
      * setting timeout properity for some ut
      
      * first pr for new ut timeout as 15S
      ba075632
  11. 03 11月, 2020 2 次提交
  12. 21 10月, 2020 1 次提交
  13. 20 10月, 2020 1 次提交
  14. 14 10月, 2020 1 次提交
  15. 10 10月, 2020 1 次提交
  16. 27 9月, 2020 1 次提交
    • L
      add support to float64 input of warpctc op. (#27399) · 1501a80f
      Li Fuchen 提交于
      * add float64 input to ctc_loss
      
      * modified error message of  warpctc
      
      * update repo and tag of warpctc
      
      * add test for warpctc with float64 input
      
      * modified warpctc.cmake to make sure build always
      
      * resolved sample code bug of warpctc
      
      * add core.ops in warpctc dygraph
      
      * fix a bug of test
      1501a80f
  17. 26 9月, 2020 1 次提交
  18. 25 9月, 2020 1 次提交
  19. 24 9月, 2020 2 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
    • Z
      Add GPU Kernels of Segment Ops, support, sum, max, min, mean · 4a9d21de
      Zhong Hui 提交于
      Add GPU Kernels of Segment Ops,  support, sum, max, min, mean
      4a9d21de
  20. 22 9月, 2020 1 次提交
  21. 18 9月, 2020 1 次提交
  22. 17 9月, 2020 1 次提交
  23. 16 9月, 2020 1 次提交
  24. 14 9月, 2020 2 次提交
  25. 10 9月, 2020 1 次提交
  26. 09 9月, 2020 2 次提交
  27. 08 9月, 2020 1 次提交
  28. 03 9月, 2020 1 次提交
  29. 28 8月, 2020 1 次提交
    • L
      Refine paddle.manual_seed (#26496) · 844583c8
      Leo Chen 提交于
      * refine manual seed
      
      * fix ci problem
      
      * fix unittests
      
      * fix unittest
      
      * set is_init_py=false in manual_seed
      
      * fix unittest
      
      * fix bernoulli_op
      
      * fix(unittest): change random_seed to manual_seed
      
      * 🐞fix(unittest): fix manual_seed
      
      * trigger ci
      
      * fix test_sentiment
      
      * fix test_imperative_save_load
      
      * fix test_uniform_random_op
      
      * fix test_uniform_random_op
      
      * fix test_jit_save_load
      
      * merge develop
      
      * fix manual_seed
      
      * fix manual_seed
      
      * use global engine
      
      * use shared_ptr
      
      * fix double free
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix test bug
      
      * fix test bug
      
      * fix test bug
      
      * fix ci
      844583c8
  30. 26 8月, 2020 1 次提交
  31. 24 8月, 2020 1 次提交
  32. 22 8月, 2020 1 次提交
  33. 21 8月, 2020 1 次提交
    • Q
      support Baidu Kunlun AI Accelerator (#25959) · 138ecf24
      QingshuChen 提交于
      * support Baidu AI Accelerator
        * test=kunlun
      
      * minor
       * test=kunlun
      
      * support xpu op in separate file
       * test=kunlun
      
      * update XPU error message and remove duplicated code
      
       * test=kunlun
      
      * minor
       * test=kunlun
      
      * minor
       * test=kunlun
      138ecf24
  34. 05 8月, 2020 1 次提交
    • P
      Fix registering trt plugin (#25744) · b717895f
      Pei Yang 提交于
      * develop dynamic shape serilization
      
      * add test param for gelu
      
      * fix bugs
      
      * delete redundant comments
      
      * debug
      
      * fix conflict. test=develop
      
      * fix bug. test=develop
      
      * add trt dynamic shape serialized support
      
      * fix ernie serialized bug
      test=develop
      
      * fix codestyle
      test=develop
      
      * fix bug
      test=develop
      
      * fix bug.test=develop
      
      * modify cmakelist test=develop
      
      * fix bug
      test=develop
      
      * fix error message.  test=develop
      
      * fix trt register plugin based on pr#25003
      
      * add trt dynload
      
      * fix deserialization bug of not finding plugin registration
      
      * refine code style
      
      * recover engine key in tensorrt_subgraph_pass
      
      * for ci coverage
      
      * add unittest for deserialization
      Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
      b717895f