1. 29 3月, 2020 1 次提交
    • Z
      Improve elementwise performance. (#23001) · 58615a62
      zhaoyuchen2018 提交于
      * Improve elementwise performance.
      
      Elementwise performace is poor as walk into CommonGradBroadcastCUDA, add some new kernels for different data pattern.
      
      * Add some cuda kernel to speedup common broadcast cases. test=develop
      
      * Add more test cases and fix cuda kernel bug. test=develop
      
      * Remove tests as cpu percision fails.test=develop
      
      * Refine SplitDims, test=develop
      
      * Change file mode, test=develop
      58615a62
  2. 25 3月, 2020 1 次提交
  3. 17 1月, 2020 1 次提交
  4. 19 11月, 2019 1 次提交
  5. 10 10月, 2019 1 次提交
  6. 04 9月, 2019 1 次提交
  7. 20 8月, 2019 1 次提交
  8. 14 6月, 2019 1 次提交
  9. 20 5月, 2019 1 次提交
    • L
      Double backward elementwise div (#17416) · 10b23a72
      lvmengsi 提交于
      * double backward, elementwise_div
      
      * fix dx empty. test=develop
      
      * bug fix (#17392)
      
      fix secure bug
      
      * Eanble stack operator for a Ngraph, test=develop (#17406)
      
      * fix sqrt_grad_grad unittest. test=develop (#17410)
      
      * fix sqrt_grad_grad unittest. test=develop
      
      * disable sqrt_grad_grad unittest. test=develop
      
      * test=develop, fix unittest
      
      * test=develop, fix unittest
      
      * test=develop, fix unittest
      
      * test=develop, fix bug
      
      * fix unittest. test=develop
      
      * fix unittest dx. test=develop
      
      * tmp fix! for test... test=develop
      
      * reduce tmp, test=develop
      
      * test=develop, reduce tmp
      
      * fix broadcast unittest. test=develop
      
      * fix format. test=develop
      
      * refine code. test=develop
      
      * refine code. test=develop
      
      * refine GetDoubleGradSafeTensor. test=develop
      
      * fix format. test=develop
      10b23a72
  10. 13 5月, 2019 1 次提交
    • K
      add double grad for elementwise_mul op (#17255) · 8bae8590
      Kaipeng Deng 提交于
      * add double grad for elementwise_mul. test=develop
      
      * remove comment. test=develop
      
      * fix grad sum. test=develop
      
      * fix for axis expand. test=develop
      
      * add test for axis expand. test=develop
      8bae8590
  11. 08 5月, 2019 1 次提交
  12. 24 1月, 2019 1 次提交
  13. 16 11月, 2018 1 次提交
    • W
      Refine operator cmake (#14413) · a2d9b344
      Wu Yi 提交于
      * wip simplify operator framework
      
      * wip
      
      * wip
      
      * done test=develop
      
      * clean test=develop
      
      * fix test=develop
      
      * fix deps test=develop
      
      * fix cpu build test=develop
      
      * fix tensorrt build test=develop
      
      * fix tests test=develop
      
      * fix test=develop
      
      * fix cpu build test=develop
      a2d9b344
  14. 14 11月, 2018 1 次提交
  15. 08 11月, 2018 1 次提交
  16. 07 11月, 2018 1 次提交
    • C
      Add fp16 backward support (#14202) · a9b5d42d
      chengduo 提交于
      * add fp16 backward support
      test=develop
      
      * add sum_op fp16 test
      
      * disable test_dist_save_load
      test=develop
      
      * add check_grad for sum
      
      * add unit test for softmax_grad fp16
      test=develop
      
      * add scale_op unit test
      
      * add mul_grad_op unit test for fp16
      
      * add cross_entropy_grad and eman_grad unit test for fp16
      test=develop
      
      * fix cross_entropy unit test
      
      * add pool2d fp16 unit test
      
      * refine conv2d fp16 unit test
      test=develop
      
      * refine activation unit test
      test=develop
      
      * fix ci
      test=develop
      
      * follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
      test=develop
      a9b5d42d
  17. 05 11月, 2018 1 次提交
  18. 14 10月, 2018 1 次提交
  19. 20 9月, 2018 1 次提交
    • C
      Feature/op_fuse_pass (#12440) · d402234b
      chengduo 提交于
      * Add Preface
      
      * Add demo code
      
      * Save file
      
      * Refine code
      
      * seems can work
      
      * use elementwise strategy
      
      * Use ElementwiseComputeEx
      
      * Add comments
      
      * extract functions from operator
      
      * Refine code
      
      * Follow comment
      
      * code refine
      
      * add op_fuse  pass
      
      * add backward
      
      * code refine
      
      * use TopologySortOperations
      
      * follow comments
      
      * refine IsFusible
      
      * code enhance
      
      * fix op_fusion_pass
      
      * refine code
      
      * refine fuse_elemwise_act_op
      
      * adjust the input and output
      
      * refine logic
      
      * add intermediate_edge
      
      * disable inplace
      
      * follow comments
      
      * refine logic
      
      * follow comments
      
      * Remove the removable IntermediateOut
      
      * change strategy
      
      * code refine
      
      * enable fuse backward
      
      * code refine
      
      * code refine
      
      * rename unit test
      
      * follow comments
      d402234b
  20. 12 9月, 2018 1 次提交
  21. 03 9月, 2018 1 次提交
  22. 30 8月, 2018 1 次提交
    • C
      Enhance fused_elementwise_activation_op (#12837) · 3bd1d22a
      chengduo 提交于
      * Enhance the function of fused_elementwise_activation_op
      
      * enhance unit test
      
      * Clean Code And Add Doc
      
      * Add compound functors
      
      * Fix doc and enhance unit test
      
      * define Dx and Dy for d_binary_func
      
      * add mul_scale
      
      * add mul_scale
      
      * add elementwise_mul
      
      * code refine
      
      * code refine
      
      * add doc
      
      * add  AsIntermediate
      3bd1d22a
  23. 27 8月, 2018 1 次提交
  24. 20 8月, 2018 1 次提交
  25. 17 8月, 2018 1 次提交
  26. 16 8月, 2018 1 次提交
  27. 10 8月, 2018 1 次提交
  28. 01 8月, 2018 1 次提交
  29. 03 5月, 2018 1 次提交
  30. 30 4月, 2018 1 次提交
  31. 24 4月, 2018 1 次提交
  32. 10 4月, 2018 1 次提交
  33. 06 3月, 2018 1 次提交
  34. 28 2月, 2018 1 次提交
  35. 26 2月, 2018 1 次提交
  36. 24 2月, 2018 2 次提交
  37. 23 2月, 2018 2 次提交
  38. 13 2月, 2018 1 次提交