1. 27 6月, 2022 1 次提交
  2. 24 6月, 2022 1 次提交
  3. 23 6月, 2022 1 次提交
  4. 22 6月, 2022 4 次提交
  5. 20 6月, 2022 1 次提交
  6. 15 6月, 2022 1 次提交
  7. 14 6月, 2022 1 次提交
    • X
      [ CherryPick ] Cherry pick for einsum optimization. (#43468) · 22e75d92
      xiongkun 提交于
      * [EinsumOp] Polish forward logic and backward logic for optimize (#42603)
      
      * change logic for optimize
      
      * modifty
      
      * merge
      
      * change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 (#43010)
      
      * [EinsumOp] Make EinsumOp support bfloat16. (#43085)
      
      * change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0
      
      * make EInsumOP support bf16
      
      * add unittest for BF16
      
      * add condition for test_BF16
      
      * fix bugs
      
      * fix
      
      * change the backward api to fit einsum op
      22e75d92
  8. 08 6月, 2022 1 次提交
    • N
      Replace ReduceAmax/Amax.part.cu with KP (#43202) (#43263) · e161979e
      niuliling123 提交于
      Reduce amax/amin frobenius_norm_kerne原始实现为Eigen实现,文件编译时间较长,因此本PR将其替换为KP实现
      删除DefaultElementwiseOperator中重复功能支持,减少elementwise_double_grad OP编译时间
      e161979e
  9. 07 6月, 2022 1 次提交
  10. 06 6月, 2022 1 次提交
    • N
      cherry-pick 42645 (#43205) · 835a1888
      niuliling123 提交于
      删除Broadcast function中rank例化以及Elementwise调用,降低编译时间。
      从develop分支中的#42645 PR修改而来,由于develop分支与release分支相差较大,无法实现cherry-pick,因此针对release2.3重新提交PR.
      Broadcast中关于rank的例化会导致底层模板展开较多,造成reduce_sum_grad_kernel.cu.o文件体积过大,修改后可以降低.o体积及编译时间
      835a1888
  11. 10 5月, 2022 1 次提交
  12. 06 5月, 2022 1 次提交
  13. 05 5月, 2022 1 次提交
  14. 04 5月, 2022 2 次提交
  15. 01 5月, 2022 1 次提交
  16. 30 4月, 2022 2 次提交
  17. 28 4月, 2022 5 次提交
  18. 27 4月, 2022 3 次提交
  19. 26 4月, 2022 1 次提交
    • C
      [Cherry-pick] Optimize dygraph performance part2 (#42224) · ab24b9c0
      Chen Weihang 提交于
      * Add paddle::variant and replace paddle::any (#42139)
      
      * add variant and replace any
      
      * split attribute
      
      * Optimize dygraph GetExpectedKernelType perf (#42154)
      
      * opt dygraph scheduling
      
      * revert part impl
      
      * fix variant compile error (#42203)
      
      * replace any by variant in infermeta (#42181)
      ab24b9c0
  20. 25 4月, 2022 2 次提交
  21. 22 4月, 2022 1 次提交
  22. 21 4月, 2022 4 次提交
  23. 20 4月, 2022 2 次提交
    • Y
      [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) (#41963) · 3b25afb2
      YuanRisheng 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
      3b25afb2
    • A
      [Cherry-Pick]Fix expand_sig infershape BUG under static graph mode and... · 93f0e594
      Aurelius84 提交于
      [Cherry-Pick]Fix expand_sig infershape BUG under static graph mode and NeedTransformPlace behavior if set skip_transform in yaml (#41973)
      
      * [Phi]Fix expand_sig infershape BUG under static graph mode (#41936)
      
      * [Phi]Fix expand_sig infershape BUG under static graph mode
      
      * [Phi]Fix expand_sig infershape BUG under static graph mode
      
      * [Phi]Fix unittest
      
      * [Phi]Fix unittest
      
      * [Eager]Fix NeedTransformPlace behavior if set skip_transform in yaml (#41920)
      
      * [Eager]Fix NeedTransformPlace behavior if set skip_transform in yaml
      
      * add unittest for full_like
      
      * fix unittest
      93f0e594
  24. 19 4月, 2022 1 次提交