1. 15 6月, 2022 1 次提交
  2. 14 6月, 2022 1 次提交
    • X
      [ CherryPick ] Cherry pick for einsum optimization. (#43468) · 22e75d92
      xiongkun 提交于
      * [EinsumOp] Polish forward logic and backward logic for optimize (#42603)
      
      * change logic for optimize
      
      * modifty
      
      * merge
      
      * change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 (#43010)
      
      * [EinsumOp] Make EinsumOp support bfloat16. (#43085)
      
      * change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0
      
      * make EInsumOP support bf16
      
      * add unittest for BF16
      
      * add condition for test_BF16
      
      * fix bugs
      
      * fix
      
      * change the backward api to fit einsum op
      22e75d92
  3. 08 6月, 2022 1 次提交
    • N
      Replace ReduceAmax/Amax.part.cu with KP (#43202) (#43263) · e161979e
      niuliling123 提交于
      Reduce amax/amin frobenius_norm_kerne原始实现为Eigen实现,文件编译时间较长,因此本PR将其替换为KP实现
      删除DefaultElementwiseOperator中重复功能支持,减少elementwise_double_grad OP编译时间
      e161979e
  4. 07 6月, 2022 1 次提交
  5. 06 6月, 2022 1 次提交
    • N
      cherry-pick 42645 (#43205) · 835a1888
      niuliling123 提交于
      删除Broadcast function中rank例化以及Elementwise调用,降低编译时间。
      从develop分支中的#42645 PR修改而来,由于develop分支与release分支相差较大,无法实现cherry-pick,因此针对release2.3重新提交PR.
      Broadcast中关于rank的例化会导致底层模板展开较多,造成reduce_sum_grad_kernel.cu.o文件体积过大,修改后可以降低.o体积及编译时间
      835a1888
  6. 10 5月, 2022 1 次提交
  7. 06 5月, 2022 1 次提交
  8. 05 5月, 2022 1 次提交
  9. 04 5月, 2022 2 次提交
  10. 01 5月, 2022 1 次提交
  11. 30 4月, 2022 2 次提交
  12. 28 4月, 2022 5 次提交
  13. 27 4月, 2022 3 次提交
  14. 26 4月, 2022 1 次提交
    • C
      [Cherry-pick] Optimize dygraph performance part2 (#42224) · ab24b9c0
      Chen Weihang 提交于
      * Add paddle::variant and replace paddle::any (#42139)
      
      * add variant and replace any
      
      * split attribute
      
      * Optimize dygraph GetExpectedKernelType perf (#42154)
      
      * opt dygraph scheduling
      
      * revert part impl
      
      * fix variant compile error (#42203)
      
      * replace any by variant in infermeta (#42181)
      ab24b9c0
  15. 25 4月, 2022 2 次提交
  16. 22 4月, 2022 1 次提交
  17. 21 4月, 2022 4 次提交
  18. 20 4月, 2022 2 次提交
    • Y
      [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) (#41963) · 3b25afb2
      YuanRisheng 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
      3b25afb2
    • A
      [Cherry-Pick]Fix expand_sig infershape BUG under static graph mode and... · 93f0e594
      Aurelius84 提交于
      [Cherry-Pick]Fix expand_sig infershape BUG under static graph mode and NeedTransformPlace behavior if set skip_transform in yaml (#41973)
      
      * [Phi]Fix expand_sig infershape BUG under static graph mode (#41936)
      
      * [Phi]Fix expand_sig infershape BUG under static graph mode
      
      * [Phi]Fix expand_sig infershape BUG under static graph mode
      
      * [Phi]Fix unittest
      
      * [Phi]Fix unittest
      
      * [Eager]Fix NeedTransformPlace behavior if set skip_transform in yaml (#41920)
      
      * [Eager]Fix NeedTransformPlace behavior if set skip_transform in yaml
      
      * add unittest for full_like
      
      * fix unittest
      93f0e594
  19. 19 4月, 2022 6 次提交
  20. 18 4月, 2022 3 次提交
    • C
      [Phi]Reduce kernels into multiply files (#41747) (#41854) · 688f4ec0
      chentianyu03 提交于
      * split reduce_kernel
      
      * rm reduce_kernel in cmake
      
      * split reduce_grad kernels
      
      * fix cmake build error
      
      * format code
      
      * fix standalone_executor_test error
      688f4ec0
    • Z
      [DoubleGrad] Enabled double grad test cases in eager_mode for... · a367fbab
      Zhanlue Yang 提交于
      [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad (#41451) (#41893)
      
      * [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad
      
      * Fixed elementwise issue
      
      * Addressed CI failures
      a367fbab
    • J
      Add eager string tensor (#41039) (#41839) · 623f8308
      Jack Zhou 提交于
      * Add core.eager.StringTensor __init__ which pyarray args can be passed
      
      * Add the numpy method of core.eager.StringTensor
      
      * revert tensor.to_string modification
      
      * Add ToPyObject for core.eager.StringTensor
      
      * Add debug string for core.eager.StringTensor
      
      * Remove place args of core.eager.StringTensor temporarily
      
      * Fix check string_tensor error
      
      * remove dtype of core.eager.StringTensor
      
      * add core.eager.StringTensor unittest
      
      * remove pstring from VarDesc
      
      * Add InitStringTensorWithStringTensor
      
      * Remove to_string modification
      
      * Remove zero_copy arg from StringTensor creator
      623f8308