1. 08 Jun, 2022 1 commit
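    • Replace ReduceAmax/Amax.part.cu with KP (#43202) (#43263) · e161979e
      niuliling123 committed
      The original implementations of the reduce amax/amin and frobenius_norm kernels were Eigen-based, and their files took a long time to compile, so this PR replaces them with KP implementations.
      Remove redundant functionality from DefaultElementwiseOperator to reduce the compile time of the elementwise_double_grad op.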
  2. 07 Jun, 2022 1 commit
  3. 06 Jun, 2022 1 commit
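    • cherry-pick 42645 (#43205) · 835a1888
      niuliling123 committed
      Remove the per-rank template instantiation and the Elementwise call from the Broadcast function to reduce compile time.
      Adapted from PR #42645 on the develop branch. Because the develop and release branches have diverged considerably, a direct cherry-pick was not possible, so the change was resubmitted as a new PR against release 2.3.
      Instantiating Broadcast per rank causes extensive template expansion in the underlying code and makes the reduce_sum_grad_kernel.cu.o file too large; this change reduces both the .o size and the compile time.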
  4. 10 May, 2022 1 commit
  5. 06 May, 2022 1 commit
  6. 05 May, 2022 1 commit
  7. 04 May, 2022 2 commits
  8. 01 May, 2022 1 commit
  9. 30 Apr, 2022 2 commits
  10. 28 Apr, 2022 5 commits
  11. 27 Apr, 2022 3 commits
  12. 26 Apr, 2022 1 commit
    • [Cherry-pick] Optimize dygraph performance part2 (#42224) · ab24b9c0
      Chen Weihang committed
      * Add paddle::variant and replace paddle::any (#42139)
      
      * add variant and replace any
      
      * split attribute
      
      * Optimize dygraph GetExpectedKernelType perf (#42154)
      
      * opt dygraph scheduling
      
      * revert part impl
      
      * fix variant compile error (#42203)
      
      * replace any by variant in infermeta (#42181)
  13. 25 Apr, 2022 2 commits
  14. 22 Apr, 2022 1 commit
  15. 21 Apr, 2022 4 commits
  16. 20 Apr, 2022 2 commits
    • [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) (#41963) · 3b25afb2
      YuanRisheng committed
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
    • [Cherry-Pick]Fix expand_sig infershape BUG under static graph mode and... · 93f0e594
      Aurelius84 committed
      [Cherry-Pick]Fix expand_sig infershape BUG under static graph mode and NeedTransformPlace behavior if set skip_transform in yaml (#41973)
      
      * [Phi]Fix expand_sig infershape BUG under static graph mode (#41936)
      
      * [Phi]Fix expand_sig infershape BUG under static graph mode
      
      * [Phi]Fix expand_sig infershape BUG under static graph mode
      
      * [Phi]Fix unittest
      
      * [Phi]Fix unittest
      
      * [Eager]Fix NeedTransformPlace behavior if set skip_transform in yaml (#41920)
      
      * [Eager]Fix NeedTransformPlace behavior if set skip_transform in yaml
      
      * add unittest for full_like
      
      * fix unittest
  17. 19 Apr, 2022 6 commits
  18. 18 Apr, 2022 4 commits
    • [Phi]Reduce kernels into multiply files (#41747) (#41854) · 688f4ec0
      chentianyu03 committed
      * split reduce_kernel
      
      * rm reduce_kernel in cmake
      
      * split reduce_grad kernels
      
      * fix cmake build error
      
      * format code
      
      * fix standalone_executor_test error
    • [DoubleGrad] Enabled double grad test cases in eager_mode for... · a367fbab
      Zhanlue Yang committed
      [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad (#41451) (#41893)
      
      * [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad
      
      * Fixed elementwise issue
      
      * Addressed CI failures
    • Add eager string tensor (#41039) (#41839) · 623f8308
      Jack Zhou committed
      * Add core.eager.StringTensor __init__ which pyarray args can be passed
      
      * Add the numpy method of core.eager.StringTensor
      
      * revert tensor.to_string modification
      
      * Add ToPyObject for core.eager.StringTensor
      
      * Add debug string for core.eager.StringTensor
      
      * Remove place args of core.eager.StringTensor temporarily
      
      * Fix check string_tensor error
      
      * remove dtype of core.eager.StringTensor
      
      * add core.eager.StringTensor unittest
      
      * remove pstring from VarDesc
      
      * Add InitStringTensorWithStringTensor
      
      * Remove to_string modification
      
      * Remove zero_copy arg from StringTensor creator
    • [Cherry-pick] Organize the API of custom operators (#41882) · 897911fc
      Chen Weihang committed
      * [Phi&CustomOp] Remove deprecated enum PlaceType for custom op & add warning (#41647)
      
      * remove old custom op placetype
      
      * replace dist  placetype using
      
      * add with gpu macro
      
      * fix mutable_data error
      
      * fix set value error
      
      * add comment
      
      * remove all is initialized using (#41766)
      
      * remove inner_place using (#41768)
      
      * polish tensor depreacted method warning (#41807)
      
      * [CustomOp] Fix PlaceType related compat error (#41826)
      
      * fix place type related compat error
      
      * fix test failed
      
      * remove dll decl
      
      * revert place type change
      
      * add dll decl
      
      * resolve conflict
  19. 15 Apr, 2022 1 commit
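
Note on commit 835a1888 (cherry-pick 42645, #43205): its message attributes the size of reduce_sum_grad_kernel.cu.o to Broadcast being instantiated once per tensor rank. The sketch below illustrates that general trade-off in plain C++ and is not Paddle's code. Every name in it (kMaxRank, BroadcastConfig, InputOffsetStatic, InputOffsetDispatch, InputOffsetDynamic) is hypothetical, and the maximum rank of 6 is an assumption made only for the example. The first variant takes the rank as a template parameter, so the compiler emits one copy of the function per supported rank; the second takes the rank as a runtime value, so only one copy is compiled.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical upper bound on tensor rank for this sketch.
constexpr int kMaxRank = 6;

// Hypothetical description of one broadcast input.
struct BroadcastConfig {
  int rank;
  std::array<std::size_t, kMaxRank> out_dims;    // output shape
  std::array<std::size_t, kMaxRank> in_strides;  // 0 on broadcast dimensions
};

// Variant 1: rank is a template parameter. Each distinct Rank used below
// produces a separate instantiation of this function.
template <int Rank>
std::size_t InputOffsetStatic(const BroadcastConfig& cfg, std::size_t out_idx) {
  std::size_t offset = 0;
  for (int d = Rank - 1; d >= 0; --d) {
    offset += (out_idx % cfg.out_dims[d]) * cfg.in_strides[d];
    out_idx /= cfg.out_dims[d];
  }
  return offset;
}

// Runtime-to-compile-time dispatch: forces kMaxRank instantiations.
std::size_t InputOffsetDispatch(const BroadcastConfig& cfg, std::size_t out_idx) {
  switch (cfg.rank) {
    case 1: return InputOffsetStatic<1>(cfg, out_idx);
    case 2: return InputOffsetStatic<2>(cfg, out_idx);
    case 3: return InputOffsetStatic<3>(cfg, out_idx);
    case 4: return InputOffsetStatic<4>(cfg, out_idx);
    case 5: return InputOffsetStatic<5>(cfg, out_idx);
    case 6: return InputOffsetStatic<6>(cfg, out_idx);
    default:
      assert(false && "unsupported rank");
      return 0;
  }
}

// Variant 2: rank is a runtime value. Only one function body is compiled,
// at the cost of a loop bound the compiler cannot unroll statically.
std::size_t InputOffsetDynamic(const BroadcastConfig& cfg, std::size_t out_idx) {
  assert(cfg.rank >= 1 && cfg.rank <= kMaxRank);
  std::size_t offset = 0;
  for (int d = cfg.rank - 1; d >= 0; --d) {
    offset += (out_idx % cfg.out_dims[d]) * cfg.in_strides[d];
    out_idx /= cfg.out_dims[d];
  }
  return offset;
}
```

Both variants compute the same offset for a given configuration. In a real kernel library the per-rank function would itself be a template over element type and functor, so the number of instantiations multiplies, which is consistent with the object-size concern the commit message describes.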