1. 26 4月, 2022 1 次提交
    • C
      [Cherry-pick] Optimize dygraph performance part2 (#42224) · ab24b9c0
      Chen Weihang 提交于
      * Add paddle::variant and replace paddle::any (#42139)
      
      * add variant and replace any
      
      * split attribute
      
      * Optimize dygraph GetExpectedKernelType perf (#42154)
      
      * opt dygraph scheduling
      
      * revert part impl
      
      * fix variant compile error (#42203)
      
      * replace any by variant in infermeta (#42181)
      ab24b9c0
  2. 21 4月, 2022 5 次提交
  3. 20 4月, 2022 1 次提交
  4. 19 4月, 2022 4 次提交
    • Z
      [cherry-pick] add rsqrt, equal_all, expand yaml and unittest (#41443, #41540) (#41965) · 018245d8
      zyfncg 提交于
      * add rsqrt yaml and unittest (#41443)
      
      * Add expand equal all yaml (#41540)
      
      * add expand, poisson
      
      * add poison grad
      
      * add expand equal_all poisson triangular solve yaml
      Co-authored-by: Nhong <43953930+phlrain@users.noreply.github.com>
      018245d8
    • Y
      [Cherry-pick 2.3] Autotune the workspace and kernel choosing of conv (#41833) · b4adbe5c
      Yiqun Liu 提交于
      Cherry-pick #40338 #41741 #41313
      b4adbe5c
    • F
      [cherry-pick] XPUPS Adaptation (#41917) · a9d8b947
      Fan Zhang 提交于
      * XPUPS Adaptation (#40991)
      
      * Adapt XPUPS - 1st version - 3.24
      
      * Adapt XPUPS - update XPU PushSparse -  2nd version - 3.24
      
      * Adapt XPUPS - add XPU PullSparseOp - 3nd version - 3.25
      
      * refactor heter comm kernel
      
      * update. test=develop
      
      * Adapt XPUPS - modify by compilation - 4th version - 3.27
      
      * update calc_shard_offset. test=develop
      
      * update xpu kernel. test=develop
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * heter_comm update
      
      * heter_comm update
      
      * update calc_shard_offset. test=develop
      
      * heter_comm update
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * Adapt XPUPS - use WITH_XPU_KP and modify wrapper kernel function - 5th version - 3.30
      
      * update. test=develop
      
      * update pslib.cmake
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 6th version - 3.30
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * used by minxu
      
      * update heter_comm_inl
      
      * fix. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 7th version - 3.30
      
      * fix. test=develop
      
      * add optimizer kernel. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 3.31 update
      
      * Adapt XPUPS - update kp compilation path  - 8th version - 3.31
      
      * add optimizer kernel. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm.h 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update hashtable. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 9th version - 4.1
      
      * update hashtable. test=develop
      
      * fix. test=develop
      
      * update hashtable 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 10th version - 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1 19:30
      
      * fix. test=develop
      
      * update ps_gpu_wrapper.kps 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 11th version - 4.1
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 12nd version - 4.2
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.2
      
      * 4.2 update
      
      * fix. test=develop
      
      * template init. test=develop
      
      * update 4.6
      
      * fix. test=develop
      
      * template init. test=develop
      
      * 4.6 modify by compilation
      
      * hashtable template init. test=develop
      
      * hashtable template init. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 13nd version - 4.7
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * update by pre-commit
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.12 update
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 14th version - 4.13
      
      * 4.13 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 modify by merged latest compilation
      
      * retry CI 4.14
      
      * 4.15 pass static check
      
      * 4.15 modify by gpups CI
      
      * 3.16 update by gpups CI - modify ps_gpu_wrapper.h
      
      * 4.16 update
      
      * 4.16 pass xpu compile
      
      * 4.16 retry CI
      
      * 4.16 update
      Co-authored-by: Nzmxdream <zhangminxu01@baidu.com>
      
      * modify ps_gpu_wrapper.cc
      
      * update
      Co-authored-by: Nzmxdream <zhangminxu01@baidu.com>
      a9d8b947
    • T
      cinn_launch_op: optimize the overhead of preparing variables before executing... · dab7dfbf
      TeFeng Chen 提交于
      cinn_launch_op: optimize the overhead of preparing variables before executing cinn compiled program (#41777) (#41910)
      
      cherry-pick #41777
      * optimize preparation overhead before executing cinn compiled program
      dab7dfbf
  5. 18 4月, 2022 3 次提交
    • L
      update (#41756) · 97d1ab2a
      lilong12 提交于
      97d1ab2a
    • R
      fix bugs in moe (#41903) · f92dbfb7
      Roc 提交于
      * fix moe apis (#41650)
      
      * Moe ref (#41836)
      
      * moe ref
      
      * ref commit
      
      * update; document_fix
      
      * update;document_fix
      
      * Moe ref (#41864)
      
      * moe ref
      
      * ref commit; document_fix
      
      * update; document_fix
      
      * update document_fix
      
      * update; document_fix
      f92dbfb7
    • H
      [Cherry-Pick] take along axis bug fix (#41863) · e7980adf
      huangxu96 提交于
      This PR is the cherry-pick of #41824
      
      This PR fixes a bug that will cause the Cuda address error. The reason for this bug is that the grid number of the Cuda Kernel had been wrongly set.
      e7980adf
  6. 15 4月, 2022 1 次提交
  7. 14 4月, 2022 1 次提交
    • C
      Cherry pick final state ops (#41755) · 921a6fb7
      chentianyu03 提交于
      * [Yaml]add exp yaml (#41217)
      
      * add exp yaml
      
      * add exp api in test case
      
      * add determinant yaml
      
      * fix exp op unittest
      
      * change test class name
      
      * modify api name
      
      * compacted with raw api
      
      * fix det api
      
      * add python_api
      
      * add test eager for determinant op
      
      * [Yaml] Add assign yaml (#41428)
      
      * add assign yaml
      
      * add assign api
      
      * add assign backward api
      
      * add assign
      
      * add assign yaml
      
      * add assign
      
      * assign yaml
      
      * add assign raw kernel and use assign_raw in yaml
      
      * merge develop branch
      
      * add missing python_api
      
      * exchange assign and assign_raw kernel name (#41625)
      
      * exchange assign and assign_raw kernel name
      
      * fix register error
      
      * [Yaml]add gaussian_random yaml and test case (#41312)
      
      * add guassian random yaml
      
      * add gaussian_random yaml and test case
      
      * fix error modify of full yaml
      
      * import in_dygraph_mode
      
      * import _in_legacy_dygraph
      
      * add place arg in api
      
      * import __current_expected_place
      
      * fix test_egr_python_api failed case
      
      * add test case
      
      * add cast for NormalInitializer
      
      * fix test error
      
      * fix test error
      
      * rm unsed check code
      
      * fix test error in test_initializer_nn
      
      * modify by review
      
      * [Phi]fix split error when sections has 0 size and add test case (#41708)
      
      * fix split error when sections has 0 size and add test case
      
      * fix test case
      921a6fb7
  8. 13 4月, 2022 1 次提交
  9. 12 4月, 2022 5 次提交
  10. 11 4月, 2022 3 次提交
  11. 08 4月, 2022 1 次提交
  12. 06 4月, 2022 2 次提交
    • W
      [Eager] Support test_layers's test cases switch to eager mode (#41216) · 5ae8babb
      Weilong Wu 提交于
      * [Eager] Support test_layers's test cases switch to eager mode
      
      * Update batch_norm _C_ops action to fix CI
      
      * Use None instead of new EmptyTensor
      
      * Updated var name
      
      * Make sure to switch eager mode, Fix Coverage_CI
      
      * Remove _non_static_mode statement
      
      * Remove batch_norm dispensable input statement
      
      * Polish batch_norm code
      
      * Fix CI issue
      5ae8babb
    • H
      Add conv yaml (#41354) · 7ed7c6c7
      hong 提交于
      * update
      
      * add conv yaml
      
      * add backward
      
      * remove useless code
      
      * fix bug
      
      * fix bug
      
      * revert fluid dygraph conv2d
      
      * remove useless infermeta function
      
      * fix meta fn deluplicat error
      
      * conv using custom impl
      
      * remove amp include
      
      * fix bug
      
      * use cudnn = true
      
      * fix test mkldnn caching bug
      7ed7c6c7
  13. 05 4月, 2022 4 次提交
  14. 04 4月, 2022 2 次提交
  15. 03 4月, 2022 3 次提交
    • C
      [Phi]Concat grad (#41112) · 3f57ef7a
      chentianyu03 提交于
      * add concat_grad kernel
      
      * fix error
      
      * remove comment code
      
      * fix outs nullptr error
      
      * change to phi header
      
      * add concat_grad declare for standalone_executor_test
      3f57ef7a
    • H
      Add infer meta (#41054) · 868a3203
      hong 提交于
      * add some infer meta
      
      * fix bug
      
      * fix bugs;
      
      * fix bug and add set data type
      
      * revert infer shape of lookup table
      
      * recover test
      868a3203
    • Z
      Add randperm and range yaml (#41265) · fd1ecfc5
      zyfncg 提交于
      * add randperm and range yaml
      
      * add eager test for randperm
      fd1ecfc5
  16. 02 4月, 2022 3 次提交
    • S
      Add graph apis (#40809) · b0398c8e
      Siming Dai 提交于
      * Add graph_reindex API
      
      * add graph_sample_neighbors api
      
      * Add buffer
      
      * delete VLOG
      
      * delete thrust::copy for output
      
      * add ShareDataWith
      
      * delete graph_reindex hashtable output
      
      * add graph_reindex dispensable
      
      * add reindex unittest, move memset to cuda kernel, change api
      
      * fix conflict
      
      * add reindex buffer for gpu version note
      
      * fix conflicts for op_func_generator
      
      * Add fisher_yates sampling, add dispensable, change infermeta
      
      * add dtype for edge_id
      
      * fix rocm ci and static check ci
      
      * add unittest
      
      * fix unittest
      
      * fix unittest
      
      * fix bug
      b0398c8e
    • L
      do not use scope in op kernel (#41316) · 0f6412c0
      Leo Chen 提交于
      0f6412c0
    • W
      [Paddle inference] support new quant_model (#41049) · 1b58ce14
      Wangzheee 提交于
      * paddle inference support new quant_model
      1b58ce14