1. 19 9月, 2022 9 次提交
    • W
      refactor mp. (#45803) (#46121) · e5dc9d61
      wuhuachaocoding 提交于
      * refactor mp.
      
      * update setup.py.
      
      * update mp_layers.py for compatibility.
      
      * add documents for mp_layers.py
      
      * update init.py
      
      * update collective.py.
      
      * update.
      
      * update mp_ops.py
      
      * update.
      
      * update code style.
      
      * update code style.
      e5dc9d61
    • Y
      [Cherry-pick][Auto Parallel] Improve the APIs (#46164) · c5cc4278
      Yulong Ao 提交于
      * [AutoParallel] adapt gradient merge pass (#45915)
      
      * adapt gradient merge
      
      * fix op_role
      
      * fix strategy
      
      * [Auto Parallel] Gradient Fuse Allreduce (#45643)
      
      * bugfix (#45332)
      
      * dist embedding support lookup table v1
      
      * add unitest
      
      * customize wait_comm
      
      * group gradients
      
      * bugfix
      
      * update program
      
      * [Auto Parallel] Improve the APIs (#45776)
      
      * [Auto Parallel] Use c++ dist attr in the completion process
      
      * [Auto Parallel] Add minor changes
      
      * [Auto Parallel] Use c++ dist attr in the completion process
      
      * [Auto Parallel] Add minor changes
      
      * [Auto Parallel] Add the serialization process for dist attrs
      
      * [Auto Parallel] Remove unnecessary comments
      
      * [Auto Parallel] Fix some bugs
      
      * [Auto Parallel] Fix the code style
      
      * [Auto Parallel] Remove unnecessary impls
      
      * [Auto Parallel] Fix the importing error
      
      * [Auto Parallel] Fix the copy from bugs of op dist attr
      
      * [Auto Parallel] Replace the use of constexpr if
      
      * [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh
      
      * [Auto Parallel] Change API of the completion unittest
      
      * [Auto Parallel] Fix the bug when set_attr an int
      
      * [Auto Parallel] Add the unittest for the serialization
      
      * [Auto Parallel] Add some unit tests
      
      * [Auto Paralle] Unify the strategy
      
      * [Auto Parallel] Improve the engine api
      
      * [Auto Parallel] Reset the changes made to the framework
      
      * [Auto Parallel] Change the engine unittest
      
      * [Auto Parallel] Update API of the completion and partitioner
      
      * [Auto Parallel] Update unit tests using engine api
      
      * update shard annotation
      
      * [Auto Parallel] Remove the modifications of other modules
      
      * [Auto Parallel] Add docs for APIs
      
      * add new strategy
      
      * [Auto Parallel] Replace the logger
      
      * [Auto Parallel] Restore the test_program.py
      
      * [Auto Parallel] Change the import rules
      
      * [Auto Parallel] Add the examples for Engine
      
      * [Auto Parallel] Do some minor changes
      
      * [Auto Parallel] Remove yaml dependency
      
      * [Auto Parallel] Fix the unittests
      
      * add valid after train
      
      * bug fix
      Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com>
      Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
      Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com>
      
      * [Auto Parallel] Bugfix allreduce fuse for MP (#46086)
      
      * bugfix
      
      * bugfix
      
      * typos fixed
      
      * update strategy (#46138)
      Co-authored-by: Nzhaoyingli <86812880+zhaoyinglia@users.noreply.github.com>
      Co-authored-by: NJZ-LIANG <jianzhongliang10@gmail.com>
      Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com>
      Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
      Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com>
      c5cc4278
    • S
      fix broadcast kernel (#46158) · 860f6077
      sneaxiy 提交于
      860f6077
    • J
      [Eager] Optimize log (#45783) (#46133) · e468e93c
      Jiabin Yang 提交于
      * make eager log readable
      
      * fix compile error
      
      * recover test
      
      * invoke ci again
      e468e93c
    • X
      convfusion_cache (#46054) · f4ec1563
      xiaoxiaohehe001 提交于
      f4ec1563
    • C
      Revert "Simplify size op impl (#45808)" (#46168) · dabb8f23
      Chen Weihang 提交于
      This reverts commit c252b1de.
      dabb8f23
    • S
      rename fleetx, develop=document_fix (#46141) · 7a6db0a3
      ShenLiang 提交于
      7a6db0a3
    • M
      [Cherry-pick] fix bug for TransformedDistribution (#46157) · a5d4f571
      MayYouBeProsperous 提交于
      fix bug for TransformedDistribution
      a5d4f571
    • C
      Unify core avx and core_noavx to libpaddle (#46095) (#46113) · 4261ae34
      Chen Weihang 提交于
      * unify  core_avx and core_noavx
      
      * fix except error
      
      * revert mac compile logic
      
      * revert dylib to so
      
      * add core_noavx branch
      
      * remove core_noavx
      
      * replace paddle_core by lib paddle
      
      * polish var name
      
      * replace paddle_core by libpaddle
      
      * update custom device commit
      
      * polish code by comments
      4261ae34
  2. 17 9月, 2022 2 次提交
  3. 16 9月, 2022 4 次提交
  4. 15 9月, 2022 8 次提交
  5. 14 9月, 2022 7 次提交
  6. 13 9月, 2022 2 次提交
  7. 09 9月, 2022 8 次提交