- 08 3月, 2023 1 次提交
- 
- 
由 YuanRisheng 提交于* move io * fix ci bugs * fix ci bugs * fix py3 bugs * fix example code * fix example code * fix text * fix text * deal with ci bugs * perfect code according comment * delete import batch 
 
- 
- 27 2月, 2023 3 次提交
- 
- 
由 chenxujun 提交于
- 
由 zhaoyingli 提交于* fix dist_attr in data_parallel in optimization * fix grad_clip pass when pp2 * fix dist_attr 
- 
由 zhaoyingli 提交于* fix set_grad_var_shape * recover modify 
 
- 
- 10 1月, 2023 1 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Remove some fluid APIs * [Auto Parallel] Fix the wrong import * [Auto Parallel] Remove unnecessary comments * [Auto Parallel] Fix the importing bug 
 
- 
- 07 1月, 2023 1 次提交
- 
- 
由 Ruibiao Chen 提交于* Enable standalone executor for fleet training * Update code * Replace use_standalone_executor utils in auto parallel * Update code * Diable standalone executor for test_pass_sharding * Update code * Set sequential run for auto parallel * Fix dist_attr bug * Set sequential run for auto parallel 
 
- 
- 06 1月, 2023 1 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style 
 
- 
- 04 1月, 2023 1 次提交
- 
- 
由 JZ-LIANG 提交于* remove deps and prior comm * grad comm fuse * add deps for amp&global norm * stage2 broadcast prior deps * stage2 grad overlap * stream_analyzer bugfix * overlap enable * dep op namescope * depend support multiple inputs * check finite deps * stage2 param comm overlap * Set kD2HStream * grad comm hierarchical * grad comm hierarchical * new unitest Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
 
- 
- 28 12月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* [AutoParallel] adapt for clip * fix unittest * enable_static * fix dist_fill_constant_batch_size_like * fix process_mesh.shape * update cond of modifying shape_list 
 
- 
- 26 12月, 2022 1 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Fix a bug 
 
- 
- 14 12月, 2022 2 次提交
- 
- 
由 zhaoyingli 提交于* [AutoParallel] recompute tuning * fix conflict * update comment * bug fix * update rc algo * tiny fix * fix clear process_group * remove comment * update segment print * fix import OpRole * adapt amp pass and grad_clip pass for opt_tuner * update tuning config * fix import * annotate recompute info on ops and upgrade recompute pass * add op_namescope for seed op * record reserved vars * fix recompute var's dist_attr * fix strategy unittest * adapt for fp16 * update unittest * revert copy opt * update unittest * rename set_recompute_segments * fix unittest 
- 
由 JZ-LIANG 提交于* recompute dep filter param * recompute dep for reshard 
 
- 
- 05 12月, 2022 1 次提交
- 
- 
由 Nyakku Shigure 提交于
 
- 
- 29 11月, 2022 2 次提交
- 
- 
由 Nyakku Shigure 提交于* isort all files * revert conflicting files * revert conflicting files * revert conflicting files 
- 
由 JZ-LIANG 提交于* add depend * add origin amp files * fp16 distinguish None & False * engine log * dp add deps for graph exe * add dep for grad clip * dep ops in comm stream * unitest 
 
- 
- 22 11月, 2022 1 次提交
- 
- 
由 JZ-LIANG 提交于* add depend * fp16 pass distinguish None & False * engine log 
 
- 
- 18 11月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* [AutoParallel] selective recompute * add cmakelist 
 
- 
- 09 11月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于
 
- 
- 08 11月, 2022 1 次提交
- 
- 
由 JZ-LIANG 提交于[Auto Parallel] Sharding Optimization:Partition Algorithm & Stage2 Parameter Bucket communication (#47180) * partition param by order * add logging * reorder opt * config * stage2 bucket * update unitest 
 
- 
- 07 11月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* expand op donot use naive data parallel * fix unittest 
 
- 
- 31 10月, 2022 1 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * [Auto Parallel] Add the missiong import 
 
- 
- 28 10月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* fix engine build method * fix import * update engine cost * update raise error * update cmakelist * revert optimizer * revert optimizer * fix unittest * fix unittest Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
 
- 
- 23 10月, 2022 1 次提交
- 
- 
由 Nyakku Shigure 提交于* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format 
 
- 
- 18 10月, 2022 2 次提交
- 
- 
由 caozhou 提交于* add cost interface * update inferface and add unittest * update unittest * update inferface 
- 
由 zhaoyingli 提交于* [AutoParallel] add callbacks * fix unittest * fix dist_context * fix engine * fix cmakelist * fix unittest's returns * fix cmakelist 
 
- 
- 14 10月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* for gpt-gen * fix reshard * adapt assign and shape op * add dist_assign & unittest * add conditional block unittest * rename unittest 
 
- 
- 12 10月, 2022 1 次提交
- 
- 
由 Nyakku Shigure 提交于* [CodeStyle][F401] remove unused import in python/paddle/distributed * remove pass * empty commit * Fix ValueError: list.remove(x): x not in list for meta_optimizer_names. Fix ValueError: list.remove(x): x not in list for meta_optimizer_names. * Fix split import. Fix split import. * add noqa after meta_optimizers in factory * restort collective ops * expand `import *` * add noqa after required imports * try to fix APIs without core.ops * Revert "try to fix APIs without core.ops" This reverts commit 6172beaf601e84bf61f2490c12c4739f0edaa5eb. * fix an increment * empty commit * add noqa after required imports * expand `import *`, fix ci error Co-authored-by: NShuangchi He <34329208+Yulv-git@users.noreply.github.com>
 
- 
- 20 9月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于
 
- 
- 15 9月, 2022 1 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Add the serialization process for dist attrs * [Auto Parallel] Remove unnecessary comments * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix the code style * [Auto Parallel] Remove unnecessary impls * [Auto Parallel] Fix the importing error * [Auto Parallel] Fix the copy from bugs of op dist attr * [Auto Parallel] Replace the use of constexpr if * [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh * [Auto Parallel] Change API of the completion unittest * [Auto Parallel] Fix the bug when set_attr an int * [Auto Parallel] Add the unittest for the serialization * [Auto Parallel] Add some unit tests * [Auto Paralle] Unify the strategy * [Auto Parallel] Improve the engine api * [Auto Parallel] Reset the changes made to the framework * [Auto Parallel] Change the engine unittest * [Auto Parallel] Update API of the completion and partitioner * [Auto Parallel] Update unit tests using engine api * update shard annotation * [Auto Parallel] Remove the modifications of other modules * [Auto Parallel] Add docs for APIs * add new strategy * [Auto Parallel] Replace the logger * [Auto Parallel] Restore the test_program.py * [Auto Parallel] Change the import rules * [Auto Parallel] Add the examples for Engine * [Auto Parallel] Do some minor changes * [Auto Parallel] Remove yaml dependency * [Auto Parallel] Fix the unittests * add valid after train * bug fix Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com> Co-authored-by: Ncaozhou <caozhou@radi.ac.cn> Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com> 
 
- 
- 14 9月, 2022 2 次提交
- 
- 
由 Nyakku Shigure 提交于* trim trailing whitespace * fix `.cmake-format.py` * revert npu ut changes, avoid npu ci error 
- 
由 JZ-LIANG 提交于* bugfix (#45332) * dist embedding support lookup table v1 * add unitest * customize wait_comm * group gradients * bugfix * update program 
 
- 
- 13 9月, 2022 1 次提交
- 
- 
由 Charles-hit 提交于
 
- 
- 31 8月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* add grad_clip pass * add unittest * add notes * update func * add dist_attr for new op 
 
- 
- 18 8月, 2022 2 次提交
- 
- 
由 zhaoyingli 提交于* add clip_grad * fix comments * add unittest * update logger 
- 
由 caozhou 提交于
 
- 
- 12 8月, 2022 2 次提交
- 29 7月, 2022 1 次提交
- 
- 
由 JZ-LIANG 提交于* fixed bug for pass & engine * fixed bug for benchmark GPT-3 * add tuner & profiler * add algorithms & config 
 
- 
- 28 7月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于
 
- 
- 07 7月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* fix op_role * fix engine * update op_role 
 
- 
