- 17 Sep, 2022 1 commit

Committed by zhaoyingli
 
- 16 Sep, 2022 1 commit

Committed by JZ-LIANG
* bugfix
* bugfix
* typos fixed
 
- 15 Sep, 2022 1 commit

Committed by Yulong Ao
* [Auto Parallel] Use c++ dist attr in the completion process
* [Auto Parallel] Add minor changes
* [Auto Parallel] Use c++ dist attr in the completion process
* [Auto Parallel] Add minor changes
* [Auto Parallel] Add the serialization process for dist attrs
* [Auto Parallel] Remove unnecessary comments
* [Auto Parallel] Fix some bugs
* [Auto Parallel] Fix the code style
* [Auto Parallel] Remove unnecessary impls
* [Auto Parallel] Fix the importing error
* [Auto Parallel] Fix the copy from bugs of op dist attr
* [Auto Parallel] Replace the use of constexpr if
* [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh
* [Auto Parallel] Change API of the completion unittest
* [Auto Parallel] Fix the bug when set_attr an int
* [Auto Parallel] Add the unittest for the serialization
* [Auto Parallel] Add some unit tests
* [Auto Parallel] Unify the strategy
* [Auto Parallel] Improve the engine api
* [Auto Parallel] Reset the changes made to the framework
* [Auto Parallel] Change the engine unittest
* [Auto Parallel] Update API of the completion and partitioner
* [Auto Parallel] Update unit tests using engine api
* update shard annotation
* [Auto Parallel] Remove the modifications of other modules
* [Auto Parallel] Add docs for APIs
* add new strategy
* [Auto Parallel] Replace the logger
* [Auto Parallel] Restore the test_program.py
* [Auto Parallel] Change the import rules
* [Auto Parallel] Add the examples for Engine
* [Auto Parallel] Do some minor changes
* [Auto Parallel] Remove yaml dependency
* [Auto Parallel] Fix the unittests
* add valid after train
* bug fix

Co-authored-by: zhaoyingli <zhaoyingli@baidu.com>
Co-authored-by: caozhou <caozhou@radi.ac.cn>
Co-authored-by: caozhou <48191911+Caozhou1995@users.noreply.github.com>
 
- 14 Sep, 2022 1 commit

Committed by JZ-LIANG
* bugfix (#45332)
* dist embedding support lookup table v1
* add unittest
* customize wait_comm
* group gradients
* bugfix
* update program
 
- 09 Sep, 2022 2 commits

Committed by zhaoyingli
* adapt gradient merge
* fix op_role
* fix strategy

Committed by zhaoyingli
* adapt lazy init and fix pass
* add unittest
* update comment
* fix amp and sharding
* remove clip_by_norm
 
- 07 Sep, 2022 1 commit

Committed by caozhou
* support iterable dataset for auto parallel
* add split_data proto
* fix unittest bug
* fix recompute bug
* update cmake
 
- 23 Aug, 2022 2 commits
- 18 Aug, 2022 1 commit

Committed by zhaoyingli
* add clip_grad
* fix comments
* add unittest
* update logger
 
- 15 Aug, 2022 1 commit

Committed by zhaoyingli
* add collate_fn
* fix number of inputs
 
- 03 Aug, 2022 1 commit

Committed by JZ-LIANG
 
- 29 Jul, 2022 1 commit

Committed by JZ-LIANG
* fixed bug for pass & engine
* fixed bug for benchmark GPT-3
* add tuner & profiler
* add algorithms & config
 
- 25 Jul, 2022 1 commit

Committed by Aurelius84
* [dy2st] Add ProgramHelper to polish build program logic in autoparallel.Engine
* refine code
 
- 18 Jul, 2022 1 commit

Committed by caozhou
 
- 13 Jul, 2022 2 commits
- 11 Jul, 2022 1 commit

Committed by zhaoyingli
* add 'to_static' in engine api
* fix cmakelist
 
- 07 Jul, 2022 1 commit

Committed by zhaoyingli
* fix op_role
* fix engine
* update op_role
 
- 29 Jun, 2022 1 commit

Committed by JZ-LIANG
* fixed bug for pass & engine
* fixed bug for benchmark GPT-3
 
- 24 Jun, 2022 1 commit

Committed by Yulong Ao
* [Auto Parallel] Use a fast completion for data parallelism
* remove unused cuSparse function
* [Auto Parallel] Fix some bugs of the fast dp completion
* [Auto Parallel] Add the cmake statements
* [Auto Parallel] Make the unittest adapt to the new interface
* [Auto Parallel] Modify the timeout of the unittest
* [Auto Parallel] Remove unnecessary comments

Co-authored-by: zhouwei25 <zhouwei25@baidu.com>
 
- 13 Jun, 2022 1 commit

Committed by zhaoyingli
* fix fetch list
* fix unittest
 
- 08 Jun, 2022 1 commit

Committed by zhaoyingli
* add fetch_list
* fix evaluate log
* tiny fix
 
- 05 Jun, 2022 1 commit

Committed by Sing_chan
* use yapf to format all python files
* yapf excludes two unittest files because they rely on writing and reading files, and formatting would break them
* disable diff_py_file because too many diff files cause the following command to fail
 
- 02 Jun, 2022 1 commit

Committed by zhaoyingli
* prepare only once
 
- 01 Jun, 2022 1 commit

Committed by Yulong Ao
* [Auto Parallel] Add the parallel tuner
* [Auto Parallel] Improve the parallel tuner and fix some bugs
* update cost model
* update import Resharder by dist op
* update cost model
* fix comp cost bug
* update cost model
* [Auto Parallel] Amend the dist attr for #processes=1
* update cost model and tuner
* update cost model and tuner
* update cost model and tuner
* update cluster
* update reshard
* [Auto Parallel] Add the estimation from the cost model
* [Auto Parallel] Reimplement the backup and restore functions
* [Auto Parallel] Fix the bugs of the parallel tuner
* [Auto Parallel] Update the engine api and dist context
* [Auto Parallel] Work around the high order grad problem
* [Auto Parallel] Add some miscellaneous improvements
* [Auto Parallel] Add a unittest for DistributedContext

Co-authored-by: caozhou <caozhou@radi.ac.cn>
 
- 19 May, 2022 1 commit

Committed by zhaoyingli
* slice data in dist_loader & flag to scale grad
* bug fix
* update unittest
* enable static
 
- 10 May, 2022 1 commit

Committed by Yulong Ao
* [Auto Parallel] Refactor the engine api and parallelizer
* [Auto Parallel] Fix the default dist op for the slice op
* [Auto Parallel] Fix the format of planer.py
* [Auto Parallel] Fix a bug
 
- 07 May, 2022 1 commit

Committed by Yulong Ao
* [Auto Parallel] Replace the old planner by the new partition tuner
* [Auto Parallel] Improve the completion and distributed context
* [Auto Parallel] Fix some bugs of the compatible check of some dist ops
* [Auto Parallel] Fix some bugs
 
- 06 May, 2022 1 commit

Committed by zhaoyingli
* add default_ctx in backward.py
* record grad_var_to_var with grad_times
* fix backward
* update annotation
* add complete_high_order_grad in complete_forward
* add dist slice op
* update grad_var_to_var type
* update partition_block init mapping before loss op
* update compatible for 'XShape' & update 'allreduce_vars'
* add dist reshape op when input dim equal to output dim
* update 'set_grad_var_shape' with grad_var_to_var
* fix dist slice
* fix set_grad_var_shape
* add dist pnorm op
* fix dist pnorm dist_attr
* fix engine startprogram & adapt highorder grad
* fix set_grad_var_shape when mp
* update unittest
* update cmakelist
* default strategy in engine: dp
* bug fix
* tiny fix
* flatten outputs
* fix default strategy
* init default ctx
* tiny fix
* test=allcase
 
- 18 Apr, 2022 1 commit

Committed by JZ-LIANG
* adapt dist op
* [Auto Parallel] Support the auto completion of while_op
* add dist_fill_constant_batch_size_like
* align infer accuracy
 
- 28 Mar, 2022 1 commit

Committed by caozhou
* fix code style
* update unittest
 
- 23 Mar, 2022 1 commit

Committed by zhaoyingli
* add dist_saver and update engine
* add dist_saver and update engine
 
- 16 Mar, 2022 1 commit

Committed by Yulong Ao
* [Auto Parallel] Support the auto completion of while_op
* [Auto Parallel] Improve the completion algorithms
* [Auto Parallel] Fix bugs for ernie inference
* [Auto Parallel] Remove attrs which cannot be pickled
* [Auto Parallel] make the dims_mappings of LodTensorArray vars empty
* [Auto Parallel] Fix bugs for the ernie inference in the pipeline parallel
* [Auto Parallel] Remove unnecessary comments
* [Auto Parallel] Fix a bug of the CMakeLists
* [Auto Parallel] Use the newest APIs to write the unit test
* [Auto Parallel] Remove unnecessary statements
 
- 07 Mar, 2022 1 commit

Committed by zhaoyingli
* engine support pp
* fix format
* avoid multi print
* fix convert
* bug fix
* add pp unittest
 
- 24 Feb, 2022 1 commit

Committed by JZ-LIANG
 
- 22 Feb, 2022 1 commit

Committed by Yulong Ao
* [Auto Parallel] Add the high-level Engine API
* Update the test cmakefile
 
