- 31 10月, 2022 1 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * [Auto Parallel] Add the missiong import 
 
- 
- 28 10月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* fix engine build method * fix import * update engine cost * update raise error * update cmakelist * revert optimizer * revert optimizer * fix unittest * fix unittest Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
 
- 
- 23 10月, 2022 1 次提交
- 
- 
由 Nyakku Shigure 提交于* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format 
 
- 
- 18 10月, 2022 3 次提交
- 
- 
由 caozhou 提交于* add parallel tuner * add unittest * fix unittest * set timeout of unittest * set unittest timeout * fix auto_mode setting * update unittest * sync from develop and update unittest * remove unused import * update unittest * update cmakelist * add unittests 
- 
由 caozhou 提交于* add cost interface * update inferface and add unittest * update unittest * update inferface 
- 
由 zhaoyingli 提交于* [AutoParallel] add callbacks * fix unittest * fix dist_context * fix engine * fix cmakelist * fix unittest's returns * fix cmakelist 
 
- 
- 14 10月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* for gpt-gen * fix reshard * adapt assign and shape op * add dist_assign & unittest * add conditional block unittest * rename unittest 
 
- 
- 12 10月, 2022 2 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Suppport different dataloaders * [Auto Parallel] Add num_shards config for dataset * [Auto Parallel] Unify the logger and outputs of Engine API * [Auto Parallel] Fix the bugs of to_static * [Auto Parallel] Adjust the test_to_static.py * [Auto Parallel] Add the prepare API and replace __call__ with run * [Auto Parallel] Improve the private implementations of Engine * [Auto Parallel] Set capacity of dataloader for opt tuning * [Auto Parallel] [WIP] Change the fine-grained API * [Auto Parallel] Improve APIs to support different user cases * [Auto Parallel] Add removed config * [Auto Parallel] Add imports * [Auto Parallel] Fix bugs for to_static * [Auto Parallel] Remove unnecessary imports 
- 
由 Nyakku Shigure 提交于
 
- 
- 10 10月, 2022 1 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Unify the logger and outputs of Engine API * [Auto Parallel] Fix the bugs of to_static * [Auto Parallel] Adjust the test_to_static.py 
 
- 
- 08 10月, 2022 1 次提交
- 
- 
由 caozhou 提交于* update comp cost and completion for gpt auto search * add unittest 
 
- 
- 28 9月, 2022 2 次提交
- 
- 
由 zhaoyingli 提交于
- 
由 zhaoyingli 提交于* [AutoParallel] fix dist_split * add unittest * update cmakelist 
 
- 
- 27 9月, 2022 2 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Imporve the user-defined fetches and logging * [Auto Parallel] Make Engine class callable * [Auto Parallel] Update the data loading of tuner 
- 
由 Nyakku Shigure 提交于* [CodeStyle] remove all future import * revert test_error.py * restore future import in example code 
 
- 
- 26 9月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于
 
- 
- 19 9月, 2022 1 次提交
- 
- 
由 Yulong Ao 提交于
 
- 
- 17 9月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于
 
- 
- 15 9月, 2022 1 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Add the serialization process for dist attrs * [Auto Parallel] Remove unnecessary comments * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix the code style * [Auto Parallel] Remove unnecessary impls * [Auto Parallel] Fix the importing error * [Auto Parallel] Fix the copy from bugs of op dist attr * [Auto Parallel] Replace the use of constexpr if * [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh * [Auto Parallel] Change API of the completion unittest * [Auto Parallel] Fix the bug when set_attr an int * [Auto Parallel] Add the unittest for the serialization * [Auto Parallel] Add some unit tests * [Auto Paralle] Unify the strategy * [Auto Parallel] Improve the engine api * [Auto Parallel] Reset the changes made to the framework * [Auto Parallel] Change the engine unittest * [Auto Parallel] Update API of the completion and partitioner * [Auto Parallel] Update unit tests using engine api * update shard annotation * [Auto Parallel] Remove the modifications of other modules * [Auto Parallel] Add docs for APIs * add new strategy * [Auto Parallel] Replace the logger * [Auto Parallel] Restore the test_program.py * [Auto Parallel] Change the import rules * [Auto Parallel] Add the examples for Engine * [Auto Parallel] Do some minor changes * [Auto Parallel] Remove yaml dependency * [Auto Parallel] Fix the unittests * add valid after train * bug fix Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com> Co-authored-by: Ncaozhou <caozhou@radi.ac.cn> Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com> 
 
- 
- 14 9月, 2022 2 次提交
- 
- 
由 Nyakku Shigure 提交于* trim trailing whitespace * fix `.cmake-format.py` * revert npu ut changes, avoid npu ci error 
- 
由 Xiaoxu Chen 提交于* add reduce_mean,reduce_sum primitive ops * add ne_p gt_p primitive operators * add ge_p abs_p primitive oparators 
 
- 
- 09 9月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* adapt lazy init and fix pass * add unittest * update comment * fix amp and sharding * remove clip_by_norm 
 
- 
- 07 9月, 2022 1 次提交
- 
- 
由 caozhou 提交于* support iterable dataset for auto parallel * add split_data proto * fix unittest bug * fix recompute bug * update cmake 
 
- 
- 05 9月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* dist_matmul trans * update unittest * update cmakelist 
 
- 
- 31 8月, 2022 2 次提交
- 
- 
由 JZ-LIANG 提交于* bugfix (#45332) * dist embedding support lookup table v1 * add unitest * update unitest cmake 
- 
由 zhaoyingli 提交于* add grad_clip pass * add unittest * add notes * update func * add dist_attr for new op 
 
- 
- 25 8月, 2022 1 次提交
- 
- 
由 JZ-LIANG 提交于* support high order differential with data parallel overlap * update unitest 
 
- 
- 23 8月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* add quant pass 
 
- 
- 18 8月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* add clip_grad * fix comments * add unittest * update logger 
 
- 
- 16 8月, 2022 1 次提交
- 
- 
由 caozhou 提交于* update reshard cost and cost estimator * add unittest * add dropout cost * fix import error * fix reshard code style error * improve unittest coverage 
 
- 
- 15 8月, 2022 2 次提交
- 
- 
由 zhaoyingli 提交于* add collate_fn * fix number of inputs 
- 
由 Yulong Ao 提交于* [Auto Parallel] Move the distributed info from python to c++ * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc * [Auto Parallel] Add the lost file * [Auto Parallel] Make the dist attr be unique_ptr * [Auto Parallel] Add the proto conversion * [Auto Parallel] Improve the proto support * [Auto Parallel] Fix the bugs for adding a device or a link * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper * [Auto Parallel] Improve the impl of these dist attrs * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h * [Auto Parallel] Remove unused codes * [Auto Parallel] Check whether the dist attr is null * [Auto Parallel] Implement the assign operator for OpDesc explicitly 
 
- 
- 12 8月, 2022 1 次提交
- 
- 
由 Yulong Ao 提交于* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h 
 
- 
- 09 8月, 2022 1 次提交
- 
- 
由 caozhou 提交于* add mul dist op cost * add mul unittest 
 
- 
- 03 8月, 2022 1 次提交
- 
- 
由 Aurelius84 提交于* [Dy2St]Support generate whole program in ProgramHelper for Engine * support to(mode) * fix word typo * fix unittest 
 
- 
- 29 7月, 2022 2 次提交
- 27 7月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于
 
- 
- 25 7月, 2022 1 次提交
- 
- 
由 caozhou 提交于* update comp cost * add dist default op cost * add dist fill constant batch size like op cost * add elewise op cost * add fill_constant_batch_size_like op cost unittest * add unittest and remove fill_constant_batch_size_like grad op cost * add to cmakelist * fix unittest bug 
 
- 
- 21 7月, 2022 1 次提交
- 
- 
由 zhaoyingli 提交于* fix unittest * fix log_dir * _enable_legacy_dygraph 
 
- 
