- 09 9月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
* adapt gradient merge * fix op_role * fix strategy
-
由 zhaoyingli 提交于
* adapt lazy init and fix pass * add unittest * update comment * fix amp and sharding * remove clip_by_norm
-
- 07 9月, 2022 1 次提交
-
-
由 caozhou 提交于
* support iterable dataset for auto parallel * add split_data proto * fix unittest bug * fix recompute bug * update cmake
-
- 05 9月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* dist_matmul trans * update unittest * update cmakelist
-
- 01 9月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
-
- 31 8月, 2022 2 次提交
-
-
由 JZ-LIANG 提交于
* bugfix (#45332) * dist embedding support lookup table v1 * add unitest * update unitest cmake
-
由 zhaoyingli 提交于
* add grad_clip pass * add unittest * add notes * update func * add dist_attr for new op
-
- 25 8月, 2022 1 次提交
-
-
由 JZ-LIANG 提交于
* support high order differential with data parallel overlap * update unitest
-
- 23 8月, 2022 3 次提交
-
-
由 zhaoyingli 提交于
* add quant pass
-
由 JZ-LIANG 提交于
-
由 JZ-LIANG 提交于
* bugfix * remove scaling * support rescale_grad opt * add unitest
-
- 18 8月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
* add clip_grad * fix comments * add unittest * update logger
-
由 caozhou 提交于
-
- 16 8月, 2022 2 次提交
- 15 8月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* add collate_fn * fix number of inputs
-
- 12 8月, 2022 3 次提交
-
-
由 caozhou 提交于
* update reshard for auto search * fix unittest bug * update dist tensor * update reshard output * fix unittests bug * merge develop
-
由 JZ-LIANG 提交于
* bugfix * remove scaling * support rescale_grad opt
-
由 Yulong Ao 提交于
* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
-
- 09 8月, 2022 1 次提交
-
-
由 caozhou 提交于
* add mul dist op cost * add mul unittest
-
- 03 8月, 2022 2 次提交
-
-
由 JZ-LIANG 提交于
-
由 Aurelius84 提交于
* [Dy2St]Support generate whole program in ProgramHelper for Engine * support to(mode) * fix word typo * fix unittest
-
- 29 7月, 2022 2 次提交
- 28 7月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
-
由 Yuang Liu 提交于
-
- 27 7月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
-
- 25 7月, 2022 2 次提交
-
-
由 caozhou 提交于
* update comp cost * add dist default op cost * add dist fill constant batch size like op cost * add elewise op cost * add fill_constant_batch_size_like op cost unittest * add unittest and remove fill_constant_batch_size_like grad op cost * add to cmakelist * fix unittest bug
-
由 Aurelius84 提交于
* [dy2st]Add ProgramHelper to polish build program logic in autoparallel.Engine * refine code
-
- 21 7月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* fix unittest * fix log_dir * _enable_legacy_dygraph
-
- 18 7月, 2022 1 次提交
-
-
由 caozhou 提交于
-
- 13 7月, 2022 3 次提交
- 12 7月, 2022 1 次提交
-
-
由 caozhou 提交于
* update base cost * update unittest of cost model * add unittest
-
- 11 7月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* add 'to_static' in engine api * fix cmakelist
-
- 07 7月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* fix op_role * fix engine * update op_role
-
- 29 6月, 2022 1 次提交
-
-
由 JZ-LIANG 提交于
* fixed bug for pass & engine * fixed bug for benchmark GPT-3
-
- 24 6月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Use a fast completion for data parallelism * remove unuse cuSparse function * [Auto Parallel] Fix some bugs of the fast dp completion * [Auto Parallel] Add the cmake statements * [Auto Parallel] Make the unittest adapt to the new interface * [Auto Parallel] Modify the timeout of the unittest * [Auto Parallel] Remove unnecessary comments Co-authored-by: Nzhouwei25 <zhouwei25@baidu.com>
-
- 13 6月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* fix fetch list * fix unittest
-