- 24 Jul, 2023 1 commit
-
Committed by Chen Weihang
* add shard tensor api * add DistAttr api * add unittest for coverage * fix process mesh sample code * fix checking error
-
- 20 Jul, 2023 1 commit
-
Committed by JZ-LIANG
* base rule * add sharding merge * add sharding axis merge * define unified data class for inferencing dist_attr * test wrap DistTensorSpec in dygraph mode * matmul main logic done * shape int64 * common cc * define unified data class for inferencing dist_attr * test wrap DistTensorSpec in dygraph mode * define python api and wrap function in static mode for DistTensorSpec * revise syntax * map bugfix * broadcast func * compile 1 * add unittest * add registry * update unittest * bugfix * bugfix * add pybind * bugfix * bugfix macro global namespace * bugfix macro global namespace * pybind * pybind test * pybind bugfixed1 * pybind bugfixed2 * pybind unittest * merge dev * merge dev * merge dev * fixed cmake conflict * fixed cmake conflict * rename get method * revise inferforward output type * revise comment * replicated rule * replicated rule 2 * revert bug deps * add rule * add unittest * add rule * add unittest * move ut of auto_parallel * fix ut * bugfix * bugfix * bugfix * bugfix * bugfix * bugfix * bugfix * resolve input sharding conflict maybe * fixed comment * add rule * add unittest * fixed typos --------- Co-authored-by: Yichen Zhang <zhangyichen03@baidu.com> Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
-
- 12 Jul, 2023 1 commit
-
Committed by JZ-LIANG
* resolve input sharding conflict maybe * fixed comment --------- Co-authored-by: Yichen Zhang <zhangyichen03@baidu.com> Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
-
- 07 Jul, 2023 3 commits
-
Committed by Yichen Zhang
* add reduction spmd rule for auto parallel * fix the logic of handling partial * fix code style * fix the partial handling
-
Committed by Yichen Zhang
* add some basic functions * add elementwise rule for auto parallel * add unit test for elementwise rule * fix the lib name in spmd rule test cmake file * fix some bugs * add unit tests for elementwise spmd rule in python * bug fix * delete cpp unit test for elementwise spmd rule (use python ut now) * add cpp unit test for elementwise rule * use concrete op name in unit test * fix typo * fix code style * delete cpp unit test * add more details in log
-
Committed by Leo Chen
-
- 06 Jul, 2023 2 commits
-
Committed by zhaoyingli
-
Committed by zhaoyingli
-
- 04 Jul, 2023 1 commit
-
Committed by JZ-LIANG
* base rule * add sharding merge * add sharding axis merge * define unified data class for inferencing dist_attr --------- Co-authored-by: Yichen Zhang <zhangyichen03@baidu.com>
-
- 29 Jun, 2023 1 commit
-
Committed by zhaoyingli
* add skip_gc_vars for 1f1b schedule mode * add pp_degree and pp_stage
-
- 27 Jun, 2023 1 commit
-
Committed by JZ-LIANG
* base rule * add sharding merge * add sharding axis merge * define unified data class for inferencing dist_attr * test wrap DistTensorSpec in dygraph mode * matmul main logic done * define unified data class for inferencing dist_attr --------- Co-authored-by: Yichen Zhang <zhangyichen03@baidu.com>
-
- 25 Jun, 2023 1 commit
-
Committed by zhaoyingli
* auto parallel support pipeline scheduler with standalone executor * rm check_fetch * update cmakelist and flags env * rm set micro batch id * rm import * update utils func * raise error when merge tensor for return_numpy is False * fix _pipeline_opt * fix unittest
-
- 20 Jun, 2023 1 commit
-
Committed by Azure
* add auto tuner * compare and record module * revert launch main * add prune rule * add unit test * add auto tuner * revert launch main * add prune rule * modify unit test script * fix bug for dump nodes; fix bug for checking log file * fix bug --------- Co-authored-by: caozhou <caozhou@radi.ac.cn>
-
- 14 Jun, 2023 3 commits
-
Committed by caozhou
* add auto tuner * fix prune * fix sharding prune and mbs candidates * fix cfg * fix launch * fix launch * add unittest * fix code style
-
Committed by Ghost Screaming
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Remove climits. * Fix problem of pickle and NCCL_P2P_DISABLE in distributed testcases in cuda12. * Fix problem of TimeOut of distributed testcases under cuda12. * Remove useless modification. * Remove useless modification.
-
Committed by sneaxiy
* fix A100 CUDA12 ut * fix ci uts * fix test_sync_batch_norm_op * fix sync bn op ut again by separating 2 files * fix codestyle ci * combine other PRs * fix codestyle * fix codestyle ci
-
- 13 Jun, 2023 2 commits
- 12 Jun, 2023 1 commit
-
Committed by Nyakku Shigure
-
- 06 Jun, 2023 1 commit
-
Committed by zhaoyingli
* [AutoParallel] fix sharding for 0D tensor and amp-o1 * add amp for sharding unittest
-
- 02 Jun, 2023 1 commit
-
Committed by zhaoyingli
* [AutoParallel] add 1F1B * rm amp
-
- 01 Jun, 2023 2 commits
-
Committed by zhaoyingli
* [AutoParallel] update while control_flow with pipeline * update process group instantiate * fix micro_bsz for reshard * update api for micro batch size * add strategy for dp optimization
-
Committed by tianshuo78520a
* mv all unittests test * fix error * fix error * fix * fix * del unittests * fix paddle_build.sh * fix * fix test * fix add test * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * merge develop * fix test_async_read_write * fix test_async_read_write * merge develop * fix * fix import legacy_test * fix * fix * fix * fix * fix * fix * fix * fix * fix bug * fix * fix coverage test bug * fix * fix * fix * fix * fix * fix code style * fix code * fix code * fix * fix * fix * del test_sequence_enumerate_op.py * fix
-
- 30 May, 2023 1 commit
-
Committed by Yulong Ao
* [Auto Parallel] Reorganize the folder structure * [Auto Parallel] Fix some import errors
-
- 22 May, 2023 1 commit
-
Committed by Meteor Liu
* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * fixed cyclic reference that caused partial import * fixed bad change * fix bad import * fix bad import * fix bad import * fix ut failed caused by change in_dynamic_mode * fix ut failed caused by change in_dynamic_mode * fixed usage of in_dynamic_mode() or in_dygraph_mode() * revert python3 to python in .pre-commit-config.yaml * fix merge conflicts
-
- 11 May, 2023 1 commit
-
Committed by Kaipeng Deng
* move DataLoader to paddle.io. test=develop
-
- 09 May, 2023 1 commit
-
Committed by zhouweiwei2014
-
- 30 Apr, 2023 1 commit
-
Committed by zhouweiwei2014
-
- 26 Apr, 2023 1 commit
-
Committed by zhenhailiu
* polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish
-
- 24 Apr, 2023 1 commit
-
Committed by zqw_1997
* test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * fix doc erros, test=allcase
-
- 17 Apr, 2023 1 commit
-
Committed by tianshuo78520a
* mv ps distributed dir * fix * add del auto_parallel * add auto_parallel * fix ps * fix bug * fix test bug * fix test bug * merge develop fix error * merge develop fix error * merge develop fix error
-