- 05 9月, 2023 1 次提交
-
-
由 Wennie396 提交于
* replace amp.use_pure_fp16 with amp.dtype and amp.level * old api still use use_pure_fp16 * test_fuse_adamw_pass still use use_pure_fp16 * add test case tuning recompute with amp(float16,o2) * reset new test case properties TIMEOUT 60 * set smaller value of batch_size and batch_num * deepcopy dist_context fix _rename_input problem * fix loss name after cast * set tuning.enable=True and use engine._tune() * restore some changes in _rename_input()/_rename_output() * add self.amp_dtype for _cast_loss() in auto_parallel_amp.py * fix insert op index in _cast_loss()
-
- 04 9月, 2023 1 次提交
-
-
由 LiYuRio 提交于
-
- 31 8月, 2023 2 次提交
-
-
由 caozhou 提交于
-
由 Chen Weihang 提交于
* move matmul spmd rules into phi * add basic infer spmd utils * addspmd factory * fix compile error * add unittest * refine infer spmd test and utils * debug infer spmd test * adapt python test * poish details * change to vector attr arg * revert needless change * update matmul spmd rule test * remove original rule * polish details * fix marco error * add comment * pass backward test * fix compile error * add cmake rule for spmd_rules_test * add dist meta tensor * update pybind impl * add marco for rules
-
- 25 8月, 2023 3 次提交
-
-
由 JZ-LIANG 提交于
* add embedding backward rule * update backward api * revert api * matmul inferbackward * update unitest
-
由 LiYuRio 提交于
* support create shard dist tesnor * support non-zero shard to replicated * change reshard signature
-
由 Yichen Zhang 提交于
* fix small bug in reshape spmd rule * small fix in unit test
-
- 24 8月, 2023 1 次提交
-
-
由 Leo Chen 提交于
* shard_tensor support static graph * add comments * add dy2static ut * use property in c++ side
-
- 22 8月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* polish dist teensor design * adjust constructor * polish details * polish details design * fix compile error * refactor init tensor impl * fix reshard test * polish details * add unittest for coverage
-
- 16 8月, 2023 2 次提交
-
-
由 LiYuRio 提交于
-
由 Chen Weihang 提交于
* add phi forward api gen impl * add phi backward gen code * polish api code gen impl * polish code gen impl * remove auto_paralel namespace * add dygraph forward impl * add for_auto_parallel cond * fix code gen errors * add dygraph backward impl * resolve conflict with develop * refactor dist api gen impl * revert origin api gen impl * replace template for override func * fix dnnl marco error * revert third_party change * add with distributed marco * Update grad_tensor_holder.cc details * merge dist tensor constructor * change test tensor to replicate * fx typo * resolve conflict with develop * fix out dim error
-
- 15 8月, 2023 1 次提交
-
-
由 Wennie396 提交于
* fix sharding_pass stage3 precision problem * delete 'stage3 has precision problem' comment * add dp2 training after load dp_engine * unset grad_clip=clip for opt
-
- 14 8月, 2023 1 次提交
-
-
由 Yichen Zhang 提交于
* add reshape spmd rule * add unit test for reshape spmd rule * bug fix * replace the print_info function with to_string * fix typo * bug fix * add handling for "0" in target shape * remove the part of computing size in dim_trans.cc
-
- 10 8月, 2023 1 次提交
-
-
由 LiYuRio 提交于
-
- 09 8月, 2023 1 次提交
-
-
由 LoneRanger 提交于
remove the AdamOptimizer、SGDOptimizer、MomentumOptimizer、ModelAverage、LookaheadOptimizer、FtrlOptimizer、DecayedAdagradOptimizer、DpsgdOptimizer in fluid and relocate the ExponentialMovingAverage、PipelineOptimizer、GradientMergeOptimizer and change optimizer base for LarsMomentumOptimizer and RecomputeOptimizer (#55970) * change the optimizer base for SGDOptimizer * change the optimizer base for SGDOptimizer * replace the SGDOptimizer with SGD * fix bug of sgd * change the optimizer base for MomentumOptimizer * fix the remaining tests * remove the Momentum in fluid/optimizer.py * fix bug * fix bug * fix bug * fix bug * Update test_resnet_cinn.py * Update test_resnet_prim_cinn.py * fix bug * fix bug * fix bug * remove the ModelAverage in fluid * remove the LookaheadOptimizer in fluid * fix bug * remove AdamOptimizer in fluid * Update test_image_classification_fp16.py * fix bug * relocate the ExponentialMovingAverage in fluid * restore the static api * remove the FtrlOptimizer in fluid * remove the DecayedAdagradOptimizer in fluid * remove the DpsgdOptimizer in fluid * fix bug * fix codestyle * fix bug * fix bug * relocate the PipelineOptimizer * relocate the GradientMergeOptimizer * fix bug * fix bug * fix bug * fix doc * Update __init__.py * Update test_fleet_qat_meta_optimizer.py * change optimizer base for LarsMomentumOptimizer * fix bug * fix conflict * fix code-style * fix sample codes * fix bug * fix bug * fix cinn bug * fix bug * fix bug * Update qat_optimizer.py * Update __init__.py * fix bug * change optimizer base for RecomputeOptimizer * fix bug * fix bug * Update test_imperative_optimizer_v2.py
-
- 04 8月, 2023 2 次提交
- 02 8月, 2023 1 次提交
-
-
由 zhaoyingli 提交于
* Update autoparallel DistributedDataLoader * add places for engine.dataloder()
-
- 01 8月, 2023 1 次提交
-
-
由 Zhan Rongrui 提交于
[CodeStyle] replace `assert np.allclose` with `np.testing.assert_allclose` and `assert np.array_equal` with `np.testing.assert_array_equal` (#55385)
-
- 31 7月, 2023 1 次提交
-
-
由 Difer 提交于
* simple reaplce * for debug * fix bugs * fix some bugs * del fill_constant_batch_size_like
-
- 24 7月, 2023 3 次提交
-
-
由 Yichen Zhang 提交于
* [Semi-Auto] Add transpose spmd rule * add unit test in cmake file * log perm info
-
由 Yichen Zhang 提交于
* add split spmd rule * add pytest in cmake file * small fix
-
由 Chen Weihang 提交于
* add shard tensor api * add DistAttr api * add unittest for coverage * fix process mesh sample code * fix checking error
-
- 20 7月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* base rule * add sharidng merge * add sharidng axis merge * define unified data class for inferencing dist_attr * test wrap DistTensorSpec in dygraph mode * matmul main logic done * shape int64 * common cc * define unified data class for inferencing dist_attr * test wrap DistTensorSpec in dygraph mode * define python api and wrap function in static mode for DistTensorSpec * revise syntax * map bugfix * broadcast func * compile 1 * add unitest * add registry * update unitest * bugfix * bugfix * add pybind * bugfix * bugfix macro gloabl name space * bugfix macro gloabl name space * pybind * pybind test * pybind bugfixed1 * pybind bugfixed2 * pybind unitest * merge dev * merge dev * merge dev * fixed cmake conflict * fixed cmake conflict * rename get method * revise inferforward output type * revise comment * replicated rule * replicated rule 2 * revert bug deps * add rule * add unitest * add rule * add unitest * move ut of auto_parallel * fix ut * bugfix * bugfix * bugfix * bugfix * bugfix * bugfix * bugfix * resolute input sharding conflict maybe * fixed comment * add rule * add unitest * fixed typoes --------- Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com> Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
-
- 12 7月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* resolute input sharding conflict maybe * fixed comment --------- Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com> Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
-
- 07 7月, 2023 3 次提交
-
-
由 Yichen Zhang 提交于
* add reduction spmd rule for auto parallel * fix the logic of handling partial * fix code style * fix the partial handling
-
由 Yichen Zhang 提交于
* add some basic functions * add elementwise rule for auto parallel * add unit test for elementwise rule * fix the lib name in spmd rule test cmake file * fix some bugs * add unit tests for elementwise spmd rule in python * bug fix * delete cpp unit test for elementwise spmd rule (use python ut now) * add cpp unit test for elementwise rule * use concrete op name in unit test * fix typo * fix code style * delete cpp unit test * add more details in log
-
由 Leo Chen 提交于
-
- 06 7月, 2023 2 次提交
-
-
由 zhaoyingli 提交于
-
由 zhaoyingli 提交于
-
- 04 7月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* base rule * add sharidng merge * add sharidng axis merge * define unified data class for inferencing dist_attr --------- Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com>
-
- 29 6月, 2023 1 次提交
-
-
由 zhaoyingli 提交于
* add skip_gc_vars for 1f1b schedule mode * add pp_degree and pp_stage
-
- 27 6月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* base rule * add sharidng merge * add sharidng axis merge * define unified data class for inferencing dist_attr * test wrap DistTensorSpec in dygraph mode * matmul main logic done * define unified data class for inferencing dist_attr --------- Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com>
-
- 25 6月, 2023 1 次提交
-
-
由 zhaoyingli 提交于
* auto parallel support pipeline scheduler with standalone executor * rm check_fetch * update cmakelist and flags env * rm set micro batch id * rm import * update utils func * raise error when merge tensor for return_numpy is False * fix _pipeline_opt * fix unittest
-
- 20 6月, 2023 1 次提交
-
-
由 Azure 提交于
* add auto tuner * compare and record module * revert launch main * add prune rule * add unit test * add auto tuner * revert launch main * add prune rule * modify unit test script * fix bug for dump nodes; fix bug for checking log file * fix bug --------- Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
-
- 14 6月, 2023 3 次提交
-
-
由 caozhou 提交于
* add auto tuner * fix prune * fix sharding prune and mbs candidates * fix cfg * fix launch * fix launch * add unittest * fix code style
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Remove climits. * Fix problem of pickle and NCCL_P2P_DISABLE in distributed testcases in cuda12. * Fix problem of TimeOut of distributed testcases under cuda12. * Remove useless modification. * Remove useless modification.
-
由 sneaxiy 提交于
* fix A100 CUDA12 ut * fix ci uts * fix test_sync_batch_norm_op * fix sync bn op ut again by separating 2 files * fix codestyle ci * combine other PRs * fix codestyle * fix codestyle ci
-
- 13 6月, 2023 2 次提交