- 06 1月, 2023 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 04 1月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* remove deps and prior comm * grad comm fuse * add deps for amp&global norm * stage2 broadcast prior deps * stage2 grad overlap * stream_analyzer bugfix * overlap enable * dep op namescope * depend support multiple inputs * check finite deps * stage2 param comm overlap * Set kD2HStream * grad comm hierarchical * grad comm hierarchical * new unitest Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
- 30 12月, 2022 2 次提交
-
-
由 zhouzj 提交于
-
由 Sanbu 提交于
* 1219 * temporarily change the num_diff_files limit, test=document_fix * Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20. * for codestyle * remove duplicate license * `static mode` -> `static graph mode` * Update hybrid_parallel_inference.py * Update layer_function_generator.py * Update manipulation.py * reset Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 29 12月, 2022 1 次提交
-
-
由 xu98bin 提交于
* auto parallel bf16
-
- 28 12月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] adapt for clip * fix unittest * enable_static * fix dist_fill_constant_batch_size_like * fix process_mesh.shape * update cond of modifying shape_list
-
- 27 12月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
* fix input order * add unittest * update cmakelist
-
由 zhaoyingli 提交于
* [AutoParallel] quantization pass support export * support subgraph * move_presist_var_to_global_block * update unittest * fix ci-coverage * fix codestyle * fix fake_dequantize_op * remove unused var * fix ci error and aprroval error * add unittest for fp16 in test_dequant_linear * replace mutable data * fix unittest in non-cuda-core * fix unittest Co-authored-by: Ncarryyu <569782149@qq.com> Co-authored-by: Nwufeisheng <wfs1997@163.com>
-
- 26 12月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Fix a bug
-
- 25 12月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* delete legacy dygraph code in python/paddle/distributed * refine
-
- 21 12月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] fix update complete and add_to_collection * fix annotation * fix amp fill_constant dist_attr
-
由 zhaoyingli 提交于
* [AutoParallel] keep input order * rm annotation
-
- 14 12月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] recompute tuning * fix conflict * update comment * bug fix * update rc algo * tiny fix * fix clear process_group * remove comment * update segment print * fix import OpRole * adapt amp pass and grad_clip pass for opt_tuner * update tuning config * fix import * annotate recompute info on ops and upgrade recompute pass * add op_namescope for seed op * record reserved vars * fix recompute var's dist_attr * fix strategy unittest * adapt for fp16 * update unittest * revert copy opt * update unittest * rename set_recompute_segments * fix unittest
-
由 JZ-LIANG 提交于
* recompute dep filter param * recompute dep for reshard
-
- 12 12月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][isort][F401] fix some regression issues * add import paddle to fix eval call
-
- 08 12月, 2022 1 次提交
-
-
由 Jianghai 提交于
* add cluster_partition and device_meshes to process_meshes funcs * add unitest
-
- 05 12月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
-
- 29 11月, 2022 3 次提交
-
-
由 caozhou 提交于
* add pattern match * add unittest
-
由 Nyakku Shigure 提交于
* isort all files * revert conflicting files * revert conflicting files * revert conflicting files
-
由 JZ-LIANG 提交于
* add depend * add origin amp files * fp16 distinguish None & False * engine log * dp add deps for graph exe * add dep for grad clip * dep ops in comm stream * unitest
-
- 28 11月, 2022 2 次提交
-
-
由 Aurelius84 提交于
* [Fluid Clean] Migrate program_translate.py/jit.py into paddle.jit dir
-
由 caozhou 提交于
* add pattern for auto search * add unittest
-
- 24 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
-
- 22 11月, 2022 4 次提交
-
-
由 caozhou 提交于
-
由 JZ-LIANG 提交于
* add depend * fp16 pass distinguish None & False * engine log
-
由 Nyakku Shigure 提交于
[CodeStyle][py2][py311] replace deprecated `inspect.getargspec` with `inspect.getfullargspec` (#48218) * [CodeStyle][py2] use inspect.getfullargspect instead of deprecated inspect.getargspec * refactor to f-string
-
由 zhaoyingli 提交于
-
- 18 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] selective recompute * add cmakelist
-
- 15 11月, 2022 3 次提交
-
-
由 zhouweiwei2014 提交于
-
由 zhaoyingli 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle][F821] fix remained F821 issues * refine comment * fix _set_item
-
- 14 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
[CodeStyle][F821] fix undefined variables due to missing imports, misspelled variable names (#47899) * `hann` -> `_hann` * `false` -> `False` * a missing passed argument `reduce_all` * some missing imports * `device_type` -> `heter_device_type` * `PKVClient` -> `KVClient` * fix some typos and missing imports
-
- 10 11月, 2022 3 次提交
-
-
由 zhaoyingli 提交于
-
由 caozhou 提交于
* add c_concat pass for reshard * add unittest
-
由 zhaoyingli 提交于
* fix insert concat op * fix fp16 assert
-
- 09 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
-
- 08 11月, 2022 2 次提交
-
-
由 JZ-LIANG 提交于
[Auto Parallel] Sharding Optimization:Partition Algorithm & Stage2 Parameter Bucket communication (#47180) * partition param by order * add logging * reorder opt * config * stage2 bucket * update unitest
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition * fix an increment
-
- 07 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* expand op donot use naive data parallel * fix unittest
-
- 03 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2][U008] remove unnecessary args in `super()` * remove remained args * revert changes in test_pylayer_op * Revert "revert changes in test_pylayer_op" This reverts commit ff185a9ae738afac3b0264f61bde6c6b7f72e7c4. * revert some changes in example code
-