- 06 1月, 2023 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 04 1月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* remove deps and prior comm * grad comm fuse * add deps for amp&global norm * stage2 broadcast prior deps * stage2 grad overlap * stream_analyzer bugfix * overlap enable * dep op namescope * depend support multiple inputs * check finite deps * stage2 param comm overlap * Set kD2HStream * grad comm hierarchical * grad comm hierarchical * new unitest Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
- 29 12月, 2022 1 次提交
-
-
由 xu98bin 提交于
* auto parallel bf16
-
- 28 12月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] adapt for clip * fix unittest * enable_static * fix dist_fill_constant_batch_size_like * fix process_mesh.shape * update cond of modifying shape_list
-
- 27 12月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
* fix input order * add unittest * update cmakelist
-
由 zhaoyingli 提交于
* [AutoParallel] quantization pass support export * support subgraph * move_presist_var_to_global_block * update unittest * fix ci-coverage * fix codestyle * fix fake_dequantize_op * remove unused var * fix ci error and aprroval error * add unittest for fp16 in test_dequant_linear * replace mutable data * fix unittest in non-cuda-core * fix unittest Co-authored-by: Ncarryyu <569782149@qq.com> Co-authored-by: Nwufeisheng <wfs1997@163.com>
-
- 26 12月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Fix a bug
-
- 19 12月, 2022 1 次提交
-
-
由 zqw_1997 提交于
* remove fluid.mul * remove mul in __all__
-
- 14 12月, 2022 2 次提交
-
-
由 feifei-111 提交于
* del array_write & array_read * fix import err * fix import err * fix example codes
-
由 zhaoyingli 提交于
* [AutoParallel] recompute tuning * fix conflict * update comment * bug fix * update rc algo * tiny fix * fix clear process_group * remove comment * update segment print * fix import OpRole * adapt amp pass and grad_clip pass for opt_tuner * update tuning config * fix import * annotate recompute info on ops and upgrade recompute pass * add op_namescope for seed op * record reserved vars * fix recompute var's dist_attr * fix strategy unittest * adapt for fp16 * update unittest * revert copy opt * update unittest * rename set_recompute_segments * fix unittest
-
- 12 12月, 2022 1 次提交
-
-
由 feifei-111 提交于
-
- 08 12月, 2022 2 次提交
-
-
由 Jianghai 提交于
* add cluster_partition and device_meshes to process_meshes funcs * add unitest
-
由 Ghost Screaming 提交于
* Merge branch 'reduce_sum' of https://github.com/GhostScreaming/Paddle into mine_fluid_clean_common. * Fix some bugs. * Clean APIs in python/paddle/fluid/layers/control_flow.py * Polish code style. * Change API. * Fix some bugs. * Fix some bugs.
-
- 06 12月, 2022 1 次提交
-
-
由 kangguangli 提交于
* remove layers.matmul in nets.py * remove layers.matmul in rnn_impl/test_quantization_pass/auto_parallel_gpt_model/test_auto_parallel_completion_gpt * remove layers.matmul in other files * fix * fix * remove layers.matmul itself * remove ref in CMakeLists.txt and tools directory * remove matmul in fluid.layers.nn.py * remove matmul in fluid.dygraph.rnn.py && resotre test_matmul_op.py * replace matmul in fluid.dygraph.rnn.py && clean api_test in test_matmul_op.py * fix error && restore empty test_auto_search_dist_matmul_op.py * fix check in test_auto_parallel_partitioner.py * fix test_dist_matmul && test_flags_mkldnn_ops_on_off * fix test_fused_attention_op_xpu.py && test_matmul_op_xpu.py * remove test_auto_search_dist_matmul_op.py * remove layers.matmul in auto_parallel_gpt_model.py && fix doc in fluid/io.py * fix for matmul_grad * fix codestyle * fix codestyle * resolve conflicts error * restore unit test file but not compiled it for later remove * fix codestyle * fix wrong unittest skip * fix unittest delete * fix scale cost * fix scale cost * resolve conflicts error * resolve conflicts error Co-authored-by: Njakpiase <jakpia21@gmail.com>
-
- 02 12月, 2022 2 次提交
-
-
由 LiYuRio 提交于
-
由 cyber-pioneer 提交于
move paddle.fluid.layers.tensor.create_parameter to paddle.tensor.creation.create_parameter (#48579)
-
- 01 12月, 2022 1 次提交
-
-
由 heyanru 提交于
-
- 29 11月, 2022 1 次提交
-
-
由 caozhou 提交于
* add pattern match * add unittest
-
- 28 11月, 2022 1 次提交
-
-
由 caozhou 提交于
* add pattern for auto search * add unittest
-
- 25 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* add isort config * isort all files
-
- 24 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
-
- 23 11月, 2022 1 次提交
-
-
由 xiaoguoguo626807 提交于
* remove transpose * codestyle check * modify CI_STATIC * modify CI_STATIC * modify enable static() * remove unused import * fix conflict about stack * fix conflict about stack * fix conflict about stack * endless conflict
-
- 22 11月, 2022 1 次提交
-
-
由 caozhou 提交于
-
- 18 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] selective recompute * add cmakelist
-
- 10 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
-
- 08 11月, 2022 1 次提交
-
-
由 JZ-LIANG 提交于
[Auto Parallel] Sharding Optimization:Partition Algorithm & Stage2 Parameter Bucket communication (#47180) * partition param by order * add logging * reorder opt * config * stage2 bucket * update unitest
-
- 07 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* fp16 pass support assign op * choose assign op exec mode * add unittest * add cmakelist
-
- 03 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2][U008] remove unnecessary args in `super()` * remove remained args * revert changes in test_pylayer_op * Revert "revert changes in test_pylayer_op" This reverts commit ff185a9ae738afac3b0264f61bde6c6b7f72e7c4. * revert some changes in example code
-
- 01 11月, 2022 2 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][E711] use `is`/`is not` for comparison with `None` * `self.assertTrue($A is None)` -> `self.assertIsNone($A)` * `self.assertTrue($A is not None)` -> `self.assertIsNotNone($A)` * `self.assertFalse($A is None)` -> `self.assertIsNotNone($A)` * `self.assertEqual($A, None)` -> `self.assertIsNone($A)` * `self.assertNotEqual($A, None)` -> `self.assertIsNotNone($A)`
-
由 Nyakku Shigure 提交于
* [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True` * revert changes in fluid * revert unrelated file * revert changes in norm * revert changes in auto_parallel_amp * fix norm and auto_parallel_amp * revert a typo fix due to fixed at #47477
-
- 31 10月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * [Auto Parallel] Add the missiong import
-
- 28 10月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* fix engine build method * fix import * update engine cost * update raise error * update cmakelist * revert optimizer * revert optimizer * fix unittest * fix unittest Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
-
- 23 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format
-
- 18 10月, 2022 3 次提交
-
-
由 caozhou 提交于
* add parallel tuner * add unittest * fix unittest * set timeout of unittest * set unittest timeout * fix auto_mode setting * update unittest * sync from develop and update unittest * remove unused import * update unittest * update cmakelist * add unittests
-
由 caozhou 提交于
* add cost interface * update inferface and add unittest * update unittest * update inferface
-
由 zhaoyingli 提交于
* [AutoParallel] add callbacks * fix unittest * fix dist_context * fix engine * fix cmakelist * fix unittest's returns * fix cmakelist
-
- 14 10月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* for gpt-gen * fix reshard * adapt assign and shape op * add dist_assign & unittest * add conditional block unittest * rename unittest
-
- 12 10月, 2022 2 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Suppport different dataloaders * [Auto Parallel] Add num_shards config for dataset * [Auto Parallel] Unify the logger and outputs of Engine API * [Auto Parallel] Fix the bugs of to_static * [Auto Parallel] Adjust the test_to_static.py * [Auto Parallel] Add the prepare API and replace __call__ with run * [Auto Parallel] Improve the private implementations of Engine * [Auto Parallel] Set capacity of dataloader for opt tuning * [Auto Parallel] [WIP] Change the fine-grained API * [Auto Parallel] Improve APIs to support different user cases * [Auto Parallel] Add removed config * [Auto Parallel] Add imports * [Auto Parallel] Fix bugs for to_static * [Auto Parallel] Remove unnecessary imports
-
由 Nyakku Shigure 提交于
-
- 10 10月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Unify the logger and outputs of Engine API * [Auto Parallel] Fix the bugs of to_static * [Auto Parallel] Adjust the test_to_static.py
-