- 16 3月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* update env setting * update pass logic * dist op support bf16 * backward cast update * update setting * update backward * revert amp pass * update fp16 backward logic * register c_embedding bf16 * revert engine * add unitest * add unitest * update unitest * update cmake * update math * update math.py * update unitest * update unitest * revise unitest * revise unitest * update unitest * update unitest * update unitest
-
- 14 3月, 2023 1 次提交
-
-
由 zhaoyingli 提交于
-
- 13 3月, 2023 1 次提交
-
-
由 Aurelius84 提交于
-
- 15 2月, 2023 1 次提交
-
-
由 xu98bin 提交于
* auto parallel align tool * modify function get_var's return * add save and load in align_tool * modify load function and save function * add finding different ops in align tool * full auto parallel align tool add test file for auto parallel align tool set timeout for test modify get_backward_tmp_var function add annotation for align tool modify test file modify code to restart CI remove timeout * set timeout
-
- 14 2月, 2023 1 次提交
-
-
由 mhy-666 提交于
-
- 10 2月, 2023 1 次提交
-
-
由 yuehuayingxueluo 提交于
-
- 09 2月, 2023 2 次提交
-
-
由 zqw_1997 提交于
* remove dygraph.parallel.ParallelEnv * logger.py error: AttributeError: module 'paddle' has no attribute 'distributed' * move the implenmentation to the root folder * logger.py import ParallelEnv from paddle.parallel to avoid circular import * add the comment of why import ParallelEnv from paddle.parallel in logger.py and remove the api interface in the paddle/parallel.py * outdated Env and note removed * decouple the logger.py and ParallelEnv * remove another ref of parallel in init.py
-
由 yuehuayingxueluo 提交于
* fix the processing order of passes in pass_base.py * fix processing order * add _PASS_PROCESS_ORDER_LIST * delete some pass in _PASS_PROCESS_ORDER_LIST * add assert in pass_base.py * remove fuse_optimizer * add _fusion_opt_list_rule * add test_pass_base_list.py * fix some bug * add fused_attention * add some passes to list * fix ci bug * fix ci bug
-
- 02 2月, 2023 1 次提交
-
-
由 Jianghai 提交于
* fix bugs in cluster to device meshes * add tests * 1
-
- 12 1月, 2023 3 次提交
-
-
由 zhangkaihuo 提交于
-
由 yuehuayingxueluo 提交于
-
由 zhaoyingli 提交于
* recovery annotation * bugfix
-
- 11 1月, 2023 1 次提交
-
-
由 yuehuayingxueluo 提交于
* add FusedLinear pass * add fused_op_list and renname PASSES to OP_FUSION * add fused_passes_list to constants.py * add test_passes.py * fix test_fused_passes.py * fix add if float(paddle.version.cuda()) >= 11.6: * renamed test_fused_passes.py * fix CMakeList.txt
-
- 10 1月, 2023 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Remove some fluid APIs * [Auto Parallel] Fix the wrong import * [Auto Parallel] Remove unnecessary comments * [Auto Parallel] Fix the importing bug
-
- 09 1月, 2023 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParalle] balancing the calculation of global_norm in data parallel * fix unittest * update cond pure_data_parallel
-
- 07 1月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Enable standalone executor for fleet training * Update code * Replace use_standalone_executor utils in auto parallel * Update code * Diable standalone executor for test_pass_sharding * Update code * Set sequential run for auto parallel * Fix dist_attr bug * Set sequential run for auto parallel
-
- 06 1月, 2023 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 04 1月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* remove deps and prior comm * grad comm fuse * add deps for amp&global norm * stage2 broadcast prior deps * stage2 grad overlap * stream_analyzer bugfix * overlap enable * dep op namescope * depend support multiple inputs * check finite deps * stage2 param comm overlap * Set kD2HStream * grad comm hierarchical * grad comm hierarchical * new unitest Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
- 29 12月, 2022 1 次提交
-
-
由 xu98bin 提交于
* auto parallel bf16
-
- 28 12月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] adapt for clip * fix unittest * enable_static * fix dist_fill_constant_batch_size_like * fix process_mesh.shape * update cond of modifying shape_list
-
- 27 12月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
* fix input order * add unittest * update cmakelist
-
由 zhaoyingli 提交于
* [AutoParallel] quantization pass support export * support subgraph * move_presist_var_to_global_block * update unittest * fix ci-coverage * fix codestyle * fix fake_dequantize_op * remove unused var * fix ci error and aprroval error * add unittest for fp16 in test_dequant_linear * replace mutable data * fix unittest in non-cuda-core * fix unittest Co-authored-by: Ncarryyu <569782149@qq.com> Co-authored-by: Nwufeisheng <wfs1997@163.com>
-
- 26 12月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Fix a bug
-
- 19 12月, 2022 1 次提交
-
-
由 zqw_1997 提交于
* remove fluid.mul * remove mul in __all__
-
- 14 12月, 2022 2 次提交
-
-
由 feifei-111 提交于
* del array_write & array_read * fix import err * fix import err * fix example codes
-
由 zhaoyingli 提交于
* [AutoParallel] recompute tuning * fix conflict * update comment * bug fix * update rc algo * tiny fix * fix clear process_group * remove comment * update segment print * fix import OpRole * adapt amp pass and grad_clip pass for opt_tuner * update tuning config * fix import * annotate recompute info on ops and upgrade recompute pass * add op_namescope for seed op * record reserved vars * fix recompute var's dist_attr * fix strategy unittest * adapt for fp16 * update unittest * revert copy opt * update unittest * rename set_recompute_segments * fix unittest
-
- 12 12月, 2022 1 次提交
-
-
由 feifei-111 提交于
-
- 08 12月, 2022 2 次提交
-
-
由 Jianghai 提交于
* add cluster_partition and device_meshes to process_meshes funcs * add unitest
-
由 Ghost Screaming 提交于
* Merge branch 'reduce_sum' of https://github.com/GhostScreaming/Paddle into mine_fluid_clean_common. * Fix some bugs. * Clean APIs in python/paddle/fluid/layers/control_flow.py * Polish code style. * Change API. * Fix some bugs. * Fix some bugs.
-
- 06 12月, 2022 1 次提交
-
-
由 kangguangli 提交于
* remove layers.matmul in nets.py * remove layers.matmul in rnn_impl/test_quantization_pass/auto_parallel_gpt_model/test_auto_parallel_completion_gpt * remove layers.matmul in other files * fix * fix * remove layers.matmul itself * remove ref in CMakeLists.txt and tools directory * remove matmul in fluid.layers.nn.py * remove matmul in fluid.dygraph.rnn.py && resotre test_matmul_op.py * replace matmul in fluid.dygraph.rnn.py && clean api_test in test_matmul_op.py * fix error && restore empty test_auto_search_dist_matmul_op.py * fix check in test_auto_parallel_partitioner.py * fix test_dist_matmul && test_flags_mkldnn_ops_on_off * fix test_fused_attention_op_xpu.py && test_matmul_op_xpu.py * remove test_auto_search_dist_matmul_op.py * remove layers.matmul in auto_parallel_gpt_model.py && fix doc in fluid/io.py * fix for matmul_grad * fix codestyle * fix codestyle * resolve conflicts error * restore unit test file but not compiled it for later remove * fix codestyle * fix wrong unittest skip * fix unittest delete * fix scale cost * fix scale cost * resolve conflicts error * resolve conflicts error Co-authored-by: Njakpiase <jakpia21@gmail.com>
-
- 02 12月, 2022 2 次提交
-
-
由 LiYuRio 提交于
-
由 cyber-pioneer 提交于
move paddle.fluid.layers.tensor.create_parameter to paddle.tensor.creation.create_parameter (#48579)
-
- 01 12月, 2022 1 次提交
-
-
由 heyanru 提交于
-
- 29 11月, 2022 1 次提交
-
-
由 caozhou 提交于
* add pattern match * add unittest
-
- 28 11月, 2022 1 次提交
-
-
由 caozhou 提交于
* add pattern for auto search * add unittest
-
- 25 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* add isort config * isort all files
-
- 24 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
-
- 23 11月, 2022 1 次提交
-
-
由 xiaoguoguo626807 提交于
* remove transpose * codestyle check * modify CI_STATIC * modify CI_STATIC * modify enable static() * remove unused import * fix conflict about stack * fix conflict about stack * fix conflict about stack * endless conflict
-
- 22 11月, 2022 1 次提交
-
-
由 caozhou 提交于
-
- 18 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] selective recompute * add cmakelist
-