- 09 2月, 2023 1 次提交
-
-
由 yuehuayingxueluo 提交于
* fix the processing order of passes in pass_base.py * fix processing order * add _PASS_PROCESS_ORDER_LIST * delete some pass in _PASS_PROCESS_ORDER_LIST * add assert in pass_base.py * remove fuse_optimizer * add _fusion_opt_list_rule * add test_pass_base_list.py * fix some bug * add fused_attention * add some passes to list * fix ci bug * fix ci bug
-
- 01 2月, 2023 1 次提交
-
-
由 wangxiaoning 提交于
-
- 30 1月, 2023 1 次提交
-
-
由 zmxdream 提交于
* add set slot_num for psgpuwraper (#177) * add set slot_num_for_pull_feature for psgpuwarper * Add get_epoch_finish python interface (#182) * add get_epoch_finish interface * add return * delete return * add unzip op (#183) * fix miss key for error dataset (#186) * fix miss key for error dataset * fix miss key for error dataset Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * add excluded_train_pair and infer_node_type (#187) * support return of degree (#188) * fix task stuck in barrier (#189) Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * check node/feature format when loading (#190) * check node&feature format when loading * check node&feature format when loading (2£ (2) * degrade log (#191) * [PGLBOX]fix conflict * [PGLBOX]fix conflict * [PGLBOX]replace LodTensor with phi::DenseTensor * [PGLBOX]fix gpu_primitives.h include path * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip ut * [PGLBOX]fix unzip ut * [PGLBOX]fix code style * [PGLBOX]fix code style * [PGLBOX]fix code style * fix code style * fix code style * fix unzip ut * fix unzip ut * fix unzip ut * fix unzip * fix code stype * add ut * add c++ ut & fix train_mode_ set * fix load into memory * fix c++ ut * fix c++ ut * fix c++ ut * fix c++ ut * fix code style * fix collective * fix unzip_op.cc * fix barrier * fix code style * fix barrier * fix barrier * fix code styple * fix unzip * add unzip.py * add unzip.py * fix unzip.py --------- Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: NSiming Dai <908660116@qq.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
- 16 1月, 2023 2 次提交
-
-
由 wangxiaoning 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style * [Auto Parallel] Clear some fluid APIs
-
- 12 1月, 2023 1 次提交
-
-
由 zhangkaihuo 提交于
-
- 10 1月, 2023 2 次提交
- 09 1月, 2023 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParalle] balancing the calculation of global_norm in data parallel * fix unittest * update cond pure_data_parallel
-
- 07 1月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Enable standalone executor for fleet training * Update code * Replace use_standalone_executor utils in auto parallel * Update code * Diable standalone executor for test_pass_sharding * Update code * Set sequential run for auto parallel * Fix dist_attr bug * Set sequential run for auto parallel
-
- 06 1月, 2023 2 次提交
-
-
由 Guanghua Yu 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 04 1月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* remove deps and prior comm * grad comm fuse * add deps for amp&global norm * stage2 broadcast prior deps * stage2 grad overlap * stream_analyzer bugfix * overlap enable * dep op namescope * depend support multiple inputs * check finite deps * stage2 param comm overlap * Set kD2HStream * grad comm hierarchical * grad comm hierarchical * new unitest Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
- 30 12月, 2022 2 次提交
-
-
由 zhouzj 提交于
-
由 Sanbu 提交于
* 1219 * temporarily change the num_diff_files limit, test=document_fix * Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20. * for codestyle * remove duplicate license * `static mode` -> `static graph mode` * Update hybrid_parallel_inference.py * Update layer_function_generator.py * Update manipulation.py * reset Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 29 12月, 2022 1 次提交
-
-
由 xu98bin 提交于
* auto parallel bf16
-
- 28 12月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
-
- 27 12月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] quantization pass support export * support subgraph * move_presist_var_to_global_block * update unittest * fix ci-coverage * fix codestyle * fix fake_dequantize_op * remove unused var * fix ci error and aprroval error * add unittest for fp16 in test_dequant_linear * replace mutable data * fix unittest in non-cuda-core * fix unittest Co-authored-by: Ncarryyu <569782149@qq.com> Co-authored-by: Nwufeisheng <wfs1997@163.com>
-
- 26 12月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Fix a bug
-
- 23 12月, 2022 1 次提交
-
-
由 JZ-LIANG 提交于
* memcpy overlap * memcpy newexe
-
- 21 12月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] fix update complete and add_to_collection * fix annotation * fix amp fill_constant dist_attr
-
- 14 12月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] recompute tuning * fix conflict * update comment * bug fix * update rc algo * tiny fix * fix clear process_group * remove comment * update segment print * fix import OpRole * adapt amp pass and grad_clip pass for opt_tuner * update tuning config * fix import * annotate recompute info on ops and upgrade recompute pass * add op_namescope for seed op * record reserved vars * fix recompute var's dist_attr * fix strategy unittest * adapt for fp16 * update unittest * revert copy opt * update unittest * rename set_recompute_segments * fix unittest
-
- 12 12月, 2022 1 次提交
-
-
由 feifei-111 提交于
-
- 09 12月, 2022 1 次提交
-
-
由 cyber-pioneer 提交于
-
- 02 12月, 2022 1 次提交
-
-
由 Shijie 提交于
* Fix fuse_gemm_epilogue * update tests * Update CMakeLists.txt * Update CMakeLists.txt * Update CMakeLists.txt * fix random seed * use assert_allclose * Update test_dist_fuse_gemm_epilogue_pass.py * Update cpp_pass.py * Update test_dist_fuse_gemm_epilogue_pass.py * fix codestyle * update seed and atol
-
- 01 12月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
-
- 29 11月, 2022 2 次提交
-
-
由 Nyakku Shigure 提交于
* isort all files * revert conflicting files * revert conflicting files * revert conflicting files
-
由 JZ-LIANG 提交于
* add depend * add origin amp files * fp16 distinguish None & False * engine log * dp add deps for graph exe * add dep for grad clip * dep ops in comm stream * unitest
-
- 28 11月, 2022 1 次提交
-
-
由 wangguanqun 提交于
* remove fluid * update public * core * public * public1 * ci
-
- 25 11月, 2022 1 次提交
-
-
由 JZ-LIANG 提交于
-
- 22 11月, 2022 1 次提交
-
-
由 JZ-LIANG 提交于
* add depend * fp16 pass distinguish None & False * engine log
-
- 18 11月, 2022 2 次提交
-
-
由 yuehuayingxueluo 提交于
* clear fluid apis in fleet and passes * fix model.py * fix model.py * fix cpp_pass.py
-
由 zhaoyingli 提交于
* [AutoParallel] selective recompute * add cmakelist
-
- 14 11月, 2022 2 次提交
-
-
由 Nyakku Shigure 提交于
[CodeStyle][F821] fix undefined variables due to missing imports, misspelled variable names (#47899) * `hann` -> `_hann` * `false` -> `False` * a missing passed argument `reduce_all` * some missing imports * `device_type` -> `heter_device_type` * `PKVClient` -> `KVClient` * fix some typos and missing imports
-
由 JZ-LIANG 提交于
* fixed cond state * fixed cond state
-
- 10 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* fix insert concat op * fix fp16 assert
-
- 09 11月, 2022 1 次提交
-
-
由 Tony Cao 提交于
* fix flake8 CodeStyle E266 * fix comments
-
- 08 11月, 2022 2 次提交
-
-
由 JZ-LIANG 提交于
[Auto Parallel] Sharding Optimization:Partition Algorithm & Stage2 Parameter Bucket communication (#47180) * partition param by order * add logging * reorder opt * config * stage2 bucket * update unitest
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition * fix an increment
-
- 07 11月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* fp16 pass support assign op * choose assign op exec mode * add unittest * add cmakelist
-