- 19 4月, 2023 1 次提交
-
-
由 kangguangli 提交于
* fix * fix * fix * fix * fix * fix fuse group order
-
- 18 4月, 2023 2 次提交
-
-
由 张春乔 提交于
-
由 Meteor Liu 提交于
* rename _varbase_creator as create_tensor * rename _varbase_creator as create_tensor
-
- 17 4月, 2023 7 次提交
-
-
由 Yulong Ao 提交于
-
由 LiYuRio 提交于
* cherry-pick fleet executor from 2.4 * fix test case
-
由 Chitsing KUI 提交于
* add random control for fused dropout add * add __init__
-
由 Kim Yann 提交于
-
由 张春乔 提交于
* remove hccl in .py files * remove ascend in setup.py.in * remove ascend in setup.py
-
由 Haohongxiang 提交于
-
由 caozhou 提交于
* add o2 tune * add unittest * fix error * set unittest timeout
-
- 14 4月, 2023 2 次提交
-
-
由 Feiyu Chan 提交于
1. modify set_value op, use Scalars to represent attr `values`, instead of a bunch of attributs of various types; (#52408) 2. add program converter and set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats(the differences are mainly some operators with incompatible updates in the definition); 3. program version and operator version map now are always saved when serializing `paddle::framework::ProgramDesc` to identify the version; 3. provide an option `legacy_format=false` in serialization of `paddle::framework::ProgramDesc`, it decided whether to convert ProgramDesc back to a legacy format, which is compatible for paddle 2.4.2 or earlier versions to load and execute; 4. deserialization of `paddle::framework::ProgramDesc` is now automatically detecting whether the bytes it receives is in legacy format(contains any of the operators that has been incompatibly updated and have any attribute of type `Scalar`) and convert it to new format. But if you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. Though it is not recommended, it can be used for the purpose of testing.
-
由 ronnywang 提交于
-
- 13 4月, 2023 1 次提交
-
-
由 TaoTao Li 提交于
* add auto parallel tuner options in launch * add ut for launch in auto_parallel tuner fix code format * fix ci-converage
-
- 12 4月, 2023 4 次提交
-
-
由 ShenLiang 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Speedup the completion process * [Auto Parallel] Skip the property of dist_context when deepcopying * [Auto Parallel] Remove the unnecessary print * [Auto Parallel] Move some changes from 2.4 branch to develop * Update engine.py * [Auto Parallel] Fix a bug
-
由 张春乔 提交于
* remove c_comm_init_hccl_op.cc and c_gen_hccl_id_op.cc * remove gen_hccl_id_op.cc
-
由 CHANGer 提交于
-
- 11 4月, 2023 3 次提交
-
-
由 wangxiaoning 提交于
-
由 wuhuachaocoding 提交于
-
由 risemeup1 提交于
-
- 10 4月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* unique id for mesh * rng ctrl * support dropout * register op * adopt for recompute * update unitest * support pp
-
- 09 4月, 2023 2 次提交
-
-
由 Chitsing KUI 提交于
-
由 ShenLiang 提交于
* add seed control * fix bug
-
- 07 4月, 2023 3 次提交
-
-
由 kangguangli 提交于
* remove run_program * remove FLAGS_USE_STANDALONE_EXECUTOR
-
由 TaoTao Li 提交于
fix merge conflicts
-
由 Roc 提交于
* fix mkdir * update
-
- 06 4月, 2023 2 次提交
-
-
由 Nyakku Shigure 提交于
-
由 Kim Yann 提交于
* rem is_compiled_with_npu * rem nup related code * make lint happy * rem test * remove some tests * Update grad_scaler.py * fix an error
-
- 04 4月, 2023 2 次提交
-
-
由 Tian 提交于
-
由 LoneRanger 提交于
* relocate debugger.py * fix bug * fix bug * fix bug * fix bug
-
- 03 4月, 2023 1 次提交
-
-
由 Kim Yann 提交于
* rem is_compiled_with_mlu * fix some mlu_place and mlu_device_coount * make lint happy
-
- 31 3月, 2023 2 次提交
-
-
由 zhenhailiu 提交于
* gather with doc * resolve comment * polish * polish * code style * polish doc * add_test * polish * polish * add test check * add test check * polish * polish * polish * polish * fix_time_out * polish * fix timeout * fix_timeout * polish * polish * polish * polish * polish
-
由 张春乔 提交于
* autofix Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 30 3月, 2023 6 次提交
-
-
由 Ghost Screaming 提交于
* Support ignore_index for c_softmax_with_cross_entropy_op. * Polish code. Remove useless comments and add Testcase. * Polish code for TestCase. * Polish code. * Polish code style. * Polish code. * Change loss calculation formula and ignore_index dtype. * Polish TestCase. * Fix bug of c_softmax_with_cross_entropy_op_xpu_op. Attribute 'ignore_index' dtype is int64_t.
-
由 kangguangli 提交于
* remove sync_calc_stream and sync_comm_stream * fix ci bug * fix * fix * fix
-
由 ShenLiang 提交于
* fix bug in proto * add utest
-
由 Guoxia Wang 提交于
* support python object input data broadcast for model parallel * add unittest * fix * fix concat 0D tensor * fix codestyle
-
由 cyberslack_lee 提交于
[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140) * codestyle c416 c417 * fix error * fix inc * unify all C4 rules into one * fix inc --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 wanghuancoder 提交于
* delete old dygraph op test
-
- 29 3月, 2023 1 次提交
-
-
由 Roc 提交于
-