- 16 9月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* unify core_avx and core_noavx * fix except error * revert mac compile logic * revert dylib to so * add core_noavx branch * remove core_noavx * replace paddle_core by lib paddle * polish var name * replace paddle_core by libpaddle * update custom device commit * polish code by comments
-
由 wuhuachaocoding 提交于
* refactor mp. * update setup.py. * update mp_layers.py for compatibility. * add documents for mp_layers.py * update init.py * update collective.py. * update. * update mp_ops.py * update. * update code style. * update code style.
-
由 Wen Sun 提交于
-
- 15 9月, 2022 7 次提交
-
-
由 Hui Zhang 提交于
* skip forward save * fix bug * more ci for jit skip forward
-
由 Yulong Ao 提交于
* [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Add the serialization process for dist attrs * [Auto Parallel] Remove unnecessary comments * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix the code style * [Auto Parallel] Remove unnecessary impls * [Auto Parallel] Fix the importing error * [Auto Parallel] Fix the copy from bugs of op dist attr * [Auto Parallel] Replace the use of constexpr if * [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh * [Auto Parallel] Change API of the completion unittest * [Auto Parallel] Fix the bug when set_attr an int * [Auto Parallel] Add the unittest for the serialization * [Auto Parallel] Add some unit tests * [Auto Paralle] Unify the strategy * [Auto Parallel] Improve the engine api * [Auto Parallel] Reset the changes made to the framework * [Auto Parallel] Change the engine unittest * [Auto Parallel] Update API of the completion and partitioner * [Auto Parallel] Update unit tests using engine api * update shard annotation * [Auto Parallel] Remove the modifications of other modules * [Auto Parallel] Add docs for APIs * add new strategy * [Auto Parallel] Replace the logger * [Auto Parallel] Restore the test_program.py * [Auto Parallel] Change the import rules * [Auto Parallel] Add the examples for Engine * [Auto Parallel] Do some minor changes * [Auto Parallel] Remove yaml dependency * [Auto Parallel] Fix the unittests * add valid after train * bug fix Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com> Co-authored-by: Ncaozhou <caozhou@radi.ac.cn> Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com>
-
由 Guoxia Wang 提交于
-
由 Nyakku Shigure 提交于
-
由 WangZhen 提交于
Support 0 shapes input Tensor for MKL slice kernel
-
由 weishengying 提交于
-
由 wanghuancoder 提交于
* saved_tensors_hooks
-
- 14 9月, 2022 17 次提交
-
-
由 Nyakku Shigure 提交于
* trim trailing whitespace * fix `.cmake-format.py` * revert npu ut changes, avoid npu ci error
-
由 JingZhuangzhuang 提交于
* merge python lib * Update third_party.cmake * Update CMakeLists.txt
-
由 pangyoki 提交于
-
由 Charles-hit 提交于
* add convert rules for fill_any_like op in paddle science * add unit test for fill_any_like op in paddle science * modify fill_any_like convert rule * modify fill_any_like convert rule dtype
-
由 Li Min 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle] trim trailing whitespace in .md and .rst * empty commit, test=document_fix
-
由 Zhang Jun 提交于
* update * update * update
-
由 Charles-hit 提交于
-
由 JZ-LIANG 提交于
* bugfix (#45332) * dist embedding support lookup table v1 * add unitest * customize wait_comm * group gradients * bugfix * update program
-
由 Thunderbrook 提交于
-
由 zhangbo9674 提交于
* support bfloat16 for amp_decorate * add check_finite for bf16 * fix bug * add ut * add ut * refine code
-
由 Xinger 提交于
-
由 xiongkun 提交于
* fix select_input with different shape errors: 1. select_input_with_buildin_type directly return non-undefinedvar branch when meeting undefined var 2. the output shape of select_input is inferred from inputs. * reverse the logic in select_input
-
由 Charles-hit 提交于
-
由 Xiaoxu Chen 提交于
* add reduce_mean,reduce_sum primitive ops * add ne_p gt_p primitive operators * add ge_p abs_p primitive oparators
-
由 Chenxiao Niu 提交于
-
由 zhangkaihuo 提交于
-
- 13 9月, 2022 9 次提交
-
-
由 Charles-hit 提交于
-
由 Charles-hit 提交于
-
由 Charles-hit 提交于
-
由 Charles-hit 提交于
-
由 pangyoki 提交于
* optimize executor log * delete log in new exe * add log for old executor * use LOG_FIRST_N(INFO, 1)
-
由 Charles-hit 提交于
-
由 feifei-111 提交于
* support deco * fix deco ast type * arg_str * 1 * support callable deco * code style * codestyle * test_error * fix decos in another file * recover conflict codes
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * adapt for nn fl-ps
-
由 Ruibiao Chen 提交于
* Enable standalone executor for single-GPU training * Disable CompiledProgram._graph * Fix CI errors
-
- 10 9月, 2022 1 次提交
-
-
由 qipengh 提交于
-
- 09 9月, 2022 3 次提交
-
-
由 zhaoyingli 提交于
* adapt gradient merge * fix op_role * fix strategy
-
由 zhangbo9674 提交于
* add scope cache & reuse * add gc scope for end of each train step * del scope reuse for jit * refine code * test
-
由 Yuang Liu 提交于
-