- 19 9月, 2022 8 次提交
-
-
由 Yulong Ao 提交于
* [AutoParallel] adapt gradient merge pass (#45915) * adapt gradient merge * fix op_role * fix strategy * [Auto Parallel] Gradient Fuse Allreduce (#45643) * bugfix (#45332) * dist embedding support lookup table v1 * add unitest * customize wait_comm * group gradients * bugfix * update program * [Auto Parallel] Improve the APIs (#45776) * [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Add the serialization process for dist attrs * [Auto Parallel] Remove unnecessary comments * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix the code style * [Auto Parallel] Remove unnecessary impls * [Auto Parallel] Fix the importing error * [Auto Parallel] Fix the copy from bugs of op dist attr * [Auto Parallel] Replace the use of constexpr if * [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh * [Auto Parallel] Change API of the completion unittest * [Auto Parallel] Fix the bug when set_attr an int * [Auto Parallel] Add the unittest for the serialization * [Auto Parallel] Add some unit tests * [Auto Paralle] Unify the strategy * [Auto Parallel] Improve the engine api * [Auto Parallel] Reset the changes made to the framework * [Auto Parallel] Change the engine unittest * [Auto Parallel] Update API of the completion and partitioner * [Auto Parallel] Update unit tests using engine api * update shard annotation * [Auto Parallel] Remove the modifications of other modules * [Auto Parallel] Add docs for APIs * add new strategy * [Auto Parallel] Replace the logger * [Auto Parallel] Restore the test_program.py * [Auto Parallel] Change the import rules * [Auto Parallel] Add the examples for Engine * [Auto Parallel] Do some minor changes * [Auto Parallel] Remove yaml dependency * [Auto Parallel] Fix the unittests * add valid after train * bug fix Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com> Co-authored-by: Ncaozhou <caozhou@radi.ac.cn> Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com> * [Auto Parallel] Bugfix allreduce fuse for MP (#46086) * bugfix * bugfix * typos fixed * update strategy (#46138) Co-authored-by: Nzhaoyingli <86812880+zhaoyinglia@users.noreply.github.com> Co-authored-by: NJZ-LIANG <jianzhongliang10@gmail.com> Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com> Co-authored-by: Ncaozhou <caozhou@radi.ac.cn> Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com>
-
由 sneaxiy 提交于
-
由 Jiabin Yang 提交于
* make eager log readable * fix compile error * recover test * invoke ci again
-
由 xiaoxiaohehe001 提交于
-
由 Chen Weihang 提交于
This reverts commit c252b1de.
-
由 ShenLiang 提交于
-
由 MayYouBeProsperous 提交于
fix bug for TransformedDistribution
-
由 Chen Weihang 提交于
* unify core_avx and core_noavx * fix except error * revert mac compile logic * revert dylib to so * add core_noavx branch * remove core_noavx * replace paddle_core by lib paddle * polish var name * replace paddle_core by libpaddle * update custom device commit * polish code by comments
-
- 17 9月, 2022 2 次提交
-
-
由 Yuanle Liu 提交于
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fix gloo compile warning * adapt for nn fl-ps
-
- 16 9月, 2022 4 次提交
-
-
由 JingZhuangzhuang 提交于
-
由 Charles-hit 提交于
(cherry-pick)Fix split infershape in static mode and add convert rules for fill_any_like op (#46079) * Fix split bug in static mode (#45906) * fix split bug in static mode * modify code style * modify code style * add unit test for split * add convert rules for fill_any_like op in paddle science (#45985) * add convert rules for fill_any_like op in paddle science * add unit test for fill_any_like op in paddle science * modify fill_any_like convert rule * modify fill_any_like convert rule dtype
-
由 Hui Zhang 提交于
* skip forward save * fix bug * more ci for jit skip forward
-
由 Chen Weihang 提交于
* normalize yaml file name (#45894) * Clear extra attributes of activation op in OpMaker (#45772) * clear extra attr of activation op in opmaker * fix syntax bug * fix mkldnn kernel * fix merge conflict * fix bug * [PHI] Normalize yaml op label (#45976) * normalize yaml op label * revert op_compat yaml change * fix prelu and rnn compat problem * replace api by op * support assign op backward refuse forward (#45879) * normize yaml backward op label (#46028) Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com> Co-authored-by: NCharles-hit <56987902+Charles-hit@users.noreply.github.com>
-
- 15 9月, 2022 8 次提交
-
-
由 xiongkun 提交于
[ Dy2Static ] Fix bugs when select inputs meeting different shape or undefined-var (#45916) (#46020) * fix select_input with different shape errors: 1. select_input_with_buildin_type directly return non-undefinedvar branch when meeting undefined var 2. the output shape of select_input is inferred from inputs. * reverse the logic in select_input
-
由 WangZhen 提交于
Support 0 shapes input Tensor for MKL slice kernel
-
由 weishengying 提交于
-
由 zyfncg 提交于
* delete eigen header in data_type.h * fix complie bug * refactor
-
由 Charles-hit 提交于
-
由 JingZhuangzhuang 提交于
-
由 Chen Weihang 提交于
* fix arm fp16 compile error * polish macro impl
-
由 Zhang Jun 提交于
* Support dynamic shape in multiclass_nms3 Plugin for Paddle-TensorRT.
-
- 14 9月, 2022 7 次提交
-
-
由 JingZhuangzhuang 提交于
-
由 JingZhuangzhuang 提交于
* cherry pick delay tensorrt log * Update trt_plugin.h
-
由 Leo Chen 提交于
-
由 engineer1109 提交于
修复cuda11.7编译出错的问题
-
由 ykkk2333 提交于
-
由 wenbin 提交于
-
由 pangyoki 提交于
-
- 13 9月, 2022 2 次提交
-
-
由 JingZhuangzhuang 提交于
-
由 Ruibiao Chen 提交于
* Allow manaully set py_reader name in standalone executor * Fix CI errors
-
- 09 9月, 2022 9 次提交
-
-
由 Charles-hit 提交于
-
由 zhaoyingli 提交于
* adapt lazy init and fix pass * add unittest * update comment * fix amp and sharding * remove clip_by_norm
-
由 ronnywang 提交于
* [CustomDevice] add dy2static support * update
-
由 xiongkun 提交于
* convert_call support staticmethod for class. * while support for python container. It is convenient to convert more dynamic graph codes into static graphs. * cond support python container * add unittest for staticmethod convert_call * fix bugs * add unittest for item interface * fix bugs * change to np.testing.assert_allclose * code format * fix comments. * code format
-
由 Chen Weihang 提交于
* add fusion dir and fuse_softmax_mask kernel * remove fusion kernel dir * migrate infershape * fix code errror
-
由 WangZhen 提交于
* Enhance slice to support 0 dims Tensor * Add UT
-
由 Chenxiao Niu 提交于
-
由 xiaoguoguo626807 提交于
* modify slice infershape * code style * modify slice_unittest
-
由 Chen Weihang 提交于
* simplify size op * trans to cuda manuly * fix copy error
-