- 19 Sep, 2022 3 commits
-
-
Committed by minghaoBD
Co-authored-by: RichardWooSJTU <37864677+RichardWooSJTU@users.noreply.github.com>
-
Committed by Yulong Ao
* [AutoParallel] adapt gradient merge pass (#45915) * adapt gradient merge * fix op_role * fix strategy * [Auto Parallel] Gradient Fuse Allreduce (#45643) * bugfix (#45332) * dist embedding support lookup table v1 * add unittest * customize wait_comm * group gradients * bugfix * update program * [Auto Parallel] Improve the APIs (#45776) * [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Use c++ dist attr in the completion process * [Auto Parallel] Add minor changes * [Auto Parallel] Add the serialization process for dist attrs * [Auto Parallel] Remove unnecessary comments * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix the code style * [Auto Parallel] Remove unnecessary impls * [Auto Parallel] Fix the importing error * [Auto Parallel] Fix the copy from bugs of op dist attr * [Auto Parallel] Replace the use of constexpr if * [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh * [Auto Parallel] Change API of the completion unittest * [Auto Parallel] Fix the bug when set_attr an int * [Auto Parallel] Add the unittest for the serialization * [Auto Parallel] Add some unit tests * [Auto Parallel] Unify the strategy * [Auto Parallel] Improve the engine api * [Auto Parallel] Reset the changes made to the framework * [Auto Parallel] Change the engine unittest * [Auto Parallel] Update API of the completion and partitioner * [Auto Parallel] Update unit tests using engine api * update shard annotation * [Auto Parallel] Remove the modifications of other modules * [Auto Parallel] Add docs for APIs * add new strategy * [Auto Parallel] Replace the logger * [Auto Parallel] Restore the test_program.py * [Auto Parallel] Change the import rules * [Auto Parallel] Add the examples for Engine * [Auto Parallel] Do some minor changes * [Auto Parallel] Remove yaml dependency * [Auto Parallel] Fix the unittests * add valid after train * bug fix Co-authored-by: zhaoyingli <zhaoyingli@baidu.com>
Co-authored-by: caozhou <caozhou@radi.ac.cn> Co-authored-by: caozhou <48191911+Caozhou1995@users.noreply.github.com> * [Auto Parallel] Bugfix allreduce fuse for MP (#46086) * bugfix * bugfix * typos fixed * update strategy (#46138) Co-authored-by: zhaoyingli <86812880+zhaoyinglia@users.noreply.github.com> Co-authored-by: JZ-LIANG <jianzhongliang10@gmail.com> Co-authored-by: zhaoyingli <zhaoyingli@baidu.com> Co-authored-by: caozhou <caozhou@radi.ac.cn> Co-authored-by: caozhou <48191911+Caozhou1995@users.noreply.github.com>
-
Committed by Chen Weihang
* unify core_avx and core_noavx * fix except error * revert mac compile logic * revert dylib to so * add core_noavx branch * remove core_noavx * replace paddle_core by lib paddle * polish var name * replace paddle_core by libpaddle * update custom device commit * polish code by comments
-
- 16 Sep, 2022 3 commits
-
-
Committed by Charles-hit
(cherry-pick) Fix split infershape in static mode and add convert rules for fill_any_like op (#46079) * Fix split bug in static mode (#45906) * fix split bug in static mode * modify code style * modify code style * add unit test for split * add convert rules for fill_any_like op in paddle science (#45985) * add convert rules for fill_any_like op in paddle science * add unit test for fill_any_like op in paddle science * modify fill_any_like convert rule * modify fill_any_like convert rule dtype
-
Committed by Hui Zhang
* skip forward save * fix bug * more ci for jit skip forward
-
Committed by Chen Weihang
* normalize yaml file name (#45894) * Clear extra attributes of activation op in OpMaker (#45772) * clear extra attr of activation op in opmaker * fix syntax bug * fix mkldnn kernel * fix merge conflict * fix bug * [PHI] Normalize yaml op label (#45976) * normalize yaml op label * revert op_compat yaml change * fix prelu and rnn compat problem * replace api by op * support assign op backward refuse forward (#45879) * normalize yaml backward op label (#46028) Co-authored-by: zyfncg <zhangyunfei07@baidu.com> Co-authored-by: Charles-hit <56987902+Charles-hit@users.noreply.github.com>
-
- 15 Sep, 2022 4 commits
-
-
Committed by xiongkun
[ Dy2Static ] Fix bugs when select inputs meeting different shape or undefined-var (#45916) (#46020) * fix select_input with different shape errors: 1. select_input_with_buildin_type directly return non-undefinedvar branch when meeting undefined var 2. the output shape of select_input is inferred from inputs. * reverse the logic in select_input
-
Committed by WangZhen
Support 0 shapes input Tensor for MKL slice kernel
-
Committed by weishengying
-
Committed by Zhang Jun
* Support dynamic shape in multiclass_nms3 Plugin for Paddle-TensorRT.
-
- 14 Sep, 2022 2 commits
-
-
Committed by JingZhuangzhuang
-
Committed by pangyoki
-
- 13 Sep, 2022 1 commit
-
-
Committed by Ruibiao Chen
* Allow manually setting py_reader name in standalone executor * Fix CI errors
-
- 09 Sep, 2022 6 commits
-
-
Committed by Charles-hit
-
Committed by zhaoyingli
* adapt lazy init and fix pass * add unittest * update comment * fix amp and sharding * remove clip_by_norm
-
Committed by xiongkun
* convert_call support staticmethod for class. * while support for python container. It is convenient to convert more dynamic graph codes into static graphs. * cond support python container * add unittest for staticmethod convert_call * fix bugs * add unittest for item interface * fix bugs * change to np.testing.assert_allclose * code format * fix comments. * code format
-
Committed by WangZhen
* Enhance slice to support 0 dims Tensor * Add UT
-
Committed by xiaoguoguo626807
* modify slice infershape * code style * modify slice_unittest
-
Committed by LiYuRio
-
- 08 Sep, 2022 9 commits
-
-
Committed by xiongkun
* 1. fix non-local error while dealing push_pop names 2. escape "'" in push_pop_names to avoid syntax errors. 3. unified the non-local stmt creation processes in getter and setter. 4. split the nonlocal_names and getter/setter names. * fix bugs * 1. revert setter and getter, push_pop_names must have non-local * fix bugs. * code format
-
Committed by wenbin
-
Committed by Charles-hit
* support more op for high level * add unit test for high level op * remove unnecessary comments
-
Committed by taixiurong
* add gemm_epilogue * xpu-paddlepaddle-40 [Task] fused_gemm_epilogue support test=kunlun
-
Committed by pangyoki
-
Committed by OccupyMars2025
-
Committed by Leo Chen
-
Committed by LiYuRio
-
Committed by Guanghua Yu
-
- 07 Sep, 2022 11 commits
-
-
Committed by houj04
* [XPU] update xdnn to 0906. test=kunlun * [XPU] update xdnn to 0907. test=kunlun
-
Committed by Chen Weihang
* fix test_custom_relu_op_jit windows error * polish assert format
-
Committed by WangZhen
Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose
-
Committed by Charles-hit
* replace fill_zeros_like op with fill_any_like op in backward.py and tensor.py * Remove unnecessary comments * modify create op_desc param
-
Committed by Yuang Liu
-
Committed by wenbin
* first commit * convert done * correct format * layernorm_shift_partition * correct convert * redefine plugin * runnable * bug fix * modify ShiftPartitionPattern * correct * add UT * modify ut * compile * modify enforce * modify UT
-
Committed by caozhou
* support iterable dataset for auto parallel * add split_data proto * fix unittest bug * fix recompute bug * update cmake
-
Committed by qipengh
* [MLU] fix sync_bn of mlu and add unittests * [MLU] remove redundant code of pytest
-
Committed by Weilong Wu
-
Committed by xiongkun
* add compile-time infermeta logic for stack infermeta. * add unittest for stack infermeta where -1 exists in shapes. * remove backward changes.
-
Committed by zhangkaihuo
-
- 06 Sep, 2022 1 commit
-
-
Committed by YuanRisheng
* add tensor array * fix ci bugs * fix ci bugs * fix ci bugs * fix ci bugs * update by comment * update code
-