Commits · 6833ecfe94272cdf97bfaa667d100d3f6318ba49 · PaddlePaddle / Paddle

14 Sep, 2022 4 commits
- S
  Fix DistributedFusedLAMB NaN problem (#46011) · 6833ecfe
  Committed by sneaxiy on Sep 14, 2022
```
* fix distributed_fused_lamb nan

* remove CUDA_ASSERT
```
  6833ecfe
- Y
  
  Simplify the codes of conv. (#45966) · 3a5b5048
  Committed by Yiqun Liu on Sep 14, 2022
  
  3a5b5048
- X
  add mean,sum,ge,gt,ne,abs primitive operators for supporting deepxde (#45888) · 62176f63
  Committed by Xiaoxu Chen on Sep 14, 2022
```
* add reduce_mean,reduce_sum primitive ops

* add ne_p gt_p primitive operators

* add ge_p abs_p primitive operators
```
  62176f63
- C
  
  [MLU] add mergedAdam kernel. (#45965) · bf6ec262
  Committed by Chenxiao Niu on Sep 14, 2022
  
  bf6ec262
13 Sep, 2022 3 commits
- R
  
  [CustomDevice] register load_combine op (#45980) · b2122239
  Committed by ronnywang on Sep 13, 2022
  
  b2122239
- Z
  Clear extra attributes of activation op in OpMaker (#45772) · c7b373f2
  Committed by zyfncg on Sep 13, 2022
```
* clear extra attr of activation op in opmaker

* fix syntax bug

* fix mkldnn kernel

* fix merge conflict

* fix bug
```
  c7b373f2
- Y
  
  migrate squeeze kernel to phi, test=kunlun (#45968) · d3366853
  Committed by ykkk2333 on Sep 13, 2022
  
  d3366853
10 Sep, 2022 1 commit
- Q
  
  [MLU] fix compute error of dropout op (#45923) · 36915474
  Committed by qipengh on Sep 10, 2022
  
  36915474
09 Sep, 2022 7 commits
- D
  make memcpy op to support custom_device (#45918) · 1ed8e9b8
  Committed by duanyanhui on Sep 09, 2022
```
* make memcpy op to support custom device

* fix bug
```
  1ed8e9b8
- L
  [new-exe] convert fused_all_reduce_op_handle to program (#45774) · e755c07e
  Committed by Leo Chen on Sep 09, 2022
```
* add operator<< for BuildStrategy

* add fake_coalesce

* fit allreduce mode for new_exe

* remove debug code

* follow comments
```
  e755c07e
- C
  [Phi] Migrate load kernel (#45891) · a001f263
  Committed by Chen Weihang on Sep 09, 2022
```
* migrate load kernel

* remove load op

* fix test failed
```
  a001f263
- X
  
  convfusion_cache (#45902) · 3bad26ec
  Committed by xiaoxiaohehe001 on Sep 09, 2022
  
  3bad26ec
- R
  [CustomDevice] add dy2static support (#45878) · abc85c50
  Committed by ronnywang on Sep 09, 2022
```
* [CustomDevice] add dy2static support

* update
```
  abc85c50
- C
  [Phi] Add fusion kernel dir and migrate fused_softmax_mask op (#45802) · 2b4f44d5
  Committed by Chen Weihang on Sep 09, 2022
```
* add fusion dir and fuse_softmax_mask kernel

* remove fusion kernel dir

* migrate infershape

* fix code error
```
  2b4f44d5
- S
  
  fix fused_gemm_epilogue compile error (#45899) · 7d000112
  Committed by sneaxiy on Sep 09, 2022
  
  7d000112
08 Sep, 2022 7 commits
- P
  [PHI] Migrate cast, clip+grad and pool+grad oneDNN kernels (#45775) · 1a929c31
  Committed by piotrekobi on Sep 08, 2022
```
* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* Move classes from mkldnn_reuse.h to onednn_reuse.h

* Migrate pool+grad, clip+grad and cast oneDNN kernels to PHI

* Refactor grad kernels into separate files

* Fix CI failures

* Fix Codestyle

* Implement reviewer suggestions

* Add new lines after includes for readability
Co-authored-by: Silv3S <slawomir.siwek@intel.com>
```
  1a929c31
- L
  
  Migrate roi_align and roi_align_grad to phi. test=kunlun (#45858) · 8add11a0
  Committed by Leo Guo on Sep 08, 2022
  
  8add11a0
- H
  
  polish code comment, test=doc (#45859) · 447d79da
  Committed by HongyuJia on Sep 08, 2022
  
  447d79da
- T
  xpu-paddlepaddle-40 [Task] fused_gemm_epilogue supports xpu (#45706) · 7085cb97
  Committed by taixiurong on Sep 08, 2022
```
* add gemm_epilogue

* xpu-paddlepaddle-40 [Task] fused_gemm_epilogue support test=kunlun
```
  7085cb97
- T
  
  cinn_launch op: fix dtype of tensor is always mutable_data<float> (#45835) · ef53e1b4
  Committed by TeFeng Chen on Sep 08, 2022
  
  ef53e1b4
- X
  [Dy2Static] Filter int64/int32/int16/bool in conditional op (#45759) · 36046a89
  Committed by xiongkun on Sep 08, 2022
```
* stop pass filter int32/int16/int64/bool inputs in cond_op

* fix bugs: except block 0, the backward vars and forward vars exist in different blocks.

* fix code by review
```
  36046a89
- S
  
  fix fused_gemm_epilogue_op compile error (#45862) · 569d6c5b
  Committed by sneaxiy on Sep 08, 2022
  
  569d6c5b
07 Sep, 2022 11 commits
- C
  [Phi] Migrate save kernel (#45665) · fc66fdb7
  Committed by Chen Weihang on Sep 07, 2022
```
* add save kernel

* add save_sr_kernel

* remove original save_op

* add save gpu kernel

* remove combine kernel

* add port.h include

* add save selected rows test

* remove useless kernel.h
```
  fc66fdb7
- Y
  
  rename the template type name for transpose (#45834) · 9b70c556
  Committed by Yuang Liu on Sep 07, 2022
  
  9b70c556
- W
  Construct exec and ctx only once in cond op to speed up (#45794) · ba653e7b
  Committed by WangZhen on Sep 07, 2022
```
* Construct exec and ctx only once in cond op to speed up

* Fix construct function error
```
  ba653e7b
- W
  
  Fix fused cuda op's mutable data [2] (#45562) · 4bbbed9a
  Committed by Wilber on Sep 07, 2022
  
  4bbbed9a
- P
  [PHI] Migrate reduce sum+grad, mean+grad, min and max oneDNN kernels (#45536) · 22255528
  Committed by piotrekobi on Sep 07, 2022
```
* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* Migrate reduce_op oneDNN kernels to phi

* Remove unnecessary header

* remove fluid code

* onednn renaming

* Change std::vector<int64_t> to IntArray

* Fix code style

* Move classes from mkldnn_reuse.h to onednn_reuse.h

* Move more functions from mkldnn_helper.h to onednn_helpper.h

* Change MKLDNN to OneDNN in VLOG message

* Implement reviewer suggestions
Co-authored-by: Silv3S <slawomir.siwek@intel.com>
```
  22255528
- W
  [OpAttr]Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose (#45620) · fe169bf1
  Committed by WangZhen on Sep 07, 2022
```
Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose
```
  fe169bf1
- Y
  
  [alphafold] Transpose support large tensors where their numel is bigger than INT32_MAX (#45753) · d9a9e638
  Committed by Yuang Liu on Sep 07, 2022
  
  d9a9e638
- Z
  Clear extra attrs of reduce op in OpMaker (#45786) · 63b6a11b
  Committed by zyfncg on Sep 07, 2022
```
* clear extra attrs of reduce op in opmaker

* fix reduce_mean
```
  63b6a11b
- H
  
  [XPU] move rnn op to phi. (#45822) · 91631492
  Committed by houj04 on Sep 07, 2022
  
  91631492
- Q
  [MLU] fix sync_bn of mlu and add unittests (#45707) · 500f070d
  Committed by qipengh on Sep 07, 2022
```
* [MLU] fix sync_bn of mlu and add unittests

* [MLU] remove redunant code of pytest
```
  500f070d
- S
  [PHI] Migrate scale kernel (#45537) · 429b5b5b
  Committed by Sławomir Siwek on Sep 07, 2022
```
* scale kernel

* endline

* add inplace

* fix merge conflicts

* Merge conflicts
```
  429b5b5b
06 Sep, 2022 7 commits
- Y
  [PHI]Add TensorArray for PHI (#45479) · 68f99b78
  Committed by YuanRisheng on Sep 06, 2022
```
* add tensor array

* fix ci bugs

* fix ci bugs

* fix ci bugs

* fix ci bugs

* update by comment

* update code
```
  68f99b78
- J
  Added concat workaround for vivo model (#45091) · 8f37c66f
  Committed by jakpiase on Sep 06, 2022
```
* concat workaround

* CI rerun
```
  8f37c66f
- Y
  
  migrate deformable_conv and merged momentum kernels to phi, test=kunlun (#45691) · 7f3c7aeb
  Committed by ykkk2333 on Sep 06, 2022
  
  7f3c7aeb
- Y
  
  migrate unsqueeze kernels to phi, test=kunlun (#45673) · 4acf1ef7
  Committed by ykkk2333 on Sep 06, 2022
  
  4acf1ef7
- W
  
  Fix DequantizeTwoScale kernel (#45632) · 98a5af1a
  Committed by whs on Sep 06, 2022
  
  98a5af1a
- Z
  Clear extra attributes of matmul_v2 in OpMaker (#45708) · d4c4c53d
  Committed by zyfncg on Sep 06, 2022
```
* set use_cudnn=true for conv2d

* clear opmaker of matmul_v2

* fix bug of set_attr

* add extra attr checker in infer_shape
```
  d4c4c53d
- Z
  
  clear extra attrs of some op in opmaker (#45758) · 22f042ba
  Committed by zyfncg on Sep 06, 2022
  
  22f042ba

PaddlePaddle / Paddle synced successfully about 1 year ago
