提交 · d3f52bccd9f419b28ac449bd20fe3476c09f8da8 · PaddlePaddle / Paddle

08 9月, 2022 14 次提交
- C
  backward refuse foward part1 (#45868) · d3f52bcc
  由 Charles-hit 提交于 9月 08, 2022
```
* support more op for high level

* add unit test for high level op

* remove unnecessary comments
```
  d3f52bcc
- T
  xpu-paddlepaddle-40 [任务] fused_gemm_epilogue 支持xpu (#45706) · 7085cb97
  由 taixiurong 提交于 9月 08, 2022
```
* add gemm_epilogue

* xpu-paddlepaddle-40 [任务] fused_gemm_epilogue 支持 test=kunlun
```
  7085cb97
- P
  
  new executor support compiled_program constructed by graph (#45836) · ca1cab3e
  由 pangyoki 提交于 9月 08, 2022
  
  ca1cab3e
- T
  
  cinn_launch op: fix dtype of tensor is always mutable_data<float> (#45835) · ef53e1b4
  由 TeFeng Chen 提交于 9月 08, 2022
  
  ef53e1b4
- A
  
  [unittest]Fix test_reinforment_learing unittest failure (#45867) · 2b0857be
  由 Aurelius84 提交于 9月 08, 2022
  
  2b0857be
- O
  
  [ Hackathon 3rd No.2 ] add paddle.iinfo (#45321) · 40a0a46b
  由 OccupyMars2025 提交于 9月 08, 2022
  
  40a0a46b
- A
  [OpAttr]Refine Teller logic if encounter OpDesc with Variable type Attribute (#45795) · a642365e
  由 Aurelius84 提交于 9月 08, 2022
```
* [OpAttr]Refine Teller logic if encounter OpDesc with Variable type Attribute

* fix iterator

* fix typo

* fix lambda expr

* fix ptr
```
  a642365e
- L
  
  Increase the threshold of softmax and imperative qat UT (#45819) · bd4ce23e
  由 Leo Chen 提交于 9月 08, 2022
  
  bd4ce23e
- X
  [Dy2Static] Filter int64/int32/int16/bool in conditional op (#45759) · 36046a89
  由 xiongkun 提交于 9月 08, 2022
```
* stop pass filter int32/int16/int64/bool inputs in cond_op

* fix bugs: except block 0, the backward vars and forward vars exist in different blocks.

* fix code by review
```
  36046a89
- L
  
  add group argument (#44758) · bb725e3a
  由 LiYuRio 提交于 9月 08, 2022
  
  bb725e3a
- S
  
  Fix where xpu bug (#45832) · 2cda4e21
  由 Siming Dai 提交于 9月 08, 2022
  
  2cda4e21
- W
  
  bug fix (#45853) · b971ba04
  由 wenbin 提交于 9月 08, 2022
  
  b971ba04
- G
  
  fix ptq UT error (#45846) · 83cf6758
  由 Guanghua Yu 提交于 9月 08, 2022
  
  83cf6758
- S
  
  fix fused_gemm_epilogue_op compile error (#45862) · 569d6c5b
  由 sneaxiy 提交于 9月 08, 2022
  
  569d6c5b
07 9月, 2022 26 次提交
- C
  [Phi] Migrate save kernel (#45665) · fc66fdb7
  由 Chen Weihang 提交于 9月 07, 2022
```
* add save kernel

* add save_sr_kernel

* remove original save_op

* add save gpu kernel

* remove combine kernel

* add port.h include

* add save selected rows test

* remove useless kernel.h
```
  fc66fdb7
- L
  
  use xxhash instead of cryptopp (#45837) · a89e48fe
  由 Leo Chen 提交于 9月 07, 2022
  
  a89e48fe
- V
  update security policy test=document_fix (#45843) · f7832fea
  由 Vigi Zhang 提交于 9月 07, 2022
```
add running untrusted models in security policy
```
  f7832fea
- H
  [XPU] update xdnn to 0907. (#45777) · 1e981d0d
  由 houj04 提交于 9月 07, 2022
```
* [XPU] update xdnn to 0906. test=kunlun

* [XPU] update xdnn to 0907. test=kunlun
```
  1e981d0d
- Y
  
  rename the template type name for tranpose (#45834) · 9b70c556
  由 Yuang Liu 提交于 9月 07, 2022
  
  9b70c556
- C
  [Phi] Fix infermeta bug for vector input and output (#45810) · 420d186a
  由 Chen Weihang 提交于 9月 07, 2022
```
* fix infermeta bug for vector input and output

* add unittest
```
  420d186a
- W
  Construct exec and ctx only once in cond op to speed up (#45794) · ba653e7b
  由 WangZhen 提交于 9月 07, 2022
```
* Construct exec and ctx only once in cond op to speed up

* Fix construct function error
```
  ba653e7b
- W
  
  Fix fused cuda op's mutable data [2] (#45562) · 4bbbed9a
  由 Wilber 提交于 9月 07, 2022
  
  4bbbed9a
- B
  
  fix nullptr bug of BmmGradInferMeta (#45765) · 26d161ef
  由 BiynXu 提交于 9月 07, 2022
  
  26d161ef
- P
  [PHI] Migrate reduce sum+grad, mean+grad, min and max oneDNN kernels (#45536) · 22255528
  由 piotrekobi 提交于 9月 07, 2022
```
* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* Migrate reduce_op oneDNN kernels to phi

* Remove unnecessary header

* remove fluid code

* onednn renaming

* Change std::vector<int64_t> to IntArray

* Fix code style

* Move classes from mkldnn_reuse.h to onednn_reuse.h

* Move more functions from mkldnn_helper.h to onednn_helpper.h

* Change MKLDNN to OneDNN in VLOG message

* Implement reviewer suggestions
Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
```
  22255528
- C
  Fix test_custom_relu_op_jit windows error (#45812) · 352babaa
  由 Chen Weihang 提交于 9月 07, 2022
```
* fix test_custom_relu_op_jit windows error

* polish assert format
```
  352babaa
- W
  [OpAttr]Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose (#45620) · fe169bf1
  由 WangZhen 提交于 9月 07, 2022
```
Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose
```
  fe169bf1
- Y
  
  [alphafold] Transpose support large tensors where there numel is bigger than INT32_MAX (#45753) · d9a9e638
  由 Yuang Liu 提交于 9月 07, 2022
  
  d9a9e638
- C
  replace fill_zeros_like op with fill_any_like op (#45657) · 0ddcf30c
  由 Charles-hit 提交于 9月 07, 2022
```
* relace fill_zeros_like op with fill_any_like op in backward.py and tensor.py

* Remove unnecessary comments

* modify create op_desc param
```
  0ddcf30c
- R
  
  Fix bug for AutoGrowthBestFitAllocator build (#45806) · fcbb307c
  由 Ruibiao Chen 提交于 9月 07, 2022
  
  fcbb307c
- W
  Optimiza params sync between CPU and GPU. (#45805) · a2b2af90
  由 Wilber 提交于 9月 07, 2022
```
* enable memory optimize when fp16.

* optimiza params sync between cpu and gpu.
```
  a2b2af90
- Z
  Clear extra attrs of reduce op in OpMaker (#45786) · 63b6a11b
  由 zyfncg 提交于 9月 07, 2022
```
* clear extra attrs of reduce op in opmaker

* fix reduce_mean
```
  63b6a11b
- H
  
  [XPU] move rnn op to phi. (#45822) · 91631492
  由 houj04 提交于 9月 07, 2022
  
  91631492
- Y
  
  [dygraph hybrid pp for interleave] Save/Load for interleaved pipeline. (#45797) · a9cc0274
  由 Yuang Liu 提交于 9月 07, 2022
  
  a9cc0274
- W
  Layernorm shift partition (#45736) · 960109af
  由 wenbin 提交于 9月 07, 2022
```
* first commit

* conver done

* correct format

* layernorm_shift_partition

* correct convert

* redefine plugin

* runable

* bug fix

* modify ShiftPartitionPattern

* correct

* add UT

* modify ut

* compile

* modify enforce

* modify UT
```
  960109af
- C
  [Auto Parallel] Support Iterable dataset for auto parallel (#45518) · b77fa1d9
  由 caozhou 提交于 9月 07, 2022
```
* support iterable dataset for auto parallel

* add split_data proto

* fix unittest bug

* fix recompute bug

* update cmake
```
  b77fa1d9
- Q
  [MLU] fix sync_bn of mlu and add unittests (#45707) · 500f070d
  由 qipengh 提交于 9月 07, 2022
```
* [MLU] fix sync_bn of mlu and add unittests

* [MLU] remove redunant code of pytest
```
  500f070d
- L
  
  add device context getter (#45790) · b7d219be
  由 LiYuRio 提交于 9月 07, 2022
  
  b7d219be
- W
  
  [Eager, Performance optimization] polish uniform_random (#45807) · 1a372bd1
  由 Weilong Wu 提交于 9月 07, 2022
  
  1a372bd1
- L
  Performance fix for broadcast kernel [Part2] (#40051) · 87cba48b
  由 limingshu 提交于 9月 07, 2022
```
* first commit

* merged with develop

* merged with develop

* fix merge sequential one dims bugs
```
  87cba48b
- S
  [PHI] Migrate scale kernel (#45537) · 429b5b5b
  由 Sławomir Siwek 提交于 9月 07, 2022
```
* scale kernel

* endline

* add inplace

* fix merge conflicts

* Merge conflicts
```
  429b5b5b

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功