提交 · a8dee3bbe6de6312d88393481b5cafde3c56a128 · PaddlePaddle / Paddle

24 8月, 2021 7 次提交

add fetch, test=develop (#35019) · a5060b55

由 wanghuancoder 提交于 8月 24, 2021

* add fetch, test=develop

* fix fetch2op, test=develop

* fix fetch2op, test=develop

* refine, test=develop

* fix fetch ctx, test=develop

* add wait, test=develop

* rename fetch2 to fetch_v2, test=develop

* merge, test=develop

a5060b55

D
fix bmm bug (#35098) · de645153
由 duanboqiang 提交于 8月 24, 2021
```
* fix bmm bug

* bmm style

* fix bmm
```
de645153

[oneDNN] Concat refactoring and disabling caching (#35002) · d9c0f09b

由 Jacek Czaja 提交于 8月 24, 2021

* - concat refactoring draft

* - cmpilation fixes

* - yet another compilation fix

* - fix

* - compilation fix

* - fixes to compilation

* - another compilation fix

* - fix

* - Added overloaded AcquirePrimitiveDesc for concat

* - fix

* - reserve introduced

* - UT fixes

* - test concat int8 improved

* - fixes

* - fix to crash

* - lint fixes

* - fixes after review

* - some other fixes from review

d9c0f09b

王

add the extra and quantization for op def, test=develop (#35076) · cb28753c
由王明冬提交于 8月 24, 2021

cb28753c
R
[NPU] add conv_op_npu and test (#34055) · 00a269de
由 ronnywang 提交于 8月 24, 2021
```
* add conv_op_npu and test

* add more tests

* clean headers & support fp16

* update
```
00a269de
R
[NPU] add pool2 op and tests (#34770) · da261732
由 ronnywang 提交于 8月 24, 2021
```
* add pool2d_op_npu and test

* update

* update pool2d_backward_navie

* clean headers
```
da261732
T

Fix a bug of transpose op, about accessing memory out of bounds of the perm param (#35079) · 10563791
由 TeslaZhao 提交于 8月 24, 2021

10563791

23 8月, 2021 7 次提交
- J
  [oneDNN] disable caching for interpolate and batch Norm (#35030) · 673bf719
  由 Jacek Czaja 提交于 8月 23, 2021
```
* - disabled interpolate onednn

* - compilation fix

* - draft of batch_norm cache disabling

* - fixes to UT
```
  673bf719
- L
  Refactor the organization of layer_norm cuda impl. (#34883) · 7f5eb533
  由 Li Min 提交于 8月 23, 2021
```
Refactor the organization of layer_norm cuda impl so that it can be reused in fused attention op.

    Extract the layer_norm cuda impl form layer_norm_op.cu to layer_norm_kernel.cu.h.
    Define fused/attention_layer_norm.h, which can be used in fused attention op in next PR.
```
  7f5eb533
- Z
  Support gettiem by Bool index (#35026) · b6dc16cb
  由 zyfncg 提交于 8月 23, 2021
```
* Support getitem by Bool index

* delete some debug info of bool index

* support the case that the shape of bool index is different from indexed tensor
```
  b6dc16cb
- P
  
  add beam_search_decode npu op (#34967) · 4ce272ed
  由 pangyoki 提交于 8月 23, 2021
  
  4ce272ed
- P
  
  add fill_constant_batch_size_like npu op (#34563) · 7d86737c
  由 pangyoki 提交于 8月 23, 2021
  
  7d86737c
- T
  
  Fix a bug of strided_slice op, about the axes parameter access memory out of bounds (#35062) · aefec228
  由 TeslaZhao 提交于 8月 23, 2021
  
  aefec228
- Z
  add adamw cuda kernel (#35020) · 77a8a394
  由 zhaoyingli 提交于 8月 23, 2021
```
* adamw support cuda

* adamw support cuda
```
  77a8a394
22 8月, 2021 1 次提交
- Z
  
  implementation of broadcast add backward by reduce (#34143) · 56c5e210
  由 Zhang Zheng 提交于 8月 22, 2021
  
  56c5e210
20 8月, 2021 7 次提交
- H
  
  Add paddle.linalg.matrix_power OP (#34667) · e2241a43
  由 Hao Lin 提交于 8月 20, 2021
  
  e2241a43
- Y
  
  [hybrid performance] Grad fuse for gradient merge under pipeline mode (#35004) · 4d9b2d6d
  由 Yuang Liu 提交于 8月 20, 2021
  
  4d9b2d6d
- L
  [npu]Add argsort op (#34865) · 99ffeffe
  由 lzzyzlbb 提交于 8月 20, 2021
```
* add rmsprop npu

* add argsort npu

* add argsort npu

* modify according to review

* modify sharedatawith according to review

* modify reshape according to review

* rm dygraph=false
```
  99ffeffe
- S
  [NPU] Support npu kernel for pad3d op (#34815) · ef517a56
  由 Sing_chan 提交于 8月 20, 2021
```
* [NPU] Support npu kernel for pad3d op

* fix for comment of zhouwei25

* fix some bugs according to qili93's comments

* add support and test for paddings in input

* delete VLOG used for debug
```
  ef517a56
- Z
  [NPU] Support npu op depthwise_conv2d (#34853) · 4c115a82
  由 zhaoyingli 提交于 8月 20, 2021
```
* add depthwise_conv2d npu

* add some tests

* Delete test_unique_op_npu.py

* delete trans input
```
  4c115a82
- Z
  [NPU] Support npu op where and where grad (#34587) · d082955e
  由 zhaoyingli 提交于 8月 20, 2021
```
* [NPU] Support npu op where and where grad

* fix use const_cast

* delete a test
```
  d082955e
- J
  add (N,C,*) input support for GroupNorm (#34773) · 46371515
  由 JYChen 提交于 8月 20, 2021
```
* add (N,C,*) input support for GroupNorm

* --amend
```
  46371515
19 8月, 2021 3 次提交

[NPU] Support npu kernel for sin op (#34844) · 4641e8fc

由 JingZhuangzhuang 提交于 8月 19, 2021

* add npu sin op

* [NPU] Support npu kernel for sin op

* modify support npu kernel for sin op

* modify support npu kernel for sin op

* modify nou sin op

* modify npu sin op

* add sin op npu

4641e8fc

Y
Add dimension check for inverse to avoid dividing by 0 error when input's... · a2e08657
由 Yiqun Liu 提交于 8月 19, 2021
```
Add dimension check for inverse to avoid dividing by 0 error when input's shape is [0, 0, 0]. (#34996)
```
a2e08657
C
fix batch_norm and instance norm when input is [] (#34107) · ca7f5208
由 ceci3 提交于 8月 19, 2021
```
* fix batch_norm and instance norm when input is []
```
ca7f5208

18 8月, 2021 8 次提交
- L
  [NPU]add rmsprop op (#34864) · 9cbba97b
  由 lzzyzlbb 提交于 8月 18, 2021
```
* [npu]add rmsprop op
```
  9cbba97b
- X
  Add NPU kernel for norm Op: float16 and float32 (#34609) · 755c8a19
  由 xiongkun 提交于 8月 18, 2021
```
* Add NPU kernel for norm Op: float16 and float32

* fix code for code review

* fix for code review

* add type for paddle_throw

* remove unnecessary head file.\nAdd more testcase

* remove a broadcast
```
  755c8a19
- fix pad outliers err (#34979) · 248e27b7
  由 littletomatodonkey 提交于 8月 18, 2021
```
* fix pad outliers err

* fix pad api input type and doc

* fix example of pad

* add unittest for pad3d

* fix unittest

* fix error format

* fix pad doc
```
  248e27b7
- J
  [NPU] Add square grad (#34889) · 1b71a718
  由 Jackwaterveg 提交于 8月 18, 2021
```
* test=develop

* test=develop
```
  1b71a718
- J
  [NPU] Add leaky Relu (#34894) · 40f62737
  由 Jackwaterveg 提交于 8月 18, 2021
```
* test=develop

* test=develop
```
  40f62737
- W
  
  add the safe check for the some ops (#34978) · 12bf046b
  由 wawltor 提交于 8月 18, 2021
  
  12bf046b
- L
  [NPU] add retry on HcclGetRootInfo to fix "bind fail" (#34977) · 52a7b0c4
  由 Leo Chen 提交于 8月 18, 2021
```
* add retry for HcclGetRootInfo

* refine code

* reduce retry interval
```
  52a7b0c4
- G
  support class center sample of PartialFC (#34106) · 100db44f
  由 Guoxia Wang 提交于 8月 18, 2021
```
* support class center sample of PartialFC
```
  100db44f
17 8月, 2021 7 次提交

R

[NPU]Adamw skip update for npu (#34897) · b4474fb4
由 Roc 提交于 8月 17, 2021

b4474fb4
A

[NPU] add where_index op and tests (#34951) · 1ef21855
由 Aganlengzi 提交于 8月 17, 2021

1ef21855

Copy boost optional to Paddle (#34780) · 9be41447

由 chentianyu03 提交于 8月 17, 2021

* copy boost optional.hpp to paddle

* copy boost optional.hpp to paddle

* move directions

* del fluid/utils

* modify .hpp to .h

* move directions

* modify to paddle::optional

* add modification description

* format code stype for the files in paddle/utils

* format code stype

9be41447

[oneDNN ] disabling more ops caching (#34830) · f1c1d9e0

由 Jacek Czaja 提交于 8月 17, 2021

* - disabled caching of layer norm

- fix in compilation

- compilation fix

- transpose caching disabled

- compilation fix

- more compilation fixes

- sum caching disabled

- compilation fix

* - LRN with disabled cache

* lint fixes

f1c1d9e0

S
[bug fix] fix unfold negative_size_param (#34943) · 8ef1bf87
由 shangliang Xu 提交于 8月 17, 2021
```
* [bug fix] fix unfold negative_size_param
```
8ef1bf87

Align CTC grad scale same with ESPNet (#34729) · 10f9644c

由 Hui Zhang 提交于 8月 16, 2021

* dygraph support more ctc grad scale

* scale for 1.x

* fix unitest

* fix unitest

* format code

* fix unittest

* fix log info

* unittest cov

* fix format;notest,test=cpu,coverage

* skip ctc_loss egs;test=cpu

* warpctc grad cov;test=coverage

* add dygraph test;test=coverage

* format;test=cpu,coverage

* format;test=cpu

* add api compat;test=cpu

* add cpu test

* rename

* rename

* fix

* fix test

* format

* eigen cpu

* eigen gpu grad pass

* cuda gpu pass

* format

* fix ci

10f9644c

Add some passes which can be applied to Program (#34730) · 8046e33d

由 Zeng Jinle 提交于 8月 17, 2021

* add inplace passes and tests

* update

* fix use_cuda undefined
fix compile error of op compat

* add more ut

* fix CPU CI error

* check adam unique

* fix mac/windows ci, improve coverage

* fix ci error

* follow weihang's comment

* fix BlockDesc::MoveFrom

* follow qiuliang's comment

* update

* follow huihuang's comments

8046e33d

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功