Commits · 6ff179abc4a37840071ef1fc86d37287b31c7511 · BaiXuePrincess / Paddle

30 Aug, 2021 · 2 commits

[Op Def] Add extra def of linear_interp & linear_interp_v2 & addmm (#35233) · 6ff179ab

Committed by zhulei on Aug 30, 2021

* [Op Def] Add extra def of linear_interp & linear_interp_v2

* [Op Def] Add extra def of linear_interp & linear_interp_v2 & addmm

6ff179ab

Set value (#34886) · 37d281c9

Committed by xiongkun on Aug 30, 2021

* tmp

* Tile - Assign - Crop

* Finish the set value npu kernel and test case in npu

* improve the error message

* Modify according to zhangliujie

* code review

37d281c9

27 Aug, 2021 · 6 commits

add uniform_ op and UT (#33934) · be29b8ee
Committed by JYChen on Aug 27, 2021

be29b8ee

Add unpool2d op & Expose max_unpool2d API (#35056) · ceee71a0

Committed by xiaoting on Aug 27, 2021

* add maxunppol2d op, test=develop

* fix typo, test=develop

* fix unpool unitest, test=develop

* fix unpool code-example, test=develop

* fix for unpool_op_unittest,test=develop

* fix example code, test=develop

* add noqa:F401, test=develop

* fix converage, test=develop

* fix unitest for unpool, test=develop

* rename unpool2d to unpool, test=develop

* rename unpool2d to unpool, test=develop

ceee71a0

sparse_momentum_op is used to save w@GRAD memory for gather_op (#34942) · 234ce932
Committed by Guoxia Wang on Aug 27, 2021
```
* sparse_momentum_op is used to save w@GRAD memory for gather_op when gather from a large parameter
```
234ce932
gelu/logsigmoid add AsExtra (#35198) · 2006fbc4
Committed by zhupengyang on Aug 27, 2021

2006fbc4

add elementwise max grad op for npu (#34862) · 5310ceab

Committed by baoachun on Aug 27, 2021

* add elementwise max grad op for npu

* add elementwise max grad op for npu

* add elementwise max grad op for npu

* add elementwise max grad op for npu

* add elementwise max grad op for npu

5310ceab

Polish the error message of paddle.slice. (#35179) · 669853f5
Committed by WeiXin on Aug 27, 2021
```
* polish the error message of paddle.slice.

* polish code.
```
669853f5

26 Aug, 2021 · 9 commits

[oneDNN] disable caching oneDNN primitives in matmul v2, Reduce grad and... · 31f0221f

Committed by Jacek Czaja on Aug 26, 2021

[oneDNN] disable caching oneDNN primitives in  matmul v2, Reduce grad and elementwise_add grad, expand_v2 (#35132)

* - grad caching disabled of matmul_v1

- compilation fix

- compilation fix

* - reduction removed

* - Matmul v2 disabled caching

* Draft of further changes

* - workaround for reducegrad

* - fixes to UT

* - fix to compilation

* - another fix

* - fix

31f0221f

fix assign bug support fp16 uint8 (#35153) · 270efb96

Committed by duanboqiang on Aug 26, 2021

* fix assign bug support fp16 uint8

* fix dygragh assign bool bug

* modify code style

* revoke bool modification

270efb96

Support dropout backward in eval mode (#35122) · f1275fb6
Committed by smallv0221 on Aug 26, 2021
```
* Support dropout backward in eval mode

* add downscale case

* minor fix

* minor fix
```
f1275fb6

Support Multi-Stream, Single-Thread in New Executor (#35024) · 678a259a

Committed by Aurelius84 on Aug 26, 2021

* Modify into QueueSync QueueAsync

* fix complie on MacOS

* fix pointer

* fix conflict

* polish unittest

* fix windows fetch error

* polish code according reviewer

* fix device_guard on CPU place

678a259a

Add feed_forward for fused attention op. (#34945) · d1a33bc7

Committed by Li Min on Aug 26, 2021

Describe

Add feed_forward for fused attention op.
(1) Encapsulate matmul impl (forward and backward) used in attention op.
(2) Implement bias_add (forward and backward) used in attention op.

d1a33bc7

[NPU] Support npu kernel for StridedSlice op without grad (#34601) · fa6c59a4
Committed by Bo Liu on Aug 26, 2021

fa6c59a4
Add roi align op npu (#34973) · 289e1818
Committed by shiyutang on Aug 26, 2021
```
* add_roi_align_npu

* update

* update

* update
```
289e1818
fix cast op (#35156) · 412877e6
Committed by duanboqiang on Aug 26, 2021

412877e6
fix the bug of channel-wise quantization for ernie (#34948) · c71025eb
Committed by XGZhang on Aug 26, 2021

c71025eb

25 Aug, 2021 · 6 commits
- Fix for expand_v2 op (#35101) · 1f34f7ec
  Committed by jakpiase on Aug 25, 2021
```
* temporary change

* fix for expand_v2

* changes after review, activated ppyolov inference test
```
  1f34f7ec
- fix cpu adamw problem for np.float64 (#35124) · 700205e8
  Committed by zhaoyingli on Aug 25, 2021
  
  700205e8
- [NPU] Fix the performance problem when 'axis' is not specified (#35116) · 91ba86b1
  Committed by ronnywang on Aug 25, 2021
  
  91ba86b1
- [hybrid performance] optim npu coalesce set constant (#35105) · 4bfd0445
  Committed by Yuang Liu on Aug 25, 2021
  
  4bfd0445
- [NPU] add npu_one_hot_v2 (#34937) · d710c3a0
  Committed by ronnywang on Aug 25, 2021
  
  d710c3a0
- update elementwise api in kunlun (#35021) · ff96a7d5
  Committed by taixiurong on Aug 25, 2021
  
  ff96a7d5
24 Aug, 2021 · 8 commits
- Add flags to control whether to check Nan value of hccl_allreduce_sum. (#35093) · 5b737834
  Committed by gongweibao on Aug 24, 2021
  
  5b737834
- add fetch, test=develop (#35019) · a5060b55
  Committed by wanghuancoder on Aug 24, 2021
```
* add fetch, test=develop

* fix fetch2op, test=develop

* fix fetch2op, test=develop

* refine, test=develop

* fix fetch ctx, test=develop

* add wait, test=develop

* rename fetch2 to fetch_v2, test=develop

* merge, test=develop
```
  a5060b55
- fix bmm bug (#35098) · de645153
  Committed by duanboqiang on Aug 24, 2021
```
* fix bmm bug

* bmm style

* fix bmm
```
  de645153
- [oneDNN] Concat refactoring and disabling caching (#35002) · d9c0f09b
  Committed by Jacek Czaja on Aug 24, 2021
```
* - concat refactoring draft

* - cmpilation fixes

* - yet another compilation fix

* - fix

* - compilation fix

* - fixes to compilation

* - another compilation fix

* - fix

* - Added overloaded AcquirePrimitiveDesc for concat

* - fix

* - reserve introduced

* - UT fixes

* - test concat int8 improved

* - fixes

* - fix to crash

* - lint fixes

* - fixes after review

* - some other fixes from review
```
  d9c0f09b
- add the extra and quantization for op def, test=develop (#35076) · cb28753c
  Committed by 王明冬 on Aug 24, 2021
  
  cb28753c
- [NPU] add conv_op_npu and test (#34055) · 00a269de
  Committed by ronnywang on Aug 24, 2021
```
* add conv_op_npu and test

* add more tests

* clean headers & support fp16

* update
```
  00a269de
- [NPU] add pool2 op and tests (#34770) · da261732
  Committed by ronnywang on Aug 24, 2021
```
* add pool2d_op_npu and test

* update

* update pool2d_backward_navie

* clean headers
```
  da261732
- Fix a bug of transpose op, about accessing memory out of bounds of the perm param (#35079) · 10563791
  Committed by TeslaZhao on Aug 24, 2021
  
  10563791
23 Aug, 2021 · 7 commits
- [oneDNN] disable caching for interpolate and batch Norm (#35030) · 673bf719
  Committed by Jacek Czaja on Aug 23, 2021
```
* - disabled interpolate onednn

* - compilation fix

* - draft of batch_norm cache disabling

* - fixes to UT
```
  673bf719
- Refactor the organization of layer_norm cuda impl. (#34883) · 7f5eb533
  Committed by Li Min on Aug 23, 2021
```
Refactor the organization of layer_norm cuda impl so that it can be reused in fused attention op.

    Extract the layer_norm cuda impl form layer_norm_op.cu to layer_norm_kernel.cu.h.
    Define fused/attention_layer_norm.h, which can be used in fused attention op in next PR.
```
  7f5eb533
- Support getitem by Bool index (#35026) · b6dc16cb
  Committed by zyfncg on Aug 23, 2021
```
* Support getitem by Bool index

* delete some debug info of bool index

* support the case that the shape of bool index is different from indexed tensor
```
  b6dc16cb
- add beam_search_decode npu op (#34967) · 4ce272ed
  Committed by pangyoki on Aug 23, 2021
  
  4ce272ed
- add fill_constant_batch_size_like npu op (#34563) · 7d86737c
  Committed by pangyoki on Aug 23, 2021
  
  7d86737c
- Fix a bug of strided_slice op, about the axes parameter access memory out of bounds (#35062) · aefec228
  Committed by TeslaZhao on Aug 23, 2021
  
  aefec228
- add adamw cuda kernel (#35020) · 77a8a394
  Committed by zhaoyingli on Aug 23, 2021
```
* adamw support cuda

* adamw support cuda
```
  77a8a394
22 Aug, 2021 · 1 commit
- implementation of broadcast add backward by reduce (#34143) · 56c5e210
  Committed by Zhang Zheng on Aug 22, 2021
  
  56c5e210
20 Aug, 2021 · 1 commit
- Add paddle.linalg.matrix_power OP (#34667) · e2241a43
  Committed by Hao Lin on Aug 20, 2021
  
  e2241a43

BaiXuePrincess / Paddle is up to date with the fork source project
