提交 · c1113400284d09d64debb70fbd13a80a41ef47cb · PaddlePaddle / Paddle

29 12月, 2021 1 次提交

[cherry-pick] support data_format='NHWC' for prelu channel mode (#38495) · c1113400

由 Guoxia Wang 提交于 12月 29, 2021

* support data_format='NHWC' for prelu channel mode (#37019)

* support data_format='NHWC' for prelu channel mode

* fix prelu weight shape for NHWC of static mode (#38310)

c1113400

29 11月, 2021 1 次提交

Fix dropout static when axis != None (#37223) (#37589) · 3a0c550f

由 smallv0221 提交于 11月 29, 2021

* fix dropout static when axis != None

* update dropout test

* add dropout test

* fix test

* Update test_dropout_op.py

* Update test_dropout_op.py

* fix testcase

* fix testcase

* Update test_dropout_op.py

* fix testcase

* fix testcase

* optimize perf

* add new test

* fix testcase

3a0c550f

23 11月, 2021 1 次提交
- Z
  
  elu support alpha < 0 (#37316) (#37437) · 436808c6
  由 zhupengyang 提交于 11月 23, 2021
  
  436808c6
19 11月, 2021 1 次提交

[cherry-pick]Add sparse attention doc warning (#37189) · 5fd8312d

由 Liu-xiandong 提交于 11月 19, 2021

* fix cusparse compile bug in CUDA11.2, test=develop

* modify sparse_attention docs, test=document_fix (#36554)

* modify sparse_attention docs, test=develop

* add warning

* add warning ,test=document_fix

5fd8312d

16 11月, 2021 1 次提交

[cherry-pick-2.2.1]fix fused_transformer_encoder_layer bug (#37229) · 36dd295e

由 zhangkaihuo 提交于 11月 16, 2021

修复了fused_transformer_encoder_layer fine-tune过程发现的一些问题：

    fused_attention_op添加attn_mask=None的支持：PR
    pre_layer_norm处理问题：PR
    参数处理，计算错误的问题：PR
    add_bias计算错误问题：PR
    添加pure fp16的支持：PR

36dd295e

28 10月, 2021 1 次提交

[Cherry-pick] Enable CTC grad compute on GPU (#36780) · 8ede9e6f

由 Hui Zhang 提交于 10月 28, 2021

* Revert "Align CTC grad scale same with ESPNet (#34729)"

This reverts commit 10f9644c.

* ctc grad compute on gpu

8ede9e6f

26 10月, 2021 4 次提交

[cherry pick] add op: fused_feedforward(backward) (#36730) · 76c1bae1

由 zhangkaihuo 提交于 10月 26, 2021

* add op: fused_feedforward(backward) (#35611)

这个PR是fused_feedforward反向的代码

相关kernel实现：fused_dropout_act_bias, fused_residual_dropout_bias, fused_layernorm_residual_dropout_bias

fused_feedforward是一个融合算子，该算子对transformer模型的feed forward层的算子进行融合和封装，使得前端只呈现一个接口，通过融合减少部分访存和kernel launch的时间，以此提升性能。

* Move fused_attention and fused_feedforward functional api path to incubate (#36704)

将 #35905 和 #35843 PR中新增的的python api接口移到incubate目录下。

76c1bae1

Z
[cherry-pick]add op: fused_feedforward(forward) (#36729) · 77034fc3
由 zhangkaihuo 提交于 10月 26, 2021
```
This is a fusion operator to compute feed forward layer in transformer model architecture.
```
77034fc3
H

cherry pick CrossEntropy's bug fix (#36647) · 32fe5a49
由 HydrogenSulfate 提交于 10月 26, 2021

32fe5a49

[cherry-pick-2.2] Fused attention op forward (#35905) (#36708) · d2be870a

由 Li Min 提交于 10月 26, 2021

功能：本PR的目标是提高attention模块的计算性能。
为了减少框架层对op的调度开销，本PR通过在C++层手动实现attention模块，对外提供attention 大op；
为了减少防存开销，本PR采取了两种优化方法：
（1）在q,k,v计算时通过共享输入X，将该处的gemm，transpose和bias add从三次调用减少为一次；
（2）使用kernel融合优化技术，在不同cuda kernel之间通过寄存器传输数据；

d2be870a

25 10月, 2021 1 次提交

Add nn.functional.sparse_attention and some test cases, test=develop (#35757) (#36551) · c57d1e91

由 Liu-xiandong 提交于 10月 25, 2021

Add paddle.nn.functional.sparse_attention API

本个PR主要将sparse_attention功能在python层进行了一层封装，OP的主体代码见：#PR35676

此外，对于封装的python 接口，增加了相应的单测。

c57d1e91

30 9月, 2021 1 次提交
- G
  
  support fp16 (#35888) (#36191) · 87cc8d48
  由 Guoxia Wang 提交于 9月 30, 2021
  
  87cc8d48
26 9月, 2021 1 次提交
- fix pad tuple (#36043) · 2e473f23
  由 littletomatodonkey 提交于 9月 26, 2021
```
* fix pad tuple

* fix format
```
  2e473f23
17 9月, 2021 1 次提交
- X
  fix unpool doc, test=document_fix (#35806) · 652e655f
  由 xiaoting 提交于 9月 17, 2021
```
* fix unpool doc, test=document_fix

* fix typo for python example, test=document_fix
```
  652e655f
15 9月, 2021 3 次提交

Q
[NPU] fix depthwise_conv2d_grad, test=develop (#35626) · d3e06a51
由 Qi Li 提交于 9月 15, 2021
```
* [NPU] fix depthwise_conv2d_grad, test=develop

* remove debug files, test=develop
```
d3e06a51

Add New OP: gumbel_softmax (#35506) · 18eda6c3

由 YuanRisheng 提交于 9月 15, 2021

* Add New Op: gumbel_softmax

* Add New Op: gumbel_softmax

* Add New Op: gumbel_softmax (amend)

* add __main__ function in unit test

* fix bugs when test in windows ci

* update en docs

* delete reletive error in unit test

* delete relative error in unit test

* set hard=True in unit test

18eda6c3

G

fix dim check of class center sample (#35733) · a9577347
由 Guoxia Wang 提交于 9月 15, 2021

a9577347

14 9月, 2021 2 次提交
- Update loss.py · ed728506
  由 XYZ_916 提交于 8月 31, 2021
```
delete main function
```
  ed728506
- 1. optimize the error message of softmax_with_cross_entropy_op;2. add input... · fbf784dd
  由 XYZ_916 提交于 8月 31, 2021
```
1. optimize the error message of softmax_with_cross_entropy_op;2. add input value check for cross_entropy, if the dimention of input is zero, raise error. test = develop
```
  fbf784dd
13 9月, 2021 2 次提交

X
fix interpolate launch error (#35577) · 5f31737b
由 xiaoting 提交于 9月 13, 2021
```
* fix interpolate launch error, test=develop

* fix area mode for interp, test=develop
```
5f31737b

[RC22] Fix linear with matmul_op replace (#35445) · 53e294ca

由 zhulei 提交于 9月 13, 2021

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

53e294ca

08 9月, 2021 1 次提交
- G
  
  fix bug (#35482) · e133d8ef
  由 Guoxia Wang 提交于 9月 08, 2021
  
  e133d8ef
07 9月, 2021 1 次提交
- W
  add conv op check for illegal input or attributes (#35337) · 8307b0cb
  由 wangxinxin08 提交于 9月 07, 2021
```
* add conv op check for illegal input or attributes
```
  8307b0cb
06 9月, 2021 1 次提交

add kernel, stride check (#35106) · 13bbb6b6

由 Double_V 提交于 9月 06, 2021

* add kernel, stride check

* add unitest for param out of range

* delete max limit check

13bbb6b6

29 8月, 2021 1 次提交
- G
  
  test=document_fix (#35221) · 31cd1065
  由 Guoxia Wang 提交于 8月 29, 2021
  
  31cd1065
27 8月, 2021 15 次提交
- G
  
  test=document_fix (#35222) · 5dcff7c8
  由 Guoxia Wang 提交于 8月 27, 2021
  
  5dcff7c8
- X
  Add unpool2d op & Expose max_unpool2d API (#35056) · ceee71a0
  由 xiaoting 提交于 8月 27, 2021
```
* add maxunppol2d op, test=develop

* fix typo, test=develop

* fix unpool unitest, test=develop

* fix unpool code-example, test=develop

* fix for unpool_op_unittest,test=develop

* fix example code, test=develop

* add noqa:F401, test=develop

* fix converage, test=develop

* fix unitest for unpool, test=develop

* rename unpool2d to unpool, test=develop

* rename unpool2d to unpool, test=develop
```
  ceee71a0
- H
  
  Update loss.py · cf6e543b
  由 HydrogenSulfate 提交于 8月 18, 2021
  
  cf6e543b
- H
  
  Update loss.py · 11e9d4e3
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  11e9d4e3
- H
  
  Update loss.py · 0c2d6bcb
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  0c2d6bcb
- H
  
  Update loss.py · 52804cd8
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  52804cd8
- H
  
  Update loss.py · 00467688
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  00467688
- H
  
  Update loss.py · f2df33e3
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  f2df33e3
- H
  
  Update loss.py · dd0140bd
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  dd0140bd
- H
  
  Update loss.py · 3ca813e6
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  3ca813e6
- H
  
  Update loss.py · b9f665d8
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  b9f665d8
- H
  
  Update loss.py · de972c50
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  de972c50
- H
  
  Update loss.py · b4a3f21c
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  b4a3f21c
- H
  
  Update loss.py · fa4805b4
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  fa4805b4
- H
  
  Update loss.py · 39e81532
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  39e81532

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功