Commits · bb51d6dc779f9d42b66adb9f150a8be138d4df2f · Crayon鑫 / Paddle

10 Dec, 2021 1 commit
- Transfer MultiHeadAttention's matmul to v2 op (#36222) · 65494051
  Committed by liu zhengxi on Dec 10, 2021
```
* promote to v2

* alter
```
07 Dec, 2021 1 commit
- add maxunpool2d in __all__ (#37698) · 890bd626
  Committed by xiaoting on Dec 07, 2021
```
* add maxunpool2d in __all__

* fix MaxUnPool2D example
```
30 Nov, 2021 1 commit
- support data_format='NHWC' for prelu channel mode (#37019) · 3f2a665a
  Committed by Guoxia Wang on Nov 30, 2021
```
* support data_format='NHWC' for prelu channel mode
```
26 Nov, 2021 1 commit

Fix dropout static when axis != None (#37223) · f25fda37

Committed by smallv0221 on Nov 26, 2021

* fix dropout static when axis != None

* update dropout test

* add dropout test

* fix test

* Update test_dropout_op.py

* Update test_dropout_op.py

* fix testcase

* fix testcase

* Update test_dropout_op.py

* fix testcase

* fix testcase

* optimize perf

* add new test

* fix testcase

25 Nov, 2021 2 commits

add new API paddle.nn.initializer.Dirac (#37389) · bbb9b28a
Committed by zhouweiwei2014 on Nov 25, 2021
```
* add new API paddle.nn.initializer.Dirac

* fix doc
```

[PaddlePaddle Hackathon] 6. Add ZeroPad2d to Paddle (#37151) · 81861f69

Committed by Matsumoto GAO on Nov 25, 2021

* add zeropad2d v0.1

* add zeropad2d v0.2

* add zeropad2d v0.3

* add zeropad2d v0.3

* add zeropad2d v0.3

* add zeropad2d v0.4

* add zeropad2d v0.5

* add zeropad2d v0.5 codestyle

* add zeropad2d v0.5 codestyle

* add zeropad2d v0.6 functional

* add zeropad2d v0.6 functional

* add zeropad2d v0.6 functional

22 Nov, 2021 1 commit
- elu support alpha < 0 (#37316) · e3503de8
  Committed by zhupengyang on Nov 22, 2021
19 Nov, 2021 1 commit
- add new API paddle.nn.initializer.Orthogonal and calculate_gain (#37163) · 62ad3594
  Committed by zhouweiwei2014 on Nov 19, 2021
```
* add new API paddle.nn.initializer.Orthogonal and calculate_gain

* fix comment

* fix comment
```
18 Nov, 2021 1 commit
- Fix the slow running speed of kl_div when option 'reduction' is set (#37283) · a6e9ff85
  Committed by LielinJiang on Nov 18, 2021
```
* Fix the slow running speed of kl_div when option reduction is set

* fix unittest coverage
```
15 Nov, 2021 1 commit
- modify sparse_attention docs, test=document_fix (#36554) · 6b0cc2b1
  Committed by Liu-xiandong on Nov 15, 2021
```
* modify sparse_attention docs, test=develop

* add warning

* add warning ,test=document_fix
```
12 Nov, 2021 1 commit
- [fix]fix the bug of fused_attention and fused_feedforward (#36972) · 6486e242
  Committed by zhangkaihuo on Nov 12, 2021
```
* fix bug:
1. atten: set the default value of attn_dropout_rate to None
2. ffn: add activation parameter
```
28 Oct, 2021 1 commit

ctc grad compute on gpu (#36756) · 54ef9d06

Committed by Hui Zhang on Oct 28, 2021

* Revert "Align CTC grad scale same with ESPNet (#34729)"

This reverts commit 10f9644c.

* ctc grad compute on gpu

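26 Oct, 2021 2 commits

Add fused attention op backward and python layer. (#36498) · 5119428e

Committed by Li Min on Oct 26, 2021

Purpose: this PR aims to improve the computational performance of the attention module.
To reduce the framework's per-op scheduling overhead, this PR implements the attention module by hand in C++ and exposes it as a single large attention op.
To reduce memory-access overhead, this PR applies two optimizations:
(1) the q, k, and v computations share the input X, so the gemm, transpose, and bias add at that point are each called once instead of three times;
(2) kernel fusion is used so that data is passed between different CUDA kernels through registers.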

Move fused_attention and fused_feedforward functional api path to incubate (#36704) · 9aeca2f1
Committed by Li Min on Oct 26, 2021
```
Move the Python API interfaces added in PRs #35905 and #35843 into the incubate directory.
```
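25 Oct, 2021 1 commit

add op: fused_feedforward(forward) (#35843) · b18cbfb2

Committed by zhangkaihuo on Oct 25, 2021

This PR contains only the forward code of fused_feedforward.

Related kernel implementations: fused_dropout_act_bias, fused_residual_dropout_bias, fused_layernorm_residual_dropout_bias

fused_feedforward is a fused operator. It fuses and wraps the operators of the transformer model's feed-forward layer so that the front end exposes a single interface; the fusion cuts some memory-access and kernel-launch time, which improves performance.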
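
As a rough illustration of what such an operator folds together, the following is a minimal unfused NumPy reference of a transformer feed-forward layer. It is a sketch under stated assumptions, not the PR's implementation: the post-layer-norm ordering, the ReLU activation, the layer norm without scale and shift, and the names `feedforward_reference`, `layer_norm`, and `dropout` are all hypothetical choices made for this example.

```python
import numpy as np


def layer_norm(x, eps=1e-5):
    # Normalize over the last axis; learnable scale/shift omitted for brevity.
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)


def dropout(x, rate, rng):
    # Inverted dropout; a rate of 0.0 is the identity.
    if rate == 0.0:
        return x
    keep = rng.random(x.shape) >= rate
    return x * keep / (1.0 - rate)


def feedforward_reference(x, w1, b1, w2, b2, rate=0.0, rng=None):
    # Hypothetical unfused reference: every step below would normally be
    # a separate op, each with its own kernel launch and memory round trip.
    if rng is None:
        rng = np.random.default_rng(0)
    hidden = x @ w1 + b1                # linear 1 (GEMM + bias add)
    hidden = np.maximum(hidden, 0.0)    # activation (ReLU assumed)
    hidden = dropout(hidden, rate, rng)
    out = hidden @ w2 + b2              # linear 2 (GEMM + bias add)
    out = dropout(out, rate, rng)
    return layer_norm(x + out)          # residual add + layer norm
```

Judging only by their names, the kernels listed above group the element-wise steps between the two GEMMs (bias, activation, dropout, residual, layer norm) so that intermediate tensors are written to memory less often.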

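22 Oct, 2021 1 commit

Fused attention op forward (#35905) · d4906214

Committed by Li Min on Oct 22, 2021

Purpose: this PR aims to improve the computational performance of the attention module.
To reduce the framework's per-op scheduling overhead, this PR implements the attention module by hand in C++ and exposes it as a single large attention op.
To reduce memory-access overhead, this PR applies two optimizations:
(1) the q, k, and v computations share the input X, so the gemm, transpose, and bias add at that point are each called once instead of three times;
(2) kernel fusion is used so that data is passed between different CUDA kernels through registers.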
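
Optimization (1) can be sketched in NumPy as follows. This is only an illustration of the general idea of packing the three projections into one GEMM; the weight layout, the helper names `pack_qkv`, `qkv_separate`, and `qkv_fused`, and the output shape convention are assumptions for this example and are not taken from the PR.

```python
import numpy as np


def pack_qkv(weights, biases):
    # Concatenate the q, k, v weights along the output dimension so that
    # a single GEMM produces all three projections at once.
    return np.concatenate(weights, axis=1), np.concatenate(biases, axis=0)


def qkv_separate(x, weights, biases, num_heads):
    # Baseline: three GEMMs, three bias adds, three transposes.
    batch, seq, embed = x.shape
    outputs = []
    for w, b in zip(weights, biases):
        out = x @ w + b
        out = out.reshape(batch, seq, num_heads, embed // num_heads)
        outputs.append(out.transpose(0, 2, 1, 3))  # (batch, heads, seq, head_dim)
    return tuple(outputs)


def qkv_fused(x, w_qkv, b_qkv, num_heads):
    # Packed version: one GEMM, one bias add, one transpose.
    batch, seq, embed = x.shape
    out = x @ w_qkv + b_qkv                        # (batch, seq, 3 * embed)
    out = out.reshape(batch, seq, 3, num_heads, embed // num_heads)
    out = out.transpose(2, 0, 3, 1, 4)             # (3, batch, heads, seq, head_dim)
    return out[0], out[1], out[2]
```

Both paths yield the same q, k, and v tensors; the packed path simply reads X once and issues one call per step instead of three.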

19 Oct, 2021 1 commit
- fix out of range for area interp, test=develop (#36466) · 77f4597f
  Committed by xiaoting on Oct 19, 2021
18 Oct, 2021 1 commit
- [NPU] fix dtype for arg_max, test=develop (#36457) · 8757fc5b
  Committed by Qi Li on Oct 18, 2021
13 Oct, 2021 2 commits
- fix BatchNorm for fp16 (#36376) · 8fd1b6ad
  Committed by Guoxia Wang on Oct 13, 2021
```
* fix BatchNorm for fp16
```
- [PaddlePaddle hackathon] + ADD CELU (#36088) · d7064f04
  Committed by yujun on Oct 13, 2021
```
* update

* update

* update

* try make CI pass

* doc typo

* update doc string
```
12 Oct, 2021 6 commits
- Update loss.py · f77083bb
  Committed by HydrogenSulfate on Oct 11, 2021
- Update loss.py · 6cd41cec
  Committed by HydrogenSulfate on Oct 11, 2021
- Update loss.py · 3675f25d
  Committed by HydrogenSulfate on Oct 11, 2021
- Update loss.py · 53dc0143
  Committed by HydrogenSulfate on Oct 11, 2021
- Update loss.py · 8c2fbc31
  Committed by HydrogenSulfate on Oct 11, 2021
- Fix the bug when axis is specified and weight is provided · 1d660eb6
  Committed by HydrogenSulfate on Oct 11, 2021
11 Oct, 2021 1 commit

Add nn.functional.sparse_attention and some test cases, test=develop (#35757) · 85b77232

Committed by Liu-xiandong on Oct 11, 2021

Add paddle.nn.functional.sparse_attention API

This PR mainly wraps the sparse_attention functionality in a Python-level layer; the main OP code is in #PR35676

In addition, unit tests were added for the wrapped Python interface.

24 Sep, 2021 1 commit
- fix pad tuple (#35985) · 0c0817cf
  Committed by littletomatodonkey on Sep 24, 2021
```
* fix pad tuple

* fix format
```
21 Sep, 2021 1 commit
- support fp16 (#35888) · 087c23a9
  Committed by Guoxia Wang on Sep 21, 2021
17 Sep, 2021 1 commit
- fix unpool doc, test=document_fix (#35806) · 652e655f
  Committed by xiaoting on Sep 17, 2021
```
* fix unpool doc, test=document_fix

* fix typo for python example, test=document_fix
```
15 Sep, 2021 4 commits

Change the invoking method of settiem from numpy to set_value op when value isn't tensor (#35701) · 86d4af39

Committed by zyfncg on Sep 15, 2021

* Change the invoking method of settiem from numpy to set_value op when value is not tensor

* fix the check logic for inplace in setitem

* fix the unittest problem caused by setitem doesn't support fp16

* modify some code format in setitem

[NPU] fix depthwise_conv2d_grad, test=develop (#35626) · d3e06a51
Committed by Qi Li on Sep 15, 2021
```
* [NPU] fix depthwise_conv2d_grad, test=develop

* remove debug files, test=develop
```

Add New OP: gumbel_softmax (#35506) · 18eda6c3

Committed by YuanRisheng on Sep 15, 2021

* Add New Op: gumbel_softmax

* Add New Op: gumbel_softmax

* Add New Op: gumbel_softmax (amend)

* add __main__ function in unit test

* fix bugs when test in windows ci

* update en docs

* delete reletive error in unit test

* delete relative error in unit test

* set hard=True in unit test

fix dim check of class center sample (#35733) · a9577347
Committed by Guoxia Wang on Sep 15, 2021

14 Sep, 2021 2 commits
- Update loss.py · ed728506
  Committed by XYZ_916 on Aug 31, 2021
```
delete main function
```
- 1. optimize the error message of softmax_with_cross_entropy_op;2. add input... · fbf784dd
  Committed by XYZ_916 on Aug 31, 2021
```
1. optimize the error message of softmax_with_cross_entropy_op;2. add input value check for cross_entropy, if the dimention of input is zero, raise error. test = develop
```
13 Sep, 2021 2 commits

fix interpolate launch error (#35577) · 5f31737b
Committed by xiaoting on Sep 13, 2021
```
* fix interpolate launch error, test=develop

* fix area mode for interp, test=develop
```

[RC22] Fix linear with matmul_op replace (#35445) · 53e294ca

Committed by zhulei on Sep 13, 2021

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

09 Sep, 2021 1 commit
- Update quant_layers.py (#35392) · 2d6871d3
  Committed by XGZhang on Sep 09, 2021
08 Sep, 2021 1 commit
- fix bug (#35482) · e133d8ef
  Committed by Guoxia Wang on Sep 08, 2021
