提交 · e788c7b5939ed822437fd0429218604ca1a60bf4 · BaiXuePrincess / Paddle

19 11月, 2021 1 次提交
- add new API paddle.nn.initializer.Orthogonal and calculate_gain (#37163) · 62ad3594
  由 zhouweiwei2014 提交于 11月 19, 2021
```
* add new API paddle.nn.initializer.Orthogonal and calculate_gain

* fix comment

* fix comment
```
  62ad3594
18 11月, 2021 1 次提交
- L
  Fix the slow running speed of kl_div when option 'reduction' is set (#37283) · a6e9ff85
  由 LielinJiang 提交于 11月 18, 2021
```
* Fix the slow running speed of kl_div when option reduction is set

* fix unittest coverage
```
  a6e9ff85
15 11月, 2021 1 次提交
- L
  modify sparse_attention docs, test=document_fix (#36554) · 6b0cc2b1
  由 Liu-xiandong 提交于 11月 15, 2021
```
* modify sparse_attention docs, test=develop

* add warning

* add warning ,test=document_fix
```
  6b0cc2b1
12 11月, 2021 1 次提交
- Z
  [fix]fix the bug of fused_attention and fused_feedforward (#36972) · 6486e242
  由 zhangkaihuo 提交于 11月 12, 2021
```
* fix bug:
1. atten: set the default value of attn_dropout_rate to None
2. ffn: add activation parameter
```
  6486e242
28 10月, 2021 1 次提交

ctc grad compute on gpu (#36756) · 54ef9d06

由 Hui Zhang 提交于 10月 28, 2021

* Revert "Align CTC grad scale same with ESPNet (#34729)"

This reverts commit 10f9644c.

* ctc grad compute on gpu

54ef9d06

26 10月, 2021 2 次提交

Add fused attention op backward and python layer. (#36498) · 5119428e

由 Li Min 提交于 10月 26, 2021

功能：本PR的目标是提高attention模块的计算性能。
为了减少框架层对op的调度开销，本PR通过在C++层手动实现attention模块，对外提供attention 大op；
为了减少防存开销，本PR采取了两种优化方法：
（1）在q,k,v计算时通过共享输入X，将该处的gemm，transpose和bias add从三次调用减少为一次；
（2）使用kernel融合优化技术，在不同cuda kernel之间通过寄存器传输数据；

5119428e

L
Move fused_attention and fused_feedforward functional api path to incubate (#36704) · 9aeca2f1
由 Li Min 提交于 10月 26, 2021
```
将 #35905 和 #35843 PR中新增的的python api接口移到incubate目录下。
```
9aeca2f1

25 10月, 2021 1 次提交

add op: fused_feedforward(forward) (#35843) · b18cbfb2

由 zhangkaihuo 提交于 10月 25, 2021

这个PR只包含fused_feedforward前向的代码。

相关kernel实现：fused_dropout_act_bias, fused_residual_dropout_bias, fused_layernorm_residual_dropout_bias

fused_feedforward是一个融合算子，该算子对transformer模型的feed forward层的算子进行融合和封装，使得前端只呈现一个接口，通过融合减少部分访存和kernel launch的时间，以此提升性能。

b18cbfb2

22 10月, 2021 1 次提交

Fused attention op forward (#35905) · d4906214

由 Li Min 提交于 10月 22, 2021

功能：本PR的目标是提高attention模块的计算性能。
为了减少框架层对op的调度开销，本PR通过在C++层手动实现attention模块，对外提供attention 大op；
为了减少防存开销，本PR采取了两种优化方法：
（1）在q,k,v计算时通过共享输入X，将该处的gemm，transpose和bias add从三次调用减少为一次；
（2）使用kernel融合优化技术，在不同cuda kernel之间通过寄存器传输数据；

d4906214

19 10月, 2021 1 次提交
- X
  
  fix out of range for area interp, test=develop (#36466) · 77f4597f
  由 xiaoting 提交于 10月 19, 2021
  
  77f4597f
18 10月, 2021 1 次提交
- Q
  
  [NPU] fix dtype for arg_max, test=develop (#36457) · 8757fc5b
  由 Qi Li 提交于 10月 18, 2021
  
  8757fc5b
13 10月, 2021 2 次提交
- G
  fix BatchNorm for fp16 (#36376) · 8fd1b6ad
  由 Guoxia Wang 提交于 10月 13, 2021
```
* fix BatchNorm for fp16
```
  8fd1b6ad
- Y
  [PaddlePaddle hackathon] + ADD CELU (#36088) · d7064f04
  由 yujun 提交于 10月 13, 2021
```
* update

* update

* update

* try make CI pass

* doc typo

* update doc string
```
  d7064f04
12 10月, 2021 6 次提交
- H
  
  Update loss.py · f77083bb
  由 HydrogenSulfate 提交于 10月 11, 2021
  
  f77083bb
- H
  
  Update loss.py · 6cd41cec
  由 HydrogenSulfate 提交于 10月 11, 2021
  
  6cd41cec
- H
  
  Update loss.py · 3675f25d
  由 HydrogenSulfate 提交于 10月 11, 2021
  
  3675f25d
- H
  
  Update loss.py · 53dc0143
  由 HydrogenSulfate 提交于 10月 11, 2021
  
  53dc0143
- H
  
  Update loss.py · 8c2fbc31
  由 HydrogenSulfate 提交于 10月 11, 2021
  
  8c2fbc31
- H
  
  Fix the bug when axis is specified and weight is provided · 1d660eb6
  由 HydrogenSulfate 提交于 10月 11, 2021
  
  1d660eb6
11 10月, 2021 1 次提交

Add nn.functional.sparse_attention and some test cases, test=develop (#35757) · 85b77232

由 Liu-xiandong 提交于 10月 11, 2021

Add paddle.nn.functional.sparse_attention API

本个PR主要将sparse_attention功能在python层进行了一层封装，OP的主体代码见：#PR35676

此外，对于封装的python 接口，增加了相应的单测。

85b77232

24 9月, 2021 1 次提交
- fix pad tuple (#35985) · 0c0817cf
  由 littletomatodonkey 提交于 9月 24, 2021
```
* fix pad tuple

* fix format
```
  0c0817cf
21 9月, 2021 1 次提交
- G
  
  support fp16 (#35888) · 087c23a9
  由 Guoxia Wang 提交于 9月 21, 2021
  
  087c23a9
17 9月, 2021 1 次提交
- X
  fix unpool doc, test=document_fix (#35806) · 652e655f
  由 xiaoting 提交于 9月 17, 2021
```
* fix unpool doc, test=document_fix

* fix typo for python example, test=document_fix
```
  652e655f
15 9月, 2021 4 次提交

Change the invoking method of settiem from numpy to set_value op when value isn't tensor (#35701) · 86d4af39

由 zyfncg 提交于 9月 15, 2021

* Change the invoking method of settiem from numpy to set_value op when value is not tensor

* fix the check logic for inplace in setitem

* fix the unittest problem caused by setitem doesn't support fp16

* modify some code format in setitem

86d4af39

Q
[NPU] fix depthwise_conv2d_grad, test=develop (#35626) · d3e06a51
由 Qi Li 提交于 9月 15, 2021
```
* [NPU] fix depthwise_conv2d_grad, test=develop

* remove debug files, test=develop
```
d3e06a51

Add New OP: gumbel_softmax (#35506) · 18eda6c3

由 YuanRisheng 提交于 9月 15, 2021

* Add New Op: gumbel_softmax

* Add New Op: gumbel_softmax

* Add New Op: gumbel_softmax (amend)

* add __main__ function in unit test

* fix bugs when test in windows ci

* update en docs

* delete reletive error in unit test

* delete relative error in unit test

* set hard=True in unit test

18eda6c3

G

fix dim check of class center sample (#35733) · a9577347
由 Guoxia Wang 提交于 9月 15, 2021

a9577347

14 9月, 2021 2 次提交
- Update loss.py · ed728506
  由 XYZ_916 提交于 8月 31, 2021
```
delete main function
```
  ed728506
- 1. optimize the error message of softmax_with_cross_entropy_op;2. add input... · fbf784dd
  由 XYZ_916 提交于 8月 31, 2021
```
1. optimize the error message of softmax_with_cross_entropy_op;2. add input value check for cross_entropy, if the dimention of input is zero, raise error. test = develop
```
  fbf784dd
13 9月, 2021 2 次提交

X
fix interpolate launch error (#35577) · 5f31737b
由 xiaoting 提交于 9月 13, 2021
```
* fix interpolate launch error, test=develop

* fix area mode for interp, test=develop
```
5f31737b

[RC22] Fix linear with matmul_op replace (#35445) · 53e294ca

由 zhulei 提交于 9月 13, 2021

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

53e294ca

09 9月, 2021 1 次提交
- X
  
  Update quant_layers.py (#35392) · 2d6871d3
  由 XGZhang 提交于 9月 09, 2021
  
  2d6871d3
08 9月, 2021 1 次提交
- G
  
  fix bug (#35482) · e133d8ef
  由 Guoxia Wang 提交于 9月 08, 2021
  
  e133d8ef
07 9月, 2021 1 次提交
- W
  add conv op check for illegal input or attributes (#35337) · 8307b0cb
  由 wangxinxin08 提交于 9月 07, 2021
```
* add conv op check for illegal input or attributes
```
  8307b0cb
06 9月, 2021 2 次提交

add kernel, stride check (#35106) · 13bbb6b6

由 Double_V 提交于 9月 06, 2021

* add kernel, stride check

* add unitest for param out of range

* delete max limit check

13bbb6b6

replase pass with error exception (#35367) · 5675042d

由 Feng Xing 提交于 9月 06, 2021

This PR adds error exception in fused transformer python interface.
The function body are not implemented (will be implemented later).
Following zhiqiu's comment in previous PR-35206 (merged already), it is better to raise an exception instead of using "pass".

5675042d

31 8月, 2021 1 次提交

transformer opt python files (#35206) · e2991555

由 Feng Xing 提交于 8月 31, 2021

This PR adds fused transformer python related files. It defines interface of fused transformer.

Fused transformer implements an optimized version of transformer layer (in python/paddle/nn/layer/transformer.py). In this PR, four layers (functions) are defined:
(1) FusedMultiHeadAttention: multi-head attention layer
(2) FusedFeedForward: feed forward layer
(3) FusedTransformerEncoderLayer: transformer encoder layer
(4) FusedTransformer: transformer layer

e2991555

29 8月, 2021 1 次提交
- G
  
  test=document_fix (#35221) · 31cd1065
  由 Guoxia Wang 提交于 8月 29, 2021
  
  31cd1065
27 8月, 2021 2 次提交

G

test=document_fix (#35222) · 5dcff7c8
由 Guoxia Wang 提交于 8月 27, 2021

5dcff7c8

Add unpool2d op & Expose max_unpool2d API (#35056) · ceee71a0

由 xiaoting 提交于 8月 27, 2021

* add maxunppol2d op, test=develop

* fix typo, test=develop

* fix unpool unitest, test=develop

* fix unpool code-example, test=develop

* fix for unpool_op_unittest,test=develop

* fix example code, test=develop

* add noqa:F401, test=develop

* fix converage, test=develop

* fix unitest for unpool, test=develop

* rename unpool2d to unpool, test=develop

* rename unpool2d to unpool, test=develop

ceee71a0

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致