提交 · 92568edbf7a6023f897b8d7e5f9f1ea985f28fa2 · PaddlePaddle / Paddle

31 5月, 2022 2 次提交

C
[Eager] Polish append op using for model perf (#43102) · e9589e35
由 Chen Weihang 提交于 5月 31, 2022
```
* polish append op using

* fix var error

* fix group norm impl
```
e9589e35

【PaddlePaddle Hackathon 2】16 新增 API RRelu (#41823) · 21e1d10f

由 thunder95 提交于 5月 31, 2022

* rrelu逻辑部分

* unregistered op kernel (unresolved)

* commit before merge

* 丰富测试用例

* 修复rrelu-sig的bug

* 修复cpu环境测试

* 修改拼写错误

* 修改code format

* 尝试优化测试用例timeout的问题

* 优化测试用例

* 移除seed, 优化随机函数

* update en doc for rrelu

* fix rrelu en docs, test=document_fix

* add paper link for en docs, test=document_fix

* udpate en doc

* add r,test=document_fix

21e1d10f

19 5月, 2022 1 次提交
- Z
  Fix typos in the comment doc of SimpleRNN, LSTM, GRU: hidden_size -> input_size. (#42770) · 155fe05b
  由 Zhengyang Song 提交于 5月 19, 2022
```
test=document_fix
```
  155fe05b
13 5月, 2022 1 次提交
- L
  
  Update api docs (#42725) · cde2b24d
  由 Linjie Chen 提交于 5月 13, 2022
  
  cde2b24d
12 5月, 2022 1 次提交
- S
  
  Fix some typos in paddle/. (#42408) · 2012672c
  由 Shuangchi He 提交于 5月 12, 2022
  
  2012672c
28 4月, 2022 1 次提交
- P
  fix collections.Sequence in python3.10 (#42242) · edb61a52
  由 pangyoki 提交于 4月 28, 2022
```
* fix collections.Sequence in python3.10

* fix format
```
  edb61a52
26 4月, 2022 1 次提交

【PaddlePaddle Hackathon 2】29、为 Paddle 新增 PixelUnshuffle 组网 API (#40728) · 5be9b824

由 BrilliantYuKaimin 提交于 4月 26, 2022

* 增加PixelUnshuffle的形状推断

* 增加PixelUnshuffle的算子注册

* 增加PixelUnshuffle及其梯度的核函数

* 增加PixelUnshuffle算子的描述

* 增加PixelUnshuffle算子的签名

* 在Python层面增加PixelUnshuffle

* 增加PixelUnshuffle的单测

* Update test_pixel_unshuffle.py

* test=document_fix

* Update test_pixel_unshuffle.py

增加对extra_repr的测试

* 修正代码格式

* Update test_pixel_unshuffle.py

修正对extra_repr的测试

* 修改pixel_unshuffle核函数的实现位置

* 修正代码格式

* 完善对输入的检查

* Update test_pixel_unshuffle.py

* 完善pixel_unshuffle的输入检查

* Update pixel_unshuffle_op.cc

* Update unary.cc

* add pixel_unshuffle

* Update test_pixel_unshuffle.py

* Update vision.py

* 调整代码格式

* Update vision.py

* Delete extra spaces

* Update pixel_unshuffle_sig.cc

* Update vision.py

* Update vision.py

* add PixelUnshuffleGradInferMeta

* remove PixelUnshuffleOpArgumentMapping

* Update pixel_unshuffle_op.cc

* 调整pixel_unshuffle及其梯度的核函数的实现位置

* Update pixel_unshuffle_op.cc

5be9b824

25 4月, 2022 1 次提交

【PaddlePaddle Hackathon 2】24、为 Paddle 新增 nn.ChannelShuffle 组网 API (#40743) · bbaaf217

由 BrilliantYuKaimin 提交于 4月 25, 2022

* Add infermeta for ChannelShuffle

* Create channel_shuffle_grad_kernel.h

* Create channel_shuffle_kernel.h

* Create channel_shuffle_sig.cc

* Create channel_shuffle_op.cc

ChannelShuffle算子的描述

* Create channel_shuffle_kernel_impl.h

ChannelShuffle核函数的实现

* Create channel_shuffle_grad_kernel_impl.h

ChannelShuffle反向核函数的实现

* Add kernel register of channel shuffle and grad

注册ChannelShuffle及其反向的核函数

* add nn.functional.channel_shuffle

* add nn.ChannelShuffle

* Create test_channel_shuffle.py

* Update example of ChannelShuffle in vision.py

* Update test_channel_shuffle.py

* 修改channel_shuffle核函数的实现位置

* 修正代码格式

* 删除多余空格

* 完善channel_shuffle的错误检查

* Update unary.cc

* Update channel_shuffle_op.cc

* Update test_channel_shuffle.py

* Update unary.cc

* add channel_shuffle

* Update test_channel_shuffle.py

* Update vision.py

* 调整代码格式

* Update channel_shuffle_sig.cc

* 更新ChannelShuffle的文档

* 更新channel_shuffle的文档

* remove ChannelShuffleOpArgumentMapping

* add ChannelShuffleGradInferMeta

* Update channel_shuffle_op.cc

* 调整channel_shuffle及其梯度的核函数的位置

bbaaf217

21 4月, 2022 1 次提交
- A
  【PaddlePaddle Hackathon 2】23、为 Paddle 新增 Softmax2D 组网API (#40910) · 920d44df
  由 Asthestarsfalll 提交于 4月 21, 2022
```
* Hackathon 23

* fix bug

* fix pylint error

* try

* fix CI-Coverage

* update and add more unittest

* update
```
  920d44df
18 4月, 2022 1 次提交

[Eager] use final op in maskrcnn and hrnet (#41927) · aaabb796

由 wanghuancoder 提交于 4月 18, 2022

* update

* add conv yaml

* add backward

* remove useless code

* fix bug

* fix bug

* revert fluid dygraph conv2d

* remove useless infermeta function

* fix meta fn deluplicat error

* conv using custom impl

* remove amp include

* fix bug

* use final op in maskrcnn and hrnet

* refine
Co-authored-by: Nphlrain <phliuhongyu@126.com>

aaabb796

02 4月, 2022 1 次提交

[Yaml] transfer around 22 ops yaml file and pass the final state OpTest. (#41024) · 16bfcd18

由 xiongkun 提交于 4月 02, 2022

* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

16bfcd18

24 3月, 2022 1 次提交

Fix rnn, wmt16 docs;test=document_fix (#40783) · cc8e98c7

由 Jack Zhou 提交于 3月 24, 2022

* Fix rnn, wmt16 docs;test=document_fix

* Fix wmt14 docs;test=document_fix

* Add more description;test=document_fix

cc8e98c7

08 3月, 2022 1 次提交

Fix fold python examples (#38636) · d4a4eb9d

由 xiaoting 提交于 3月 08, 2022

* fix fold python examples, test=develop

* fix size type, test=develop

* fix python example, test=develop

* fix fold shape check

* fix fold dygraph mode, test=develop

d4a4eb9d

24 2月, 2022 2 次提交
- X
  [doc]Fix maxunpool2d example (#39862) · eb4ad509
  由 xiaoting 提交于 2月 24, 2022
```
* fix maxunpool2d example, test=document_fix

* fix maxunpool2d example, test=document_fix
```
  eb4ad509
- L
  fix 'invalid escape sequence' (#39842) · 4e26fa57
  由 Leo Chen 提交于 2月 24, 2022
```
* fix 'invalid escape sequence'

* fix assert error
```
  4e26fa57
23 2月, 2022 1 次提交
- L
  fix 'is with a literal' warning (#39798) · 22abb6b3
  由 Leo Chen 提交于 2月 23, 2022
```
* fix 'is with a literal'

* fix typo
```
  22abb6b3
22 2月, 2022 1 次提交
- Z
  
  unset fluid in nn.others (#34935) · a710738e
  由 zhiboniu 提交于 2月 22, 2022
  
  a710738e
20 1月, 2022 1 次提交

[Eager] Support Eager mode for some testcase (#38783) · d21074cd

由 wanghuancoder 提交于 1月 20, 2022

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Adjusted function generation/call between Python-C API & Dygraph API

* Synchronized auto-generated Python-C API with Dygraph Forward Functions

* support more eager tensor api

* fix merge compile error

* fix compile error and fit develop code

* support pure CPU

* fix some logic error in eager_mode

* support _varbase_creator in eager mode

* Added safe_initialized interface to EagerTensor for use in processing dispensable inputs

* for eager mode

* refine

* support multiple constructor for eager tensor

* add place related code

* polish code

* specific randint with dtype of int64

* Support pure cpu test

* eager logic

* refine test in pure cpu

* eager logic

* eager logic

* eager logic, test=develop

* skip core.eager when in inference, test=develop

* refine, test=develop

* refine, test=develop

* call RetainGrad after run forward kernel, test=develop

* refine, test=develop

* support dygraph util, meta, guard test

* eager test case

* support inference test

* refine test and fix initializer failed

* modify eagertensor patch method

* add eagertensor.clear_grandint, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* support create varbase and fix retain grad error

* call monkey_patch_varbase in _test_eager_guard, test=develop

* fix windows error

* split clear_gradient to clear_gradient and zero_grads, test=develop

* refine, test=develop

* refine, test=develop

* support test_imperative_basic test in eager mode

* remove additional log in variable.h

* remove additional log in variable.h

* remove additional code create in merge

* eager

* fix some eager logic, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* patch_tensor_method_func, test=develop

* refine, test=develop

* eager test case, test=develop

* refine, test=develop

* eager, test=develop

* eager, test=develop

* eager optimizer, test=develop

* eager optimizer, test=develop

* eager test_imperative_optimizer_v2, test=develop

* eager, test=develop

* refine, test=develop

* refine, test=develop

* eager, test=develop

* add resize in share buffer to, test=develop

* eager, test=develop

* fix _share_buffer_to, test=develop

* refine, test=develop

* refine, test=develop

* support eager for dataloader,test=develop
Co-authored-by: Njim19930609 <jim19930609@gmail.com>
Co-authored-by: NJiabinYang <360788950@qq.com>

d21074cd

12 1月, 2022 1 次提交

support 5d for nearest interp (#38868) · d296456c

由 xiaoting 提交于 1月 12, 2022

* support 5d for nearest

* update nearest3d unittest, test=develop

* fix approve ci, test=develop

* fix approve ci, test=develop

d296456c

10 1月, 2022 2 次提交

W

modify comment of mish (#38805) · 492e6dd0
由 wangxinxin08 提交于 1月 10, 2022

492e6dd0

Add MaxUnPool3D op and MaxUnPool1D op (#38716) · 7e31542c

由 andyjpaddle 提交于 1月 10, 2022

* add maxunpool3d op

* update doc for maxunpool3d op

* update doc for maxunpool3d op

* update doc for maxunpool3d op

* update sample code for maxunpool3d

* add maxunpool1d op

* update some code for maxunpool1d

7e31542c

07 1月, 2022 1 次提交

modify mish op and add mish api (#38734) · 8c92337c

由 wangxinxin08 提交于 1月 07, 2022

* add mish operator and api

* remove redundant code and modify grad_atol of mish unittest

* modify mish code to be consistent with other activation implementation

8c92337c

31 12月, 2021 1 次提交

Add fold opereators (#38613) · 8898dce1

由 xiaoting 提交于 12月 31, 2021

* add fold opereators, test=develop

* add fold opereators, test=develop

* add fold opereators, test=develop

* update fold op error test, test=develop

* fix unitext, test=develop

* fix unitext, test=develop

8898dce1

29 12月, 2021 1 次提交
- fix extra_repr in _InstanceNormBase, test=develop (#38537) · 21366a92
  由小湉湉提交于 12月 29, 2021
  
  21366a92
15 12月, 2021 1 次提交

Add New API nn.HingeEmbeddingLoss (#37540) · 3b85864a

由 Skr.B 提交于 12月 15, 2021

* add hinge_embedding_loss

* fix test_API

* test_API succeed

* add English doc

* fixed using of expired fluid api

* fix doc

* fix doc and rm python/paddle/fluid/layers/loss.py

* get raw python/paddle/fluid/layers/loss.py back

* fix Examples bug in English doc

* unique -> flatten

* fix api code

* fix English doc

* fix functional loss English doc

* fix Example doc

* .numpy() -> paddle.unique()

* fix unique

* fix label_item_set

* modified judgment equation

* Got a beautiful loss equation

* use paddle.to_tensor

* fix loss and add static check

* fix loss and add static check

* delta -> margin

3b85864a

10 12月, 2021 1 次提交
- L
  Transfer MultiHeadAttention's matmul to v2 op (#36222) · 65494051
  由 liu zhengxi 提交于 12月 10, 2021
```
* promote to v2

* alter
```
  65494051
07 12月, 2021 1 次提交
- X
  add maxunpool2d in __all__ (#37698) · 890bd626
  由 xiaoting 提交于 12月 07, 2021
```
* add maxunpool2d in __all__

* fix MaxUnPool2D example
```
  890bd626
30 11月, 2021 1 次提交
- G
  support data_format='NHWC' for prelu channel mode (#37019) · 3f2a665a
  由 Guoxia Wang 提交于 11月 30, 2021
```
* support data_format='NHWC' for prelu channel mode
```
  3f2a665a
25 11月, 2021 1 次提交

【PaddlePaddle Hackathon】6、在 Paddle 中新增 ZeroPad2d (#37151) · 81861f69

由 Matsumoto GAO 提交于 11月 25, 2021

* add zeropad2d v0.1

* add zeropad2d v0.2

* add zeropad2d v0.3

* add zeropad2d v0.3

* add zeropad2d v0.3

* add zeropad2d v0.4

* add zeropad2d v0.5

* add zeropad2d v0.5 codestyle

* add zeropad2d v0.5 codestyle

* add zeropad2d v0.6 functional

* add zeropad2d v0.6 functional

* add zeropad2d v0.6 functional

81861f69

22 11月, 2021 1 次提交
- Z
  
  elu support alpha < 0 (#37316) · e3503de8
  由 zhupengyang 提交于 11月 22, 2021
  
  e3503de8
28 10月, 2021 1 次提交

ctc grad compute on gpu (#36756) · 54ef9d06

由 Hui Zhang 提交于 10月 28, 2021

* Revert "Align CTC grad scale same with ESPNet (#34729)"

This reverts commit 10f9644c.

* ctc grad compute on gpu

54ef9d06

26 10月, 2021 1 次提交

Add fused attention op backward and python layer. (#36498) · 5119428e

由 Li Min 提交于 10月 26, 2021

功能：本PR的目标是提高attention模块的计算性能。
为了减少框架层对op的调度开销，本PR通过在C++层手动实现attention模块，对外提供attention 大op；
为了减少防存开销，本PR采取了两种优化方法：
（1）在q,k,v计算时通过共享输入X，将该处的gemm，transpose和bias add从三次调用减少为一次；
（2）使用kernel融合优化技术，在不同cuda kernel之间通过寄存器传输数据；

5119428e

22 10月, 2021 1 次提交

Fused attention op forward (#35905) · d4906214

由 Li Min 提交于 10月 22, 2021

功能：本PR的目标是提高attention模块的计算性能。
为了减少框架层对op的调度开销，本PR通过在C++层手动实现attention模块，对外提供attention 大op；
为了减少防存开销，本PR采取了两种优化方法：
（1）在q,k,v计算时通过共享输入X，将该处的gemm，transpose和bias add从三次调用减少为一次；
（2）使用kernel融合优化技术，在不同cuda kernel之间通过寄存器传输数据；

d4906214

13 10月, 2021 2 次提交
- G
  fix BatchNorm for fp16 (#36376) · 8fd1b6ad
  由 Guoxia Wang 提交于 10月 13, 2021
```
* fix BatchNorm for fp16
```
  8fd1b6ad
- Y
  [PaddlePaddle hackathon] + ADD CELU (#36088) · d7064f04
  由 yujun 提交于 10月 13, 2021
```
* update

* update

* update

* try make CI pass

* doc typo

* update doc string
```
  d7064f04
17 9月, 2021 1 次提交
- X
  fix unpool doc, test=document_fix (#35806) · 652e655f
  由 xiaoting 提交于 9月 17, 2021
```
* fix unpool doc, test=document_fix

* fix typo for python example, test=document_fix
```
  652e655f
15 9月, 2021 1 次提交

Change the invoking method of settiem from numpy to set_value op when value isn't tensor (#35701) · 86d4af39

由 zyfncg 提交于 9月 15, 2021

* Change the invoking method of settiem from numpy to set_value op when value is not tensor

* fix the check logic for inplace in setitem

* fix the unittest problem caused by setitem doesn't support fp16

* modify some code format in setitem

86d4af39

06 9月, 2021 1 次提交

replase pass with error exception (#35367) · 5675042d

由 Feng Xing 提交于 9月 06, 2021

This PR adds error exception in fused transformer python interface.
The function body are not implemented (will be implemented later).
Following zhiqiu's comment in previous PR-35206 (merged already), it is better to raise an exception instead of using "pass".

5675042d

31 8月, 2021 1 次提交

transformer opt python files (#35206) · e2991555

由 Feng Xing 提交于 8月 31, 2021

This PR adds fused transformer python related files. It defines interface of fused transformer.

Fused transformer implements an optimized version of transformer layer (in python/paddle/nn/layer/transformer.py). In this PR, four layers (functions) are defined:
(1) FusedMultiHeadAttention: multi-head attention layer
(2) FusedFeedForward: feed forward layer
(3) FusedTransformerEncoderLayer: transformer encoder layer
(4) FusedTransformer: transformer layer

e2991555

27 8月, 2021 1 次提交

Add unpool2d op & Expose max_unpool2d API (#35056) · ceee71a0

由 xiaoting 提交于 8月 27, 2021

* add maxunppol2d op, test=develop

* fix typo, test=develop

* fix unpool unitest, test=develop

* fix unpool code-example, test=develop

* fix for unpool_op_unittest,test=develop

* fix example code, test=develop

* add noqa:F401, test=develop

* fix converage, test=develop

* fix unitest for unpool, test=develop

* rename unpool2d to unpool, test=develop

* rename unpool2d to unpool, test=develop

ceee71a0

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功