Commits · a710738e4ff5eaec9924cf9ce9164daa6d8b8c1b · PaddlePaddle / Paddle

22 Feb, 2022 1 commit
- unset fluid in nn.others (#34935) · a710738e
  Committed by zhiboniu on Feb 22, 2022
16 Feb, 2022 1 commit
- [MLU] fix TensorAdd for mlu (#39523) · 24b8f63e
  Committed by fwenguang on Feb 16, 2022
10 Feb, 2022 2 commits

change dtype of pooling mask to 'int32' for Paddle2ONNX (#39314) · 29d31606
Committed by Wei Shengyu on Feb 10, 2022
```
* change dtype of pooling mask to 'int32' for Paddle2ONNX

* empty commit to rerun ci

* fix format
```

Modify the unsqueeze dimension of input data in conv1d NCL and NLC format (#38425) · 224bc511

Committed by crystal on Feb 10, 2022

* optimize conv1d forward

* add conv opt

* Optimize memory copy

* delete share data with

* set num_filters=512

* add nlc optimize

* Optimize num_filter=512 data on A100 and V100

* Fix the workspace_size size setting of filter


09 Feb, 2022 1 commit
- add more int type support for softmax_with_cross_entropy (#39409) · eaa3fd45
  Committed by sneaxiy on Feb 09, 2022
08 Feb, 2022 1 commit
- Make Embedding layer support more int ids type (#39381) · 60f1461a
  Committed by sneaxiy on Feb 08, 2022
```
* add more int id type support for embedding

* add ut

* add more ut

* fix ci error
```
12 Jan, 2022 1 commit

support 5d for nearest interp (#38868) · d296456c

Committed by xiaoting on Jan 12, 2022

* support 5d for nearest

* update nearest3d unittest, test=develop

* fix approve ci, test=develop

* fix approve ci, test=develop


10 Jan, 2022 11 commits
- replace where with min and max · e30150dd
  Committed by HydrogenSulfate on Jan 10, 2022
- update code · 3ab9ace5
  Committed by HydrogenSulfate on Dec 28, 2021
- add static label check · 09d4a3a4
  Committed by HydrogenSulfate on Dec 28, 2021
- replace .where to '==' · b4eec5d5
  Committed by HydrogenSulfate on Dec 28, 2021
- remove hard labels check · 51398ab9
  Committed by HydrogenSulfate on Dec 27, 2021
- change to IndexError · 7ddfec00
  Committed by HydrogenSulfate on Dec 27, 2021
- Remove the labels range check under the dynamic graph · 1e3e17df
  Committed by HydrogenSulfate on Dec 26, 2021
- Remove the labels range check under the dynamic graph · 87d9fdae
  Committed by HydrogenSulfate on Dec 26, 2021
- Remove the labels range check under the dynamic graph · 46e856c7
  Committed by HydrogenSulfate on Dec 26, 2021
- modify comment of mish (#38805) · 492e6dd0
  Committed by wangxinxin08 on Jan 10, 2022
- Add MaxUnPool3D op and MaxUnPool1D op (#38716) · 7e31542c
  Committed by andyjpaddle on Jan 10, 2022
```
* add maxunpool3d op

* update doc for maxunpool3d op

* update doc for maxunpool3d op

* update doc for maxunpool3d op

* update sample code for maxunpool3d

* add maxunpool1d op

* update some code for maxunpool1d
```
07 Jan, 2022 1 commit

modify mish op and add mish api (#38734) · 8c92337c

Committed by wangxinxin08 on Jan 07, 2022

* add mish operator and api

* remove redundant code and modify grad_atol of mish unittest

* modify mish code to be consistent with other activation implementation


31 Dec, 2021 1 commit

Add fold operators (#38613) · 8898dce1

Committed by xiaoting on Dec 31, 2021

* add fold operators, test=develop

* add fold operators, test=develop

* add fold operators, test=develop

* update fold op error test, test=develop

* fix unittest, test=develop

* fix unittest, test=develop

22 Dec, 2021 1 commit
- Replaced core.ops with _C_ops (#38337) · 242ef2b9
  Committed by Zhanlue Yang on Dec 22, 2021
16 Dec, 2021 1 commit

Add sparse_attention mask, test=develop (#37973) · fa463b90

Committed by Liu-xiandong on Dec 16, 2021

Add key_padding_mask and attn_mask to the sparse_attention API

1. The key padding mask is a tensor with dimensions [batch_size, seq_len], and the attention mask is a tensor with dimensions [seq_len, seq_len]. The data types of the two masks are consistent with Q, K, and V, which are float32 or float64. A value of 0 in a mask means that the position is masked.

2. The changed files are mainly paddle/fluid/operators/sparse_attention_op.cu and python/paddle/fluid/tests/unittests/test_sparse_attention_op.py. sparse_attention has three parts: sddmm, softmax, and dsd. Adding the mask only requires modifying the softmax part; the other two parts are unaffected. Related tests have been added to cover the mask function.
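
The mask semantics described for #37973 can be illustrated with a small dense sketch. This is not the CUDA kernel from sparse_attention_op.cu: it ignores the sparse layout, works on one batch item in plain Python, and the function name `masked_softmax` and the choice to return all zeros for a fully masked row are assumptions made for illustration only.

```python
import math

def masked_softmax(scores, key_padding_mask, attn_mask):
    """Row-wise softmax over scores[i][j] that skips masked positions.

    scores:           seq_len x seq_len attention scores for one batch item
    key_padding_mask: length seq_len; 0 marks a key position to be masked
    attn_mask:        seq_len x seq_len; 0 marks a masked (query, key) pair
    """
    out = []
    for i, row in enumerate(scores):
        # A position is kept only if neither mask holds a 0 for it.
        keep = [key_padding_mask[j] != 0 and attn_mask[i][j] != 0
                for j in range(len(row))]
        visible = [s for s, k in zip(row, keep) if k]
        if not visible:
            # Assumption: a fully masked row yields all zeros.
            out.append([0.0] * len(row))
            continue
        row_max = max(visible)  # subtract the max for numerical stability
        exps = [math.exp(s - row_max) if k else 0.0
                for s, k in zip(row, keep)]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out

scores = [[1.0, 2.0, 3.0], [1.0, 1.0, 1.0], [0.5, 0.5, 0.5]]
key_padding_mask = [1.0, 1.0, 0.0]           # last key is padding
attn_mask = [[1.0, 0.0, 1.0],                # query 0 may not see key 1
             [1.0, 1.0, 1.0],
             [1.0, 1.0, 1.0]]
probs = masked_softmax(scores, key_padding_mask, attn_mask)
```

Masked positions receive zero probability and the remaining entries in each row still sum to 1, which is why only the softmax stage needs to know about the masks.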

15 Dec, 2021 1 commit

Add New API nn.HingeEmbeddingLoss (#37540) · 3b85864a

Committed by Skr.B on Dec 15, 2021

* add hinge_embedding_loss

* fix test_API

* test_API succeed

* add English doc

* fixed using of expired fluid api

* fix doc

* fix doc and rm python/paddle/fluid/layers/loss.py

* get raw python/paddle/fluid/layers/loss.py back

* fix Examples bug in English doc

* unique -> flatten

* fix api code

* fix English doc

* fix functional loss English doc

* fix Example doc

* .numpy() -> paddle.unique()

* fix unique

* fix label_item_set

* modified judgment equation

* Got a beautiful loss equation

* use paddle.to_tensor

* fix loss and add static check

* fix loss and add static check

* delta -> margin


30 Nov, 2021 1 commit
- support data_format='NHWC' for prelu channel mode (#37019) · 3f2a665a
  Committed by Guoxia Wang on Nov 30, 2021
```
* support data_format='NHWC' for prelu channel mode
```
26 Nov, 2021 1 commit

Fix dropout static when axis != None (#37223) · f25fda37

Committed by smallv0221 on Nov 26, 2021

* fix dropout static when axis != None

* update dropout test

* add dropout test

* fix test

* Update test_dropout_op.py

* Update test_dropout_op.py

* fix testcase

* fix testcase

* Update test_dropout_op.py

* fix testcase

* fix testcase

* optimize perf

* add new test

* fix testcase


25 Nov, 2021 1 commit

[PaddlePaddle Hackathon] 6. Add ZeroPad2d to Paddle (#37151) · 81861f69

Committed by Matsumoto GAO on Nov 25, 2021

* add zeropad2d v0.1

* add zeropad2d v0.2

* add zeropad2d v0.3

* add zeropad2d v0.3

* add zeropad2d v0.3

* add zeropad2d v0.4

* add zeropad2d v0.5

* add zeropad2d v0.5 codestyle

* add zeropad2d v0.5 codestyle

* add zeropad2d v0.6 functional

* add zeropad2d v0.6 functional

* add zeropad2d v0.6 functional


22 Nov, 2021 1 commit
- elu support alpha < 0 (#37316) · e3503de8
  Committed by zhupengyang on Nov 22, 2021
18 Nov, 2021 1 commit
- Fix the slow running speed of kl_div when option 'reduction' is set (#37283) · a6e9ff85
  Committed by LielinJiang on Nov 18, 2021
```
* Fix the slow running speed of kl_div when option reduction is set

* fix unittest coverage
```
15 Nov, 2021 1 commit
- modify sparse_attention docs, test=document_fix (#36554) · 6b0cc2b1
  Committed by Liu-xiandong on Nov 15, 2021
```
* modify sparse_attention docs, test=develop

* add warning

* add warning, test=document_fix
```
12 Nov, 2021 1 commit
- [fix] fix the bug of fused_attention and fused_feedforward (#36972) · 6486e242
  Committed by zhangkaihuo on Nov 12, 2021
```
* fix bug:
1. atten: set the default value of attn_dropout_rate to None
2. ffn: add activation parameter
```
28 Oct, 2021 1 commit

ctc grad compute on gpu (#36756) · 54ef9d06

Committed by Hui Zhang on Oct 28, 2021

* Revert "Align CTC grad scale same with ESPNet (#34729)"

This reverts commit 10f9644c.

* ctc grad compute on gpu


26 Oct, 2021 1 commit
- Move fused_attention and fused_feedforward functional api path to incubate (#36704) · 9aeca2f1
  Committed by Li Min on Oct 26, 2021
```
Move the Python API interfaces added in PRs #35905 and #35843 into the incubate directory.
```
25 Oct, 2021 1 commit

add op: fused_feedforward(forward) (#35843) · b18cbfb2

Committed by zhangkaihuo on Oct 25, 2021

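This PR contains only the forward code of fused_feedforward.

Related kernel implementations: fused_dropout_act_bias, fused_residual_dropout_bias, fused_layernorm_residual_dropout_bias

fused_feedforward is a fused operator. It fuses and wraps the operators of the transformer model's feed-forward layer so that the front end exposes a single interface. Fusion reduces some memory accesses and kernel launch time, which improves performance.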
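
To make concrete which steps such a fused operator would replace, here is an unfused reference sketch of a transformer feed-forward block in plain Python. It is an assumption-laden illustration, not the code from #35843: the post-layer-norm ordering, the ReLU activation, the omission of dropout (as in inference) and of the layer-norm scale and shift, and all function names are hypothetical choices.

```python
import math

def linear(x, w, b):
    """Return x @ w + b for row-major nested lists."""
    return [[sum(row[k] * w[k][j] for k in range(len(w))) + b[j]
             for j in range(len(w[0]))] for row in x]

def layer_norm(row, eps=1e-5):
    """Normalize one row to zero mean and (almost) unit variance."""
    mean = sum(row) / len(row)
    var = sum((v - mean) ** 2 for v in row) / len(row)
    return [(v - mean) / math.sqrt(var + eps) for v in row]

def feed_forward_reference(x, w1, b1, w2, b2):
    """Unfused feed-forward block: each step is a separate pass over the data."""
    # Step 1: first linear layer plus bias, then activation (ReLU assumed).
    hidden = [[max(0.0, v) for v in row] for row in linear(x, w1, b1)]
    # Step 2: second linear layer plus bias (dropout omitted in this sketch).
    out = linear(hidden, w2, b2)
    # Step 3: residual connection with the block input.
    residual = [[a + b for a, b in zip(rx, ro)] for rx, ro in zip(x, out)]
    # Step 4: layer normalization (post-norm ordering is an assumption).
    return [layer_norm(r) for r in residual]

x = [[1.0, 2.0]]
w1 = [[1.0, 0.0, -1.0], [0.0, 1.0, 1.0]]
b1 = [0.0, 0.0, 0.0]
w2 = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
b2 = [0.0, 0.0]
y = feed_forward_reference(x, w1, b1, w2, b2)
```

Each numbered step reads and writes a full intermediate result. A fused implementation can combine neighbouring steps, for example bias add with activation, or residual add with layer norm, so fewer intermediates travel through memory and fewer kernels are launched.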

22 Oct, 2021 1 commit

Fused attention op forward (#35905) · d4906214

Committed by Li Min on Oct 22, 2021

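Purpose: this PR aims to improve the computational performance of the attention module.
To reduce the framework's op scheduling overhead, this PR implements the attention module by hand at the C++ level and exposes it as one large attention op.
To reduce memory-access overhead, this PR applies two optimizations:
(1) when computing q, k, and v, the input X is shared, which reduces the gemm, transpose, and bias add at that point from three calls to one;
(2) kernel fusion is used so that data is passed between different CUDA kernels through registers.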
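
Optimization (1) can be sketched as follows. Because q, k, and v are all projections of the same input X, the three weight matrices can be concatenated column-wise so that one matrix multiply and one bias add produce all three results. This plain-Python sketch only demonstrates the equivalence; the function names are hypothetical and it does not reflect the memory layout or transpose handling of the actual op.

```python
def matmul_bias(x, w, b):
    """Return x @ w + b for row-major nested lists."""
    return [[sum(row[k] * w[k][j] for k in range(len(w))) + b[j]
             for j in range(len(w[0]))] for row in x]

def qkv_separate(x, wq, bq, wk, bk, wv, bv):
    """Three independent projections: three gemm and bias-add calls."""
    return matmul_bias(x, wq, bq), matmul_bias(x, wk, bk), matmul_bias(x, wv, bv)

def qkv_shared(x, wq, bq, wk, bk, wv, bv):
    """One projection with concatenated weights, then a split of the columns."""
    w = [rq + rk + rv for rq, rk, rv in zip(wq, wk, wv)]  # column-wise concat
    b = bq + bk + bv
    y = matmul_bias(x, w, b)                              # single gemm + bias add
    dq, dk = len(bq), len(bk)
    q = [row[:dq] for row in y]
    k = [row[dq:dq + dk] for row in y]
    v = [row[dq + dk:] for row in y]
    return q, k, v

x = [[1.0, 2.0], [3.0, 4.0]]
wq, bq = [[1.0, 0.0], [0.0, 1.0]], [0.5, 0.5]
wk, bk = [[2.0, 1.0], [1.0, 2.0]], [0.0, 1.0]
wv, bv = [[0.0, 1.0], [1.0, 0.0]], [1.0, 0.0]

separate = qkv_separate(x, wq, bq, wk, bk, wv, bv)
shared = qkv_shared(x, wq, bq, wk, bk, wv, bv)
```

Both paths compute the same values; the shared path reads X once and issues one larger multiply instead of three smaller ones.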

19 Oct, 2021 1 commit
- fix out of range for area interp, test=develop (#36466) · 77f4597f
  Committed by xiaoting on Oct 19, 2021
18 Oct, 2021 1 commit
- [NPU] fix dtype for arg_max, test=develop (#36457) · 8757fc5b
  Committed by Qi Li on Oct 18, 2021
13 Oct, 2021 1 commit
- [PaddlePaddle hackathon] + ADD CELU (#36088) · d7064f04
  Committed by yujun on Oct 13, 2021
```
* update

* update

* update

* try make CI pass

* doc typo

* update doc string
```
12 Oct, 2021 3 commits
- Update loss.py · f77083bb
  Committed by HydrogenSulfate on Oct 11, 2021
- Update loss.py · 6cd41cec
  Committed by HydrogenSulfate on Oct 11, 2021
- Update loss.py · 3675f25d
  Committed by HydrogenSulfate on Oct 11, 2021

PaddlePaddle / Paddle: synced successfully about 1 year ago
