提交 · 0b3c229606207acaca8b9625c2d7748c83ba7e2f · Crayon鑫 / Paddle

03 3月, 2021 1 次提交

[ROCM] update fluid elementwise op for rocm (part10), test=develop (#31361) · 7cdf6ea7

由 Qi Li 提交于 3月 03, 2021

* [ROCM] update fluid elementwise op for rocm (part10), test=develop

* update, test=develop

* address review comments, test=develop

7cdf6ea7

03 2月, 2021 1 次提交
- W
  fix the broadcast for the large second input (#30818) · b7560a59
  由 wawltor 提交于 2月 03, 2021
```
fix the broadcast for the large second input 
```
  b7560a59
10 1月, 2021 1 次提交
- W
  reduce the occupied size of memory for the fused pattern of elementwise_add... · af80859d
  由 wangchaochaohu 提交于 1月 10, 2021
```
reduce the  occupied size  of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
```
  af80859d
05 8月, 2020 1 次提交
- Z
  add eltwise clip cuda impl. (#25689) · 5970871a
  由 Zhaolong Xing 提交于 8月 05, 2020
```
test=develop
```
  5970871a
16 6月, 2020 1 次提交
- L
  
  fix dtype error of compare op, test=develop (#25059) · 028de857
  由 Leo Chen 提交于 6月 16, 2020
  
  028de857
12 5月, 2020 1 次提交
- W
  Fix the elementwise ops in broadcast in the process of backward (#24319) · 2de5075a
  由 wawltor 提交于 5月 12, 2020
```
* Remove the error in the elementwise op, use the backup mode to calculate
```
  2de5075a
11 5月, 2020 1 次提交

Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f

由 Chen Weihang 提交于 5月 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

aa0f254f

13 4月, 2020 1 次提交

elementwise ops error message enhancement，the python error message had add before · 289edf39

由 LutaoChu 提交于 4月 13, 2020

Those ops add the kernel message enhancement, as follows
paddle.fluid.layers.elementwise_add	
paddle.fluid.layers.elementwise_div
paddle.fluid.layers.elementwise_floordiv
paddle.fluid.layers.elementwise_max	
paddle.fluid.layers.elementwise_min	
paddle.fluid.layers.elementwise_mod	
paddle.fluid.layers.elementwise_mul	
paddle.fluid.layers.elementwise_pow	
paddle.fluid.layers.elementwise_sub

289edf39

03 4月, 2020 2 次提交
- Z
  Fix elementwise compile error, test=develop (#23381) · 01d7ccd4
  由 zhaoyuchen2018 提交于 4月 03, 2020
```
elementwise function used before definition then failed in cuda 8, move it ahead.
```
  01d7ccd4
- Z
  improve elementwise performance. (#23405) · 4fe9ca69
  由 zhaoyuchen2018 提交于 4月 03, 2020
```
* improve elementwise performance.

* Add contiguous check, test=develop
```
  4fe9ca69
29 3月, 2020 1 次提交

Improve elementwise performance. (#23001) · 58615a62

由 zhaoyuchen2018 提交于 3月 29, 2020

* Improve elementwise performance.

Elementwise performace is poor as walk into CommonGradBroadcastCUDA, add some new kernels for different data pattern.

* Add some cuda kernel to speedup common broadcast cases. test=develop

* Add more test cases and fix cuda kernel bug. test=develop

* Remove tests as cpu percision fails.test=develop

* Refine SplitDims, test=develop

* Change file mode, test=develop

58615a62

25 3月, 2020 1 次提交
- Z
  
  add Tensor::IsSharedBufferWith method, test=develop (#23175) · 7ca77a90
  由 Zeng Jinle 提交于 3月 25, 2020
  
  7ca77a90
17 1月, 2020 1 次提交
- Q
  
  Fix infer_shape in compling for elementwise_op (#22291) · 2d20869c
  由 qingqing01 提交于 1月 17, 2020
  
  2d20869c
19 11月, 2019 1 次提交
- D
  
  extend elementwise broadcast function (#20957) · 0e7baabe
  由 danleifeng 提交于 11月 19, 2019
  
  0e7baabe
10 10月, 2019 1 次提交
- D
  
  fix error message for elementwise_add/mul (#20283) · 3a0f93b3
  由 danleifeng 提交于 10月 10, 2019
  
  3a0f93b3
04 9月, 2019 1 次提交
- D
  elementwise broadcast function enhancement (#19536) · 8672e153
  由 danleifeng 提交于 9月 04, 2019
```
elementwise broadcast function enhancement
```
  8672e153
20 8月, 2019 1 次提交
- Z
  Fix elementwise performance poor issue (#19278) · 5296294d
  由 zhaoyuchen2018 提交于 8月 20, 2019
```
For small case use 1D block is better than 2D block.

Refer to this issue: #19275
```
  5296294d
14 6月, 2019 1 次提交
- Y
  Optimize fused_elewise_activation_grad op. (#18041) · 660c1a65
  由 Yiqun Liu 提交于 6月 14, 2019
```
test=develop
```
  660c1a65
20 5月, 2019 1 次提交

Double backward elementwise div (#17416) · 10b23a72

由 lvmengsi 提交于 5月 20, 2019

* double backward, elementwise_div

* fix dx empty. test=develop

* bug fix (#17392)

fix secure bug

* Eanble stack operator for a Ngraph, test=develop (#17406)

* fix sqrt_grad_grad unittest. test=develop (#17410)

* fix sqrt_grad_grad unittest. test=develop

* disable sqrt_grad_grad unittest. test=develop

* test=develop, fix unittest

* test=develop, fix unittest

* test=develop, fix unittest

* test=develop, fix bug

* fix unittest. test=develop

* fix unittest dx. test=develop

* tmp fix! for test... test=develop

* reduce tmp, test=develop

* test=develop, reduce tmp

* fix broadcast unittest. test=develop

* fix format. test=develop

* refine code. test=develop

* refine code. test=develop

* refine GetDoubleGradSafeTensor. test=develop

* fix format. test=develop

10b23a72

13 5月, 2019 1 次提交

add double grad for elementwise_mul op (#17255) · 8bae8590

由 Kaipeng Deng 提交于 5月 13, 2019

* add double grad for elementwise_mul. test=develop

* remove comment. test=develop

* fix grad sum. test=develop

* fix for axis expand. test=develop

* add test for axis expand. test=develop

8bae8590

08 5月, 2019 1 次提交

Refine elementwise kernel. (#16952) · 792443ef

由 zhaoyuchen2018 提交于 5月 08, 2019

* Refine elementwise kernel.

Add a simple cuda kernel if grad x and y both exist
Use 2D block cuda kernel to do broadcast.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

792443ef

24 1月, 2019 1 次提交
- C
  Clean elementwise_op_function (#15502) · bf91d11e
  由 chengduo 提交于 1月 24, 2019
```
test=develop
```
  bf91d11e
16 11月, 2018 1 次提交

Refine operator cmake (#14413) · a2d9b344

由 Wu Yi 提交于 11月 16, 2018

* wip simplify operator framework

* wip

* wip

* done test=develop

* clean test=develop

* fix test=develop

* fix deps test=develop

* fix cpu build test=develop

* fix tensorrt build test=develop

* fix tests test=develop

* fix test=develop

* fix cpu build test=develop

a2d9b344

14 11月, 2018 1 次提交
- P
  code style fix · 1a9008c4
  由 peizhilin 提交于 11月 14, 2018
```
test=develop
```
  1a9008c4
08 11月, 2018 1 次提交
- Z
  
  Revert "cherry picked windows patches." · ba8b5619
  由 Zhaolong Xing 提交于 11月 08, 2018
  
  ba8b5619
07 11月, 2018 1 次提交

Add fp16 backward support (#14202) · a9b5d42d

由 chengduo 提交于 11月 07, 2018

* add fp16 backward support
test=develop

* add sum_op fp16 test

* disable test_dist_save_load
test=develop

* add check_grad for sum

* add unit test for softmax_grad fp16
test=develop

* add scale_op unit test

* add mul_grad_op unit test for fp16

* add cross_entropy_grad and eman_grad unit test for fp16
test=develop

* fix cross_entropy unit test

* add pool2d fp16 unit test

* refine conv2d fp16 unit test
test=develop

* refine activation unit test
test=develop

* fix ci
test=develop

* follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
test=develop

a9b5d42d

05 11月, 2018 1 次提交
- P
  
  cpu build support · 9d67c1fb
  由 peizhilin 提交于 11月 05, 2018
  
  9d67c1fb
14 10月, 2018 1 次提交
- W
  
  compile in linux · 3ae96450
  由 wanghaoshuang 提交于 10月 14, 2018
  
  3ae96450
20 9月, 2018 1 次提交

Feature/op_fuse_pass (#12440) · d402234b

由 chengduo 提交于 9月 20, 2018

* Add Preface

* Add demo code

* Save file

* Refine code

* seems can work

* use elementwise strategy

* Use ElementwiseComputeEx

* Add comments

* extract functions from operator

* Refine code

* Follow comment

* code refine

* add op_fuse  pass

* add backward

* code refine

* use TopologySortOperations

* follow comments

* refine IsFusible

* code enhance

* fix op_fusion_pass

* refine code

* refine fuse_elemwise_act_op

* adjust the input and output

* refine logic

* add intermediate_edge

* disable inplace

* follow comments

* refine logic

* follow comments

* Remove the removable IntermediateOut

* change strategy

* code refine

* enable fuse backward

* code refine

* code refine

* rename unit test

* follow comments

d402234b

12 9月, 2018 1 次提交
- D
  
  add demo · c3e1fb5a
  由 dzhwinter 提交于 9月 12, 2018
  
  c3e1fb5a
03 9月, 2018 1 次提交
- D
  
  fix elementwise (#13146) · 856c26fa
  由 dzhwinter 提交于 9月 03, 2018
  
  856c26fa
30 8月, 2018 1 次提交

Enhance fused_elementwise_activation_op (#12837) · 3bd1d22a

由 chengduo 提交于 8月 30, 2018

* Enhance the function of fused_elementwise_activation_op

* enhance unit test

* Clean Code And Add Doc

* Add compound functors

* Fix doc and enhance unit test

* define Dx and Dy for d_binary_func

* add mul_scale

* add mul_scale

* add elementwise_mul

* code refine

* code refine

* add doc

* add  AsIntermediate

3bd1d22a

27 8月, 2018 1 次提交
- D
  
  operator module is done · cd8f3e9e
  由 dzhwinter 提交于 8月 27, 2018
  
  cd8f3e9e
20 8月, 2018 1 次提交
- T
  
  fix SEGV elementwise add at debug mode · 0507f7bc
  由 tensor-tang 提交于 8月 20, 2018
  
  0507f7bc
17 8月, 2018 1 次提交
- D
  Revert ""cherry picked operators changes" (#12184)" (#12747) · 4069262f
  由 dzhwinter 提交于 8月 17, 2018
```
This reverts commit bf3c3496.
```
  4069262f
16 8月, 2018 1 次提交

"cherry picked operators changes" (#12184) · bf3c3496

由 dzhwinter 提交于 8月 16, 2018

* "cherry picked operators changes"

* "remove duplicated code"

* "add constant setter"

* "add get expected kernel"

* "fix ci"

* "add fill constant"

bf3c3496

10 8月, 2018 1 次提交
- D
  
  "fix style" (#12600) · 8499559c
  由 dzhwinter 提交于 8月 10, 2018
  
  8499559c
01 8月, 2018 1 次提交

explicit gradient of elementwise_add/elementwise_sub (#11970) · 595a2c83

由 dzhwinter 提交于 8月 01, 2018

* "add gradient register"

* "make some enhance"

* "better format"

* "fix typo"

* "fix reuse"

* "fix get expected kernel"

* "change the mkldnn code"

* "fix mkldnn"

* "fix mkldnn failed test"

* "add comment"

595a2c83

03 5月, 2018 1 次提交
- C
  Fix __shfl_down_sync_ of cross_entropy (#10345) · 4fbde42c
  由 chengduo 提交于 5月 03, 2018
```
* fix __shfl_down_sync_ of cross_entropy

* use reduceSum

* "fix ci"
```
  4fbde42c
30 4月, 2018 1 次提交
- D
  Feature/cuda9 cudnn7 (#10140) · eb6f9dd5
  由 dzhwinter 提交于 4月 30, 2018
```
* "re-commit "

* "picked up"

* "fix ci"

* "fix pdb hang up issue in cuda 9"
```
  eb6f9dd5

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致