提交 · 01d7ccd4b65f5c1aa822c570c1d985804022e94a · Crayon鑫 / Paddle

03 4月, 2020 2 次提交
- Z
  Fix elementwise compile error, test=develop (#23381) · 01d7ccd4
  由 zhaoyuchen2018 提交于 4月 03, 2020
```
elementwise function used before definition then failed in cuda 8, move it ahead.
```
  01d7ccd4
- Z
  improve elementwise performance. (#23405) · 4fe9ca69
  由 zhaoyuchen2018 提交于 4月 03, 2020
```
* improve elementwise performance.

* Add contiguous check, test=develop
```
  4fe9ca69
29 3月, 2020 1 次提交

Improve elementwise performance. (#23001) · 58615a62

由 zhaoyuchen2018 提交于 3月 29, 2020

* Improve elementwise performance.

Elementwise performace is poor as walk into CommonGradBroadcastCUDA, add some new kernels for different data pattern.

* Add some cuda kernel to speedup common broadcast cases. test=develop

* Add more test cases and fix cuda kernel bug. test=develop

* Remove tests as cpu percision fails.test=develop

* Refine SplitDims, test=develop

* Change file mode, test=develop

58615a62

25 3月, 2020 1 次提交
- Z
  
  add Tensor::IsSharedBufferWith method, test=develop (#23175) · 7ca77a90
  由 Zeng Jinle 提交于 3月 25, 2020
  
  7ca77a90
17 1月, 2020 1 次提交
- Q
  
  Fix infer_shape in compling for elementwise_op (#22291) · 2d20869c
  由 qingqing01 提交于 1月 17, 2020
  
  2d20869c
19 11月, 2019 1 次提交
- D
  
  extend elementwise broadcast function (#20957) · 0e7baabe
  由 danleifeng 提交于 11月 19, 2019
  
  0e7baabe
10 10月, 2019 1 次提交
- D
  
  fix error message for elementwise_add/mul (#20283) · 3a0f93b3
  由 danleifeng 提交于 10月 10, 2019
  
  3a0f93b3
04 9月, 2019 1 次提交
- D
  elementwise broadcast function enhancement (#19536) · 8672e153
  由 danleifeng 提交于 9月 04, 2019
```
elementwise broadcast function enhancement
```
  8672e153
20 8月, 2019 1 次提交
- Z
  Fix elementwise performance poor issue (#19278) · 5296294d
  由 zhaoyuchen2018 提交于 8月 20, 2019
```
For small case use 1D block is better than 2D block.

Refer to this issue: #19275
```
  5296294d
14 6月, 2019 1 次提交
- Y
  Optimize fused_elewise_activation_grad op. (#18041) · 660c1a65
  由 Yiqun Liu 提交于 6月 14, 2019
```
test=develop
```
  660c1a65
20 5月, 2019 1 次提交

Double backward elementwise div (#17416) · 10b23a72

由 lvmengsi 提交于 5月 20, 2019

* double backward, elementwise_div

* fix dx empty. test=develop

* bug fix (#17392)

fix secure bug

* Eanble stack operator for a Ngraph, test=develop (#17406)

* fix sqrt_grad_grad unittest. test=develop (#17410)

* fix sqrt_grad_grad unittest. test=develop

* disable sqrt_grad_grad unittest. test=develop

* test=develop, fix unittest

* test=develop, fix unittest

* test=develop, fix unittest

* test=develop, fix bug

* fix unittest. test=develop

* fix unittest dx. test=develop

* tmp fix! for test... test=develop

* reduce tmp, test=develop

* test=develop, reduce tmp

* fix broadcast unittest. test=develop

* fix format. test=develop

* refine code. test=develop

* refine code. test=develop

* refine GetDoubleGradSafeTensor. test=develop

* fix format. test=develop

10b23a72

13 5月, 2019 1 次提交

add double grad for elementwise_mul op (#17255) · 8bae8590

由 Kaipeng Deng 提交于 5月 13, 2019

* add double grad for elementwise_mul. test=develop

* remove comment. test=develop

* fix grad sum. test=develop

* fix for axis expand. test=develop

* add test for axis expand. test=develop

8bae8590

08 5月, 2019 1 次提交

Refine elementwise kernel. (#16952) · 792443ef

由 zhaoyuchen2018 提交于 5月 08, 2019

* Refine elementwise kernel.

Add a simple cuda kernel if grad x and y both exist
Use 2D block cuda kernel to do broadcast.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

792443ef

24 1月, 2019 1 次提交
- C
  Clean elementwise_op_function (#15502) · bf91d11e
  由 chengduo 提交于 1月 24, 2019
```
test=develop
```
  bf91d11e
16 11月, 2018 1 次提交

Refine operator cmake (#14413) · a2d9b344

由 Wu Yi 提交于 11月 16, 2018

* wip simplify operator framework

* wip

* wip

* done test=develop

* clean test=develop

* fix test=develop

* fix deps test=develop

* fix cpu build test=develop

* fix tensorrt build test=develop

* fix tests test=develop

* fix test=develop

* fix cpu build test=develop

a2d9b344

14 11月, 2018 1 次提交
- P
  code style fix · 1a9008c4
  由 peizhilin 提交于 11月 14, 2018
```
test=develop
```
  1a9008c4
08 11月, 2018 1 次提交
- Z
  
  Revert "cherry picked windows patches." · ba8b5619
  由 Zhaolong Xing 提交于 11月 08, 2018
  
  ba8b5619
07 11月, 2018 1 次提交

Add fp16 backward support (#14202) · a9b5d42d

由 chengduo 提交于 11月 07, 2018

* add fp16 backward support
test=develop

* add sum_op fp16 test

* disable test_dist_save_load
test=develop

* add check_grad for sum

* add unit test for softmax_grad fp16
test=develop

* add scale_op unit test

* add mul_grad_op unit test for fp16

* add cross_entropy_grad and eman_grad unit test for fp16
test=develop

* fix cross_entropy unit test

* add pool2d fp16 unit test

* refine conv2d fp16 unit test
test=develop

* refine activation unit test
test=develop

* fix ci
test=develop

* follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
test=develop

a9b5d42d

05 11月, 2018 1 次提交
- P
  
  cpu build support · 9d67c1fb
  由 peizhilin 提交于 11月 05, 2018
  
  9d67c1fb
14 10月, 2018 1 次提交
- W
  
  compile in linux · 3ae96450
  由 wanghaoshuang 提交于 10月 14, 2018
  
  3ae96450
20 9月, 2018 1 次提交

Feature/op_fuse_pass (#12440) · d402234b

由 chengduo 提交于 9月 20, 2018

* Add Preface

* Add demo code

* Save file

* Refine code

* seems can work

* use elementwise strategy

* Use ElementwiseComputeEx

* Add comments

* extract functions from operator

* Refine code

* Follow comment

* code refine

* add op_fuse  pass

* add backward

* code refine

* use TopologySortOperations

* follow comments

* refine IsFusible

* code enhance

* fix op_fusion_pass

* refine code

* refine fuse_elemwise_act_op

* adjust the input and output

* refine logic

* add intermediate_edge

* disable inplace

* follow comments

* refine logic

* follow comments

* Remove the removable IntermediateOut

* change strategy

* code refine

* enable fuse backward

* code refine

* code refine

* rename unit test

* follow comments

d402234b

12 9月, 2018 1 次提交
- D
  
  add demo · c3e1fb5a
  由 dzhwinter 提交于 9月 12, 2018
  
  c3e1fb5a
03 9月, 2018 1 次提交
- D
  
  fix elementwise (#13146) · 856c26fa
  由 dzhwinter 提交于 9月 03, 2018
  
  856c26fa
30 8月, 2018 1 次提交

Enhance fused_elementwise_activation_op (#12837) · 3bd1d22a

由 chengduo 提交于 8月 30, 2018

* Enhance the function of fused_elementwise_activation_op

* enhance unit test

* Clean Code And Add Doc

* Add compound functors

* Fix doc and enhance unit test

* define Dx and Dy for d_binary_func

* add mul_scale

* add mul_scale

* add elementwise_mul

* code refine

* code refine

* add doc

* add  AsIntermediate

3bd1d22a

27 8月, 2018 1 次提交
- D
  
  operator module is done · cd8f3e9e
  由 dzhwinter 提交于 8月 27, 2018
  
  cd8f3e9e
20 8月, 2018 1 次提交
- T
  
  fix SEGV elementwise add at debug mode · 0507f7bc
  由 tensor-tang 提交于 8月 20, 2018
  
  0507f7bc
17 8月, 2018 1 次提交
- D
  Revert ""cherry picked operators changes" (#12184)" (#12747) · 4069262f
  由 dzhwinter 提交于 8月 17, 2018
```
This reverts commit bf3c3496.
```
  4069262f
16 8月, 2018 1 次提交

"cherry picked operators changes" (#12184) · bf3c3496

由 dzhwinter 提交于 8月 16, 2018

* "cherry picked operators changes"

* "remove duplicated code"

* "add constant setter"

* "add get expected kernel"

* "fix ci"

* "add fill constant"

bf3c3496

10 8月, 2018 1 次提交
- D
  
  "fix style" (#12600) · 8499559c
  由 dzhwinter 提交于 8月 10, 2018
  
  8499559c
01 8月, 2018 1 次提交

explicit gradient of elementwise_add/elementwise_sub (#11970) · 595a2c83

由 dzhwinter 提交于 8月 01, 2018

* "add gradient register"

* "make some enhance"

* "better format"

* "fix typo"

* "fix reuse"

* "fix get expected kernel"

* "change the mkldnn code"

* "fix mkldnn"

* "fix mkldnn failed test"

* "add comment"

595a2c83

03 5月, 2018 1 次提交
- C
  Fix __shfl_down_sync_ of cross_entropy (#10345) · 4fbde42c
  由 chengduo 提交于 5月 03, 2018
```
* fix __shfl_down_sync_ of cross_entropy

* use reduceSum

* "fix ci"
```
  4fbde42c
30 4月, 2018 1 次提交
- D
  Feature/cuda9 cudnn7 (#10140) · eb6f9dd5
  由 dzhwinter 提交于 4月 30, 2018
```
* "re-commit "

* "picked up"

* "fix ci"

* "fix pdb hang up issue in cuda 9"
```
  eb6f9dd5
24 4月, 2018 1 次提交
- C
  
  fix elementwise_grad op kernel and add unit test · d06c79c7
  由 chengduoZH 提交于 4月 24, 2018
  
  d06c79c7
10 4月, 2018 1 次提交
- C
  Move reduceSum to elementwise_op_function.h (#9773) · b1224da8
  由 chengduo 提交于 4月 10, 2018
```
* add cuda_device_functions.h

* move reduceSum to elementwise_op_function.h
```
  b1224da8
06 3月, 2018 1 次提交
- C
  
  refine elementwise_mul_op · a1331f98
  由 chengduoZH 提交于 3月 06, 2018
  
  a1331f98
28 2月, 2018 1 次提交

Correctly handling variable with batch dimension for math ops. · e9b8ebf4

由 xuwei06 提交于 2月 22, 2018

When the second argument contains batch dimension, the axis should be 0.

Also makes elementwise ops more tolerant at handling tensors with trailing
singular dimensions.

e9b8ebf4

26 2月, 2018 1 次提交
- C
  
  refine Sum · b8938b44
  由 chengduoZH 提交于 2月 24, 2018
  
  b8938b44
24 2月, 2018 2 次提交
- C
  
  follow comments · a8288392
  由 chengduoZH 提交于 2月 24, 2018
  
  a8288392
- C
  
  refine Sum · 22b9ab05
  由 chengduoZH 提交于 2月 24, 2018
  
  22b9ab05
23 2月, 2018 1 次提交
- C
  
  fix get_mid_dims annotation (#8490) · 0e187bc9
  由 chengduo 提交于 2月 23, 2018
  
  0e187bc9

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致