提交 · 05df39ac06e0dbd99e43ce9e86e8b165470ba3cc · PaddlePaddle / Paddle

17 5月, 2019 2 次提交
- Y
  polish parallel dygraph code (#17164) · 02175555
  由 Yan Xu 提交于 5月 17, 2019
```
* add var grad hook test=develop
```
  02175555
- B
  
  fix assert,test=develop (#17445) · 3a9ae28d
  由 Bai Yifan 提交于 5月 17, 2019
  
  3a9ae28d
16 5月, 2019 2 次提交

Add conditional compile for gru opt (#17368) · b02f2aff

由 zhaoyuchen2018 提交于 5月 16, 2019

* improve gru unit performance.
refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Add conditional compile for gru opt

Not enable gru opt if compute ability < 700

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

b02f2aff

Z

fix recurrent_op,test=develop (#17433) · 712bfb17
由 Zeng Jinle 提交于 5月 16, 2019

712bfb17

15 5月, 2019 6 次提交
- M
  
  Eanble stack operator for a Ngraph, test=develop (#17406) · 6ee6700f
  由 mozga-intel 提交于 5月 15, 2019
  
  6ee6700f
- K
  Optimize the sequence padding op (#17403) · 0823a7bc
  由 Krzysztof Binias 提交于 5月 15, 2019
```
test=develop
```
  0823a7bc
- B
  
  NGraph Added fill_zeros_like op test=develop (#17295) · 1ce7b45b
  由 baojun 提交于 5月 14, 2019
  
  1ce7b45b
- B
  
  NGraph Added dropout and dropout_grad to ngraph test=develop (#17320) · 91019652
  由 baojun 提交于 5月 14, 2019
  
  91019652
- M
  
  Ngraph Enable gather operator test=develop (#17296) · b1894807
  由 mozga-intel 提交于 5月 14, 2019
  
  b1894807
- L
  Double backward sqrt (#17387) · 4ef63101
  由 lvmengsi 提交于 5月 15, 2019
```
* double backward sqrt

* refine unittest. test=develop

* refine test. test=develop

* remove alpha in unittest. test=develop
```
  4ef63101
14 5月, 2019 6 次提交

Double backward reduce mean (#17372) · 5d1ac41b

由 lvmengsi 提交于 5月 14, 2019

* test=develop, double backward reduce_mean

* add comment. test=develop

* fix format. test=develop

* rename GradGrad -> DoubleGrad. test=develop

* fix op_use_default_grad_op_maker.spec. test=develop

5d1ac41b

J

enhance generate mask labels, test=develop (#17380) · 0cae5a36
由 jerrywgz 提交于 5月 14, 2019

0cae5a36
K
add elementwise_add_grad_grad op (#17366) · bd9bef5a
由 Kaipeng Deng 提交于 5月 14, 2019
```
* add elementwise_add_grad_grad op. test=develop

* use defined GradMaker. test=develop
```
bd9bef5a
J
add collect fpn proposals op,test=develop (#16074) · 1c6d0646
由 jerrywgz 提交于 5月 14, 2019
```
* add collect fpn proposals op,test=develop
```
1c6d0646

support fc_op double grad (#17317) · 60be66e2

由 Kaipeng Deng 提交于 5月 14, 2019

* add double grad for mul_op. test=develop

* fix format. test=develop

* fix format. test=develop

* fix format. test=develop

* refine code. test=develop

* remove setzero. test=develop

* fix dx/dy init bug. test=develop

* fix format. test=develop

60be66e2

L
Fix the uninitialized gru_value.output_value. (#17197) · 08635993
由 liuwei1031 提交于 5月 14, 2019
```
test=develop
```
08635993

13 5月, 2019 4 次提交

Optimize the computing kernel of sequence_reverse operator (#17349) · 218d8d8f

由 Yihua Xu 提交于 5月 13, 2019

* Optimize the computing kernel of sequence_reverse operator.

test=develop

* Clean code

test=develop

* Fix for cpplint syntax checking.

test=develop

* Fix the compile warning issue.

test=develop

218d8d8f

Optimize the elementwise op using eigen (#15494) · dcda2023

由 Yiqun Liu 提交于 5月 13, 2019

* Optimize the elementwise op with CUDA kernels.
test=develop

* Support setting of attr in op config file.
test=develop

* Add the support the setting dtype and initializer in config.
test=develop

* Save workspace.

* Add initializer "zeros".
test=develop

* Fix compiling error.

* Support the use of existed file to initailize tensor in op_tester.

* Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims.
test=develop

dcda2023

add double grad for elementwise_mul op (#17255) · 8bae8590

由 Kaipeng Deng 提交于 5月 13, 2019

* add double grad for elementwise_mul. test=develop

* remove comment. test=develop

* fix grad sum. test=develop

* fix for axis expand. test=develop

* add test for axis expand. test=develop

8bae8590

add double grad for square op (#17173) · 11d3a38f

由 Kaipeng Deng 提交于 5月 13, 2019

* add double grad for square. test=develop

* formax code. test=develop

* fix for grad sum. test=develop

* refine shape. test=develop

* refine extract. test=develop

11d3a38f

10 5月, 2019 4 次提交

Z

Add Where Op(#16793) · d4b67e16
由 zhoukunsheng 提交于 5月 10, 2019

d4b67e16
Z

Add Diag Op(#17027) · 1bfff020
由 zhoukunsheng 提交于 5月 10, 2019

1bfff020

improve gru unit performance. (#16338) · 8a2caacd

由 zhaoyuchen2018 提交于 5月 10, 2019

refine code

fuse cublas  calling and kernels into one cuda kernel.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

8a2caacd

Double backward of conv2d. (#17211) · e32c9888

由 qingqing01 提交于 5月 10, 2019

* Add conv2d_grad_grad_op
* Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h.
    - Now use it in conv2d_grad_grad.
    - Will simply the searching code in conv2d and conv2d_grad in next PR.
* Enhance and fix bug in unit testing of gradient_checker.
* Support to fetch empty variables，return None in Python.

e32c9888

09 5月, 2019 2 次提交
- Z
  
  follow comments,test=develop (#17273) · fff270ea
  由 Zeng Jinle 提交于 5月 09, 2019
  
  fff270ea
- Z
  Mod floordiv (#17251) · 4292bd86
  由 zhoukunsheng 提交于 5月 09, 2019
```
* test=develop
add elementwise_mod and elementwise_floordiv, fix equation problem in elementwise_mod
```
  4292bd86
08 5月, 2019 8 次提交

X
modified formula for Lrn (#17281) · 9ed4aaad
由 xiaoting 提交于 5月 08, 2019
```
* modified formula for lrn

test=develop

* modified api.spec

test=develop
```
9ed4aaad

Refine elementwise kernel. (#16952) · 792443ef

由 zhaoyuchen2018 提交于 5月 08, 2019

* Refine elementwise kernel.

Add a simple cuda kernel if grad x and y both exist
Use 2D block cuda kernel to do broadcast.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

792443ef

Optimize the cuda implementation of sum_op (#17283) · 6b84688b

由 Yiqun Liu 提交于 5月 08, 2019

* Optimize the cuda implementation of sum_op, which add two lod_tensors inplace.
test=develop

* Use eigen to add to tensors.
test=develop

6b84688b

C
update assert (#17282) · db5e74ab
由 chengduo 提交于 5月 08, 2019
```
test=develop
```
db5e74ab

Fix concat shape check (#17247) · c3195de5

由 Hongyu Liu 提交于 5月 08, 2019

* fix shape_check; test=develop

* fix format; test=develop

* fix format; test=develop

* fix ddim bug; test=develop

* fix c++ format; test=develop

* change function name; test=develop

c3195de5

W

Fix bp of roi perspective transform op. (#17216) · 7d7e2995
由 whs 提交于 5月 08, 2019

7d7e2995

Adding lrn op for ngraph engine (#17189) · 7bd1d03e

由 baojun 提交于 5月 07, 2019

* added lrn op test=develop

* Added CreateConstant method test=develop

* avoid duplicates test=develop

7bd1d03e

G

Fix code in document. (#17237) · 91784f8e
由 gongweibao 提交于 5月 08, 2019

91784f8e

07 5月, 2019 6 次提交

Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225) · 4f859408

由 Zeng Jinle 提交于 5月 07, 2019

* add use_cuda to inplace pass,test=develop

* add test softmax_with_xe_inplace test,test=develop

* fix potential inplace bug
test=develop

* add more skip vars in mem opt pass,test=develop

* follow comment,test=develop

* follow comments,move duplicate out arg check to program->graph,test=develop

4f859408

B

update sofmax with axis arg test=develop (#17190) · e782b54b
由 baojun 提交于 5月 07, 2019

e782b54b

Softmax_cross_entropy op add axis (#16806) · a71d8fdb

由 Kaipeng Deng 提交于 5月 07, 2019

* add attr axis infershape. test=develop

* add CUDA kernel. test=develop

* fix unittest. test=develop

* fix unittest for soft_label. test=develop

* fix fp16 unittest. test=develop

* remove comment code. test=develop

* refine test for axis. test=develop

* add python api. test=develop

* fix doc. test=develop

* fix fp16 unittest. test=develop

* fix ngraph test. test=develop

* fix ENFORCE for test_imperative_transformer. test=develop

* fit for ngraph test. test=develop

* fix after rebase develop. test=develop

* fix doc. test=develop

* fix API.spec. test=develop

* fix test_layers. test=develop

* fix format. test=develop

a71d8fdb

Quant output scale (#17215) · a914d9b1

由 Zhen Wang 提交于 5月 07, 2019

* Add MovingAverageAbsMaxScale operator which is only used for calculating the quantization scale.

* test=develop

* change the output into inplace. test=develop

* Revert "test=develop"

This reverts commit 696cf626.

* Revert "change the output into inplace. test=develop"

This reverts commit a19acd20.

* test=develop.

* update the MovingAverageAbsMaxScaleOp test. test=develop

a914d9b1

optimize sum op (#16820) · 32b62c25

由 zhaoyuchen2018 提交于 5月 07, 2019

* optimize sum op

fuse multi eigen kernel calls into one cuda kernel.
refine code

test=develop.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code according to comments.

test=develop

* refine code

delete sum_op_gpu.h
test=develop

* Fix test error.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code in format.

test=develop.

* refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

32b62c25

石

Cherry-pick benchmark related changes from release/1.4 (#17156) · a72dbe9a

由石晓伟提交于 5月 07, 2019

* cherry-pick commit from 88770542

* cherry-pick commit from 3f0b97df

* cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn

(cherry picked from commit 8643dbc2)

* Cherry-Pick from 16662 : Anakin subgraph cpu support

(cherry picked from commit 7ad182e1)

* Cherry-pick from 1662, 16797.. : add anakin int8 support

(cherry picked from commit e14ab180)

* Cherry-pick from 16813 : change singleton to graph RegistBlock
test=release/1.4

(cherry picked from commit 4b9fa423)

* Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2

Support ShuffleNet and MobileNet-v2, test=release/1.4

(cherry picked from commit a6fb066f)

* Cherry-pick : anakin subgraph add opt config layout argument #16846
test=release/1.4

(cherry picked from commit 8121b3ec)

* 1. add shuffle_channel_detect

(cherry picked from commit 6efdea89)

* update shuffle_channel op convert, test=release/1.4

(cherry picked from commit e4726a06)

* Modify symbol export rules

test=develop

a72dbe9a

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功