- 10 May 2019 (2 commits)

Committed by qingqing01
* Add conv2d_grad_grad_op
* Extract the cuDNN conv algorithm search code into conv_cudnn_helper.h.
  - It is now used in conv2d_grad_grad.
  - The search code in conv2d and conv2d_grad will be simplified in the next PR.
* Enhance and fix a bug in the gradient_checker unit tests.
* Support fetching empty variables, returning None in Python.

Committed by wopeizl
* rename the default version from '0.0.0' to 'latest'

- 08 May 2019 (1 commit)

Committed by baojun
* added lrn op test=develop
* Added CreateConstant method test=develop
* avoid duplicates test=develop

- 07 May 2019 (5 commits)

Committed by Zeng Jinle
* add use_cuda to inplace pass, test=develop
* add softmax_with_xe_inplace test, test=develop
* fix potential inplace bug test=develop
* add more skip vars in mem opt pass, test=develop
* follow comment, test=develop
* follow comments, move duplicate out arg check to program->graph, test=develop

Committed by baojun

Committed by Tao Luo
* remove unused FLAGS_warpctc_dir test=develop
* remove FLAGS_warpctc_dir test=develop

Committed by Kaipeng Deng
* add attr axis infershape. test=develop
* add CUDA kernel. test=develop
* fix unittest. test=develop
* fix unittest for soft_label. test=develop
* fix fp16 unittest. test=develop
* remove commented code. test=develop
* refine test for axis. test=develop
* add python api. test=develop
* fix doc. test=develop
* fix fp16 unittest. test=develop
* fix ngraph test. test=develop
* fix ENFORCE for test_imperative_transformer. test=develop
* fit for ngraph test. test=develop
* fix after rebase develop. test=develop
* fix doc. test=develop
* fix API.spec. test=develop
* fix test_layers. test=develop
* fix format. test=develop

Committed by Zhen Wang
* Add MovingAverageAbsMaxScale operator, which is only used for calculating the quantization scale.
* test=develop
* change the output into inplace. test=develop
* Revert "test=develop"
  This reverts commit 696cf62699ba1e1c98f61f7345ac7060010eb29a.
* Revert "change the output into inplace. test=develop"
  This reverts commit a19acd20f07eee82622701a3015e6e9c073a5e0b.
* test=develop.
* update the MovingAverageAbsMaxScaleOp test. test=develop

- 06 May 2019 (2 commits)

Committed by jerrywgz
* fix distribute fpn proposals, test=develop

Committed by Zeng Jinle
* add use_cuda to inplace pass, test=develop
* add softmax_with_xe_inplace test, test=develop

- 05 May 2019 (3 commits)

Committed by jerrywgz
* enhance_concat, test=develop

Committed by wopeizl

Committed by tianshuo78520a
* test=develop
* test=develop

- 01 May 2019 (1 commit)

Committed by guru4elephant
* remove async executor python api test=develop
* remove test_async_executor.py, add executor train_from_dataset demo test=develop
* fix import bug test=develop

- 30 April 2019 (3 commits)

Committed by Zeng Jinle
* fix mem leak when converting Tensor to numpy array test=develop
* remove unused unittest, test=develop
* follow comments, test=develop
* fix dygraph bug, test=develop

Committed by Zeng Jinle
* fix op graph view test=develop
* rewrite inplace pass and fix reference count pass bug test=develop
* fix failed unittest test=develop
* follow comments, test=develop

Committed by xiaoting
* polish the label_smooth test=develop
* polish code test=develop
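As a reminder of what the layer computes (a minimal fluid-style sketch; the class count of 10 and epsilon of 0.1 are illustrative, not from the commit): label smoothing mixes the one-hot target with a uniform prior, giving (1 - epsilon) * y + epsilon / num_classes.

```python
import paddle.fluid as fluid

label = fluid.layers.data(name="label", shape=[1], dtype="int64")
one_hot = fluid.layers.one_hot(input=label, depth=10)
# smoothed target = (1 - epsilon) * one_hot + epsilon / depth
smoothed = fluid.layers.label_smooth(label=one_hot, epsilon=0.1)
```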

- 29 April 2019 (2 commits)

Committed by tangwei12
cvm without LoD.

Committed by Jiancheng Li

- 28 April 2019 (1 commit)

Committed by Zeng Jinle
* refine_dropout_mem, test=develop
* This is a combination of 14 commits:
  1. remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)
  2. Fleet unify distributed training (#16791)
     * implement distributed transpiler with fleet
  3. ParallelDyGraph with GPU collective mode (#16827)
     implement dygraph.parallel.DataParallel to hook reduce op.
  4. Init mixed precision training interface (#16856)
     * Init mixed precision training interface
     * Add fp16 test script test=develop
     * All initializers support float16 test=develop
     * Code cleanup & add more code annotations test=develop
     * Update API spec test=develop
     * Add usage example in doc test=develop
  5. fix reference_count_pass, test=develop (#17060)
     test=develop
  6. Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)
     * Cache the information of linear interpolation in forward and use it in backward. test=develop
     * Fix cuda kernel. test=develop
  7. remove unnecessary prepare_data (#17080)
     test=develop
  8. fix interpolate cu. test=develop (#17101)
  9. test=develop, double backward leaky_relu (#17067)
     backward of backward: leaky_relu
  10. fix fuse optimizer ops (#17102)
      test=develop
  11. truncated_gaussian_random supported in distributed training, test=develop (#17091)
  12. Detailed coordinate description for yolov3 loss (#17007)
      * Detailed coordinate description for yolov3 loss test=develop
      * modified api.spec test=develop
      * modified loss name
      * fix api.spec test=develop
      * polish description test=develop
      * modified api.spec test=develop
  13. fix test_weight_decay (#17109)
      test=develop
  14. Path flag (#17105)
      * fix python/paddle/fluid/__init__.py detecting problems

- 27 April 2019 (1 commit)

Committed by chengduo
test=develop

- 26 April 2019 (2 commits)

Committed by ceci3
backward of backward: leaky_relu

Committed by Kaipeng Deng

- 25 April 2019 (5 commits)

Committed by whs
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)
* Cache the information of linear interpolation in forward and use it in backward. test=develop
* Fix cuda kernel. test=develop

Committed by Yibing Liu
* Init mixed precision training interface
* Add fp16 test script test=develop
* All initializers support float16 test=develop
* Code cleanup & add more code annotations test=develop
* Update API spec test=develop
* Add usage example in doc test=develop
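A rough sketch of how such an interface is typically used; the module path `fluid.contrib.mixed_precision.decorate` and the toy network below are assumptions for illustration, not code from this PR:

```python
import paddle.fluid as fluid

image = fluid.layers.data(name="image", shape=[784], dtype="float32")
label = fluid.layers.data(name="label", shape=[1], dtype="int64")
probs = fluid.layers.fc(input=image, size=10, act="softmax")
loss = fluid.layers.mean(fluid.layers.cross_entropy(input=probs, label=label))

optimizer = fluid.optimizer.SGD(learning_rate=0.001)
# decorate() wraps the optimizer so the forward/backward pass can run in fp16
# with loss scaling, while master weights stay in fp32
mp_optimizer = fluid.contrib.mixed_precision.decorate(optimizer)
mp_optimizer.minimize(loss)
```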

Committed by Yan Xu
implement dygraph.parallel.DataParallel to hook reduce op.
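A rough sketch of the multi-GPU dygraph pattern this enables (assumed fluid 1.x dygraph API; the FC layer, shapes, and helper calls are illustrative, not code from this PR):

```python
import numpy as np
import paddle.fluid as fluid

with fluid.dygraph.guard(fluid.CUDAPlace(fluid.dygraph.parallel.Env().dev_id)):
    strategy = fluid.dygraph.parallel.prepare_context()   # set up the collective context
    net = fluid.dygraph.nn.FC("fc", size=10)              # placeholder single-layer model
    model = fluid.dygraph.parallel.DataParallel(net, strategy)

    x = fluid.dygraph.to_variable(np.random.rand(4, 8).astype("float32"))
    loss = fluid.layers.reduce_mean(model(x))
    loss = model.scale_loss(loss)      # scale the loss by the trainer count
    loss.backward()
    model.apply_collective_grads()     # all-reduce gradients across GPUs (the hooked reduce op)
```

When launched with one process per GPU, each process computes its local gradients and the wrapper reduces them before the optimizer step.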

Committed by tangwei12
* implement distributed transpiler with fleet

Committed by tangwei12

- 24 April 2019 (1 commit)

- 23 April 2019 (3 commits)

Committed by chengduo
* Add fuse momentum ops

Committed by chengduo
test=develop

Committed by qingqing01
Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)
* Support backward of backward and a new gradient checker
* Rename decorators.py to decorator_helper.py, since Python on Windows CI has a decorators package.
1. Add ReluDoubleGradMaker when registering relu_grad.
2. Add a new gradient checker that compares the theoretical and numerical Jacobian. Check double gradients with double_grad_check.
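A minimal sketch of how such a checker is invoked from a unit test; the helper's location (fluid's unit-test `gradient_checker` module) and the exact keyword arguments are assumptions based on the commit description:

```python
import numpy as np
import paddle.fluid as fluid
import paddle.fluid.layers as layers
from paddle.fluid.tests.unittests import gradient_checker  # assumed location of the helper

with fluid.program_guard(fluid.Program(), fluid.Program()):
    x = layers.data('x', shape=[2, 4], append_batch_size=False, dtype='float64')
    x.persistable = True
    y = layers.relu(x)

    # keep inputs away from relu's kink at 0 so the numerical Jacobian is well defined
    x_arr = np.random.uniform(-1.0, 1.0, [2, 4]).astype('float64')
    x_arr[np.abs(x_arr) < 0.005] = 0.02

    # compare the theoretical and numerical Jacobian of the gradient of the gradient
    gradient_checker.double_grad_check([x], y, x_init=x_arr, place=fluid.CPUPlace(), eps=0.005)
```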

- 22 April 2019 (8 commits)

Committed by Zeng Jinle
* move gc test to op_test test=develop
* Revert "move gc test to op_test"
  This reverts commit cf15da65.
* enable gc test in some ops test=develop

Committed by chengduo
* fix random failure test=develop

Committed by Tao Luo
test=develop

Committed by wopeizl
* add parallel build script to ci test=develop
* 1. classify the test cases as single-card / two-card / multi-card types
  2. run test cases according to their run type

Committed by qingqing01
* Speed up affine_channel_op unit testing
* Add check in tensor_py
* Fix ONLY_CPU compiling

Committed by guomingz
Update the filter generation mechanism so that it can generate negative parameters. The original call, np.random.random(), couldn't simulate the conv/relu fusion case. test=develop
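The range issue is easy to see in plain numpy (illustrative shapes):

```python
import numpy as np

positive_only = np.random.random((3, 3))        # samples lie in [0.0, 1.0), never negative
signed = np.random.uniform(-1.0, 1.0, (3, 3))   # samples lie in [-1.0, 1.0), so signs vary
assert (positive_only >= 0).all()
```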

Committed by liuwei1031
* accelerate test_ir_memory_optimize_nlp, test=develop
* accelerate test_ir_memory_optimize_nlp, test=develop

Committed by guomingz
Rename the testcuda function to has_cuda; this eliminates unnecessary testing. test=develop