提交 · 92ce44522716b6006d0ace9a5df159b5bb39821f · PaddlePaddle / Paddle

29 4月, 2019 1 次提交
- T
  
  Supplementary monitoring file reason explanation (#17131) · 92ce4452
  由 tianshuo78520a 提交于 4月 29, 2019
  
  92ce4452
28 4月, 2019 2 次提交

Refine dropout gpu memory (#17095) · 28d69d71

由 Zeng Jinle 提交于 4月 28, 2019

* refine_dropout_mem,test=develop

* # This is a combination of 14 commits.
# The first commit's message is:
remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)

# This is the 2nd commit message:

Fleet unify distributed training (#16791)

* implement distributed transpiler with fleet
# This is the 3rd commit message:

ParallelDyGraph with GPU collective mode (#16827)

implement dygraph.parallel.DataParallel to hook reduce op.

# This is the 4th commit message:

Init mixed precision training interface (#16856)

* Init mixed precision training interface

* Add fp16 test script

test=develop

* All initializers support float16

test=develop

* Code cleanup & add more code annotations

test=develop

* Update API spec

test=develop

* Add usage example in doc

test=develop

# This is the 5th commit message:

fix reference_count_pass,test=develop (#17060)

test=develop
# This is the 6th commit message:

Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)

* Cache the information of linear interpolation in forward and use it in backward.
test=develop

* Fix cuda kernel.
test=develop

# This is the 7th commit message:

remove unnecessary prepare_data (#17080)

test=develop
# This is the 8th commit message:

fix interpolate cu. test=develop (#17101)

# This is the 9th commit message:

test=develop, double backward leaky_relu (#17067)

backward of backward: leaky_relu
# This is the 10th commit message:

fix fuse optimizer ops (#17102)

test=develop
# This is the 11th commit message:

truncated_gaussian_random supported in distributed training, test=develop (#17091)

# This is the 12th commit message:

 Detailed coordinate description for yolov3 loss (#17007)

* Detailed coordinate description for yolov3 loss

test=develop

* modified api.spec

test=develop

* modified loss name

* fix api.spec

test=develop

* polish description

test=develop

* modified api.spec

test=develop

# This is the 13th commit message:

fix test_weight_decay (#17109)

test=develop
# This is the 14th commit message:

Path flag (#17105)

* fix python/paddle/fluid/__init__.py detecting problems

28d69d71

Use CudnnWorkspaceHandle in exhaustive search (#17082) · b9494058

由 Huihuang Zheng 提交于 4月 28, 2019

1. Use CudnnWorkspaceHandle in exhaustive search of conv_cudnn.
2. For Ops using CudnnWorkspaceHandle in exhaustive search, release their GPU memory after exhaustive search.

test=develop

b9494058

27 4月, 2019 2 次提交
- T
  Path flag (#17105) · 2192e7bb
  由 tianshuo78520a 提交于 4月 27, 2019
```
* fix python/paddle/fluid/__init__.py detecting problems
```
  2192e7bb
- C
  fix test_weight_decay (#17109) · 9ccce576
  由 chengduo 提交于 4月 27, 2019
```
test=develop
```
  9ccce576
26 4月, 2019 6 次提交
- X
  Detailed coordinate description for yolov3 loss (#17007) · 7da7881c
  由 xiaoting 提交于 4月 26, 2019
```
* Detailed coordinate description for yolov3 loss

test=develop

* modified api.spec

test=develop

* modified loss name

* fix api.spec

test=develop

* polish description

test=develop

* modified api.spec

test=develop
```
  7da7881c
- T
  
  truncated_gaussian_random supported in distributed training, test=develop (#17091) · 7330cd63
  由 tangwei12 提交于 4月 26, 2019
  
  7330cd63
- C
  fix fuse optimizer ops (#17102) · 794a1958
  由 chengduo 提交于 4月 26, 2019
```
test=develop
```
  794a1958
- C
  test=develop, double backward leaky_relu (#17067) · 258e000b
  由 ceci3 提交于 4月 26, 2019
```
backward of backward: leaky_relu
```
  258e000b
- K
  
  fix interpolate cu. test=develop (#17101) · 10c487eb
  由 Kaipeng Deng 提交于 4月 26, 2019
  
  10c487eb
- T
  remove unnecessary prepare_data (#17080) · aca60e9a
  由 Tao Luo 提交于 4月 26, 2019
```
test=develop
```
  aca60e9a
25 4月, 2019 6 次提交
- W
  Speedup roi_perspective_transform op by caching the information of linear... · 55ce36e9
  由 whs 提交于 4月 25, 2019
```
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)

* Cache the information of linear interpolation in forward and use it in backward.
test=develop

* Fix cuda kernel.
test=develop
```
  55ce36e9
- Z
  fix reference_count_pass,test=develop (#17060) · 842ded14
  由 Zeng Jinle 提交于 4月 25, 2019
```
test=develop
```
  842ded14
- Y
  Init mixed precision training interface (#16856) · beda7825
  由 Yibing Liu 提交于 4月 25, 2019
```
* Init mixed precision training interface

* Add fp16 test script

test=develop

* All initializers support float16

test=develop

* Code cleanup & add more code annotations

test=develop

* Update API spec

test=develop

* Add usage example in doc

test=develop
```
  beda7825
- Y
  ParallelDyGraph with GPU collective mode (#16827) · 0b07eef1
  由 Yan Xu 提交于 4月 25, 2019
```
implement dygraph.parallel.DataParallel to hook reduce op.
```
  0b07eef1
- T
  Fleet unify distributed training (#16791) · 1a4a51db
  由 tangwei12 提交于 4月 25, 2019
```
* implement distributed transpiler with fleet
```
  1a4a51db
- T
  
  remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066) · e707119a
  由 tangwei12 提交于 4月 25, 2019
  
  e707119a
24 4月, 2019 8 次提交
- Z
  Merge pull request #17029 from wzzju/add_graph_checkpoint · b8c166f6
  由 Zhen Wang 提交于 4月 24, 2019
```
add checkpoint functions for graph. test=develop
```
  b8c166f6
- G
  Fix the bug of test_conv2d_int8_mkldnn case which raised by improper parameter passing (#17058) · 2deac4e4
  由 guomingz 提交于 4月 24, 2019
```
* resolve #17057

Fixed the bug that fuse_relu/fuse_residual option couldn't be passed to class TestConv2dInt8Op.

test=develop

* Fix the bug of test_conv2d_int8_mkldnn case which raised by improper parameter passing.

test=develop
```
  2deac4e4
- T
  Merge pull request #17048 from luotao1/fix_runtime_cache_bug · d9cd9898
  由 Tao Luo 提交于 4月 24, 2019
```
fix runtime_context_cache bug when gpu model has an op runs only on cpu
```
  d9cd9898
- W
  specify the cuda arch name and bin to decrease the compile time for i… (#17020) · f5d6937f
  由 wopeizl 提交于 4月 24, 2019
```
1. specify the cuda arch name and bin to decrease the compile time for inference test=develop
2. simplify the script and add comments
3. remove the fluid process from cicheck
```
  f5d6937f
- X
  Merge pull request #17063 from PaddlePaddle/shanyi15-patch-1-1 · f7caf7d4
  由 XiaoguangHu 提交于 4月 24, 2019
```
update pip version in Readme to 1.4.1
```
  f7caf7d4
- C
  update pip version in Readme to 1.4.1 · fd6a1b5d
  由 Cheerego 提交于 4月 24, 2019
```
test=develop
```
  fd6a1b5d
- C
  use fast executor as default (#17044) · cc316816
  由 chengduo 提交于 4月 24, 2019
```
test=develop
```
  cc316816
- X
  Merge pull request #17042 from shanyi15/update_release_1.4 · 30f2f457
  由 XiaoguangHu 提交于 4月 24, 2019
```
update Readme and releasenote for 1.4.1
```
  30f2f457
23 4月, 2019 14 次提交
- C
  Add fuse momenutum ops (#16745) · a2be4b4d
  由 chengduo 提交于 4月 23, 2019
```
* Add fuse momenutum ops
```
  a2be4b4d
- G
  Merge pull request #17005 from wopeizl/fix_ncclwrapper_win1 · 03d469ad
  由 guru4elephant 提交于 4月 23, 2019
```
fix nccl wrapper on windows
```
  03d469ad
- T
  
  load persistables with selected rows, test=develop (#17047) · 13295d90
  由 tangwei12 提交于 4月 23, 2019
  
  13295d90
- L
  fix runtime_context_cache bug when gpu model has an op runs only on cpu · 490e7462
  由 luotao1 提交于 4月 23, 2019
```
test=develop
```
  490e7462
- Z
  Make conv cudnn workspace size configurable (#17036) · 0c335dcd
  由 Zeng Jinle 提交于 4月 23, 2019
```
* make_conv_cudnn_ws_size_configurable, test=develop

* change std::max to std::min
test=develop
```
  0c335dcd
- J
  Merge pull request #17017 from jerrywgz/fix_potential_hung · ea3504c7
  由 jerrywgz 提交于 4月 23, 2019
```
fix potential hung in generate proposals, test=develop
```
  ea3504c7
- K
  Merge pull request #17043 from tink2123/fix_split · 52de7fd8
  由 Kaipeng Deng 提交于 4月 23, 2019
```
fix split for dimension judgment
```
  52de7fd8
- T
  Merge pull request #16990 from baojun-nervana/ng_cmake · 620b0541
  由 Tao Luo 提交于 4月 23, 2019
```
update ngraph version
```
  620b0541
- T
  fix split · 5e216fcf
  由 tink2123 提交于 4月 23, 2019
```
test=develop
```
  5e216fcf
- S
  
  update_release_1.4 · b612c465
  由 shanyi15 提交于 4月 23, 2019
  
  b612c465
- C
  fix test_parallel_executor_seresnet random fail (#17030) · e296e0fe
  由 chengduo 提交于 4月 23, 2019
```
test=develop
```
  e296e0fe
- T
  Merge pull request #17031 from luotao1/reduce_test_time · b3a11943
  由 Tao Luo 提交于 4月 23, 2019
```
reduce unittest time by rename testcuda to has_cuda
```
  b3a11943
- Q
  Support backward of backward for Relu and add a new gradient checker by... · c1c2633a
  由 qingqing01 提交于 4月 23, 2019
```
Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)

* Support backward of backward and a new gradient checker
* Rename decorators.py to decorator_helper.py, since Python on Windows CI has decorators package.

1. Add ReluDoubleGradMaker when register relu_grad.
2. Add a new gradient checker by comparing theoretical and numerical Jacobian.  Check double gradients by double_grad_check.
```
  c1c2633a
- L
  Merge pull request #17034 from seiriosPlus/fix/save_for_selected_rows · 63d9fe33
  由 lujun 提交于 4月 23, 2019
```
fix bug in save, test=develop
```
  63d9fe33
22 4月, 2019 1 次提交

Move gc test to each test of op (#16999) · f188b370

由 Zeng Jinle 提交于 4月 22, 2019

* move gc test to op_test
test=develop

* Revert "move gc test to op_test"

This reverts commit cf15da65.

* enable gc test in some ops
test=develop

f188b370

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功