提交 · 5dfe2ab9e883a9d2ea1f227730a26dc3d1a42cd2 · urhero / Paddle

30 4月, 2019 9 次提交
- Z
  Fix mem leak when converting Tensor to numpy array (#17182) · 5dfe2ab9
  由 Zeng Jinle 提交于 4月 30, 2019
```
* fix mem leak when converting Tensor to numpy array
test=develop

* remove unused unittest,test=develop

* follow comments, test=develop

* fix dygraph bug,test=develop
```
  5dfe2ab9
- H
  Fix a typo in gpu_info.cc (#17175) · e4a53324
  由 Huihuang Zheng 提交于 4月 30, 2019
```
test=develop
```
  e4a53324
- T
  fix bn fuse vardesc and add model saver (#17143) · 79ed1c76
  由 tensor-tang 提交于 4月 30, 2019
```
* fix bn fuse vardesc and add model saver

test=develop

* unify save model in test helper

test=develop

* fix mkdir on windows

test=develop

* remove magic number use bn bias var desc

test=develop
```
  79ed1c76
- Z
  Rewrite inplace pass and fix gc bug (#17126) · 4e1bc6e8
  由 Zeng Jinle 提交于 4月 29, 2019
```
* fix op graph view
test=develop

* rewrite inplace pass and fix reference count pass bug
test=develop

* fix unittest failed
test=develop

* follow comments, test=develop
```
  4e1bc6e8
- Z
  
  fix reader default stream,test=develop (#17106) · 08773b60
  由 Zeng Jinle 提交于 4月 29, 2019
  
  08773b60
- L
  fix python3 run_time_error in ops. test=develop (#17170) · aa5307ce
  由 Lfc1993 提交于 4月 30, 2019
```
fix python3 run_time_error in layers.ops caused by locals()
```
  aa5307ce
- G
  resolve #17159 (#17172) · e4a52e08
  由 guomingz 提交于 4月 30, 2019
```
Update the folder name generation mechanism for saving the quantized model and weights.
The folder name would be unique by adding the timestamp postfix.

test=develop
```
  e4a52e08
- X
  polish the label_smooth (#17138) · bc48453b
  由 xiaoting 提交于 4月 30, 2019
```
* polish the label_smooth

test=develop

* polish code

test=develop
```
  bc48453b
- L
  fix assertion failure issue when test_analyzer_bert uses ngraph (#17148) · bf4b21fa
  由 Leo Zhao 提交于 4月 30, 2019
```
resolve #17147
test=develop
```
  bf4b21fa
29 4月, 2019 5 次提交
- L
  fix run_time_error in uniform_random. test=develop (#17152) · 626922d3
  由 Lfc1993 提交于 4月 29, 2019
```
fix runtimeerror : dictionary changed size during iteration when calling uniform_random in python3+
```
  626922d3
- T
  cvm op feature (#17081) · deb510d4
  由 tangwei12 提交于 4月 29, 2019
```
cvm without LoD.
```
  deb510d4
- J
  
  test=develop fix bug: fix selected_indices in nms (#17140) · 554d3a71
  由 Jiancheng Li 提交于 4月 29, 2019
  
  554d3a71
- W
  1. move the API check into CPU process (#17110) · 3acb3635
  由 wopeizl 提交于 4月 29, 2019
```
* 1. move the API check into CPU process
2. adjust the check order
```
  3acb3635
- T
  
  Supplementary monitoring file reason explanation (#17131) · 92ce4452
  由 tianshuo78520a 提交于 4月 29, 2019
  
  92ce4452
28 4月, 2019 2 次提交

Refine dropout gpu memory (#17095) · 28d69d71

由 Zeng Jinle 提交于 4月 28, 2019

* refine_dropout_mem,test=develop

* # This is a combination of 14 commits.
# The first commit's message is:
remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)

# This is the 2nd commit message:

Fleet unify distributed training (#16791)

* implement distributed transpiler with fleet
# This is the 3rd commit message:

ParallelDyGraph with GPU collective mode (#16827)

implement dygraph.parallel.DataParallel to hook reduce op.

# This is the 4th commit message:

Init mixed precision training interface (#16856)

* Init mixed precision training interface

* Add fp16 test script

test=develop

* All initializers support float16

test=develop

* Code cleanup & add more code annotations

test=develop

* Update API spec

test=develop

* Add usage example in doc

test=develop

# This is the 5th commit message:

fix reference_count_pass,test=develop (#17060)

test=develop
# This is the 6th commit message:

Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)

* Cache the information of linear interpolation in forward and use it in backward.
test=develop

* Fix cuda kernel.
test=develop

# This is the 7th commit message:

remove unnecessary prepare_data (#17080)

test=develop
# This is the 8th commit message:

fix interpolate cu. test=develop (#17101)

# This is the 9th commit message:

test=develop, double backward leaky_relu (#17067)

backward of backward: leaky_relu
# This is the 10th commit message:

fix fuse optimizer ops (#17102)

test=develop
# This is the 11th commit message:

truncated_gaussian_random supported in distributed training, test=develop (#17091)

# This is the 12th commit message:

 Detailed coordinate description for yolov3 loss (#17007)

* Detailed coordinate description for yolov3 loss

test=develop

* modified api.spec

test=develop

* modified loss name

* fix api.spec

test=develop

* polish description

test=develop

* modified api.spec

test=develop

# This is the 13th commit message:

fix test_weight_decay (#17109)

test=develop
# This is the 14th commit message:

Path flag (#17105)

* fix python/paddle/fluid/__init__.py detecting problems

28d69d71

Use CudnnWorkspaceHandle in exhaustive search (#17082) · b9494058

由 Huihuang Zheng 提交于 4月 28, 2019

1. Use CudnnWorkspaceHandle in exhaustive search of conv_cudnn.
2. For Ops using CudnnWorkspaceHandle in exhaustive search, release their GPU memory after exhaustive search.

test=develop

b9494058

27 4月, 2019 2 次提交
- T
  Path flag (#17105) · 2192e7bb
  由 tianshuo78520a 提交于 4月 27, 2019
```
* fix python/paddle/fluid/__init__.py detecting problems
```
  2192e7bb
- C
  fix test_weight_decay (#17109) · 9ccce576
  由 chengduo 提交于 4月 27, 2019
```
test=develop
```
  9ccce576
26 4月, 2019 6 次提交
- X
  Detailed coordinate description for yolov3 loss (#17007) · 7da7881c
  由 xiaoting 提交于 4月 26, 2019
```
* Detailed coordinate description for yolov3 loss

test=develop

* modified api.spec

test=develop

* modified loss name

* fix api.spec

test=develop

* polish description

test=develop

* modified api.spec

test=develop
```
  7da7881c
- T
  
  truncated_gaussian_random supported in distributed training, test=develop (#17091) · 7330cd63
  由 tangwei12 提交于 4月 26, 2019
  
  7330cd63
- C
  fix fuse optimizer ops (#17102) · 794a1958
  由 chengduo 提交于 4月 26, 2019
```
test=develop
```
  794a1958
- C
  test=develop, double backward leaky_relu (#17067) · 258e000b
  由 ceci3 提交于 4月 26, 2019
```
backward of backward: leaky_relu
```
  258e000b
- K
  
  fix interpolate cu. test=develop (#17101) · 10c487eb
  由 Kaipeng Deng 提交于 4月 26, 2019
  
  10c487eb
- T
  remove unnecessary prepare_data (#17080) · aca60e9a
  由 Tao Luo 提交于 4月 26, 2019
```
test=develop
```
  aca60e9a
25 4月, 2019 6 次提交
- W
  Speedup roi_perspective_transform op by caching the information of linear... · 55ce36e9
  由 whs 提交于 4月 25, 2019
```
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)

* Cache the information of linear interpolation in forward and use it in backward.
test=develop

* Fix cuda kernel.
test=develop
```
  55ce36e9
- Z
  fix reference_count_pass,test=develop (#17060) · 842ded14
  由 Zeng Jinle 提交于 4月 25, 2019
```
test=develop
```
  842ded14
- Y
  Init mixed precision training interface (#16856) · beda7825
  由 Yibing Liu 提交于 4月 25, 2019
```
* Init mixed precision training interface

* Add fp16 test script

test=develop

* All initializers support float16

test=develop

* Code cleanup & add more code annotations

test=develop

* Update API spec

test=develop

* Add usage example in doc

test=develop
```
  beda7825
- Y
  ParallelDyGraph with GPU collective mode (#16827) · 0b07eef1
  由 Yan Xu 提交于 4月 25, 2019
```
implement dygraph.parallel.DataParallel to hook reduce op.
```
  0b07eef1
- T
  Fleet unify distributed training (#16791) · 1a4a51db
  由 tangwei12 提交于 4月 25, 2019
```
* implement distributed transpiler with fleet
```
  1a4a51db
- T
  
  remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066) · e707119a
  由 tangwei12 提交于 4月 25, 2019
  
  e707119a
24 4月, 2019 8 次提交
- Z
  Merge pull request #17029 from wzzju/add_graph_checkpoint · b8c166f6
  由 Zhen Wang 提交于 4月 24, 2019
```
add checkpoint functions for graph. test=develop
```
  b8c166f6
- G
  Fix the bug of test_conv2d_int8_mkldnn case which raised by improper parameter passing (#17058) · 2deac4e4
  由 guomingz 提交于 4月 24, 2019
```
* resolve #17057

Fixed the bug that fuse_relu/fuse_residual option couldn't be passed to class TestConv2dInt8Op.

test=develop

* Fix the bug of test_conv2d_int8_mkldnn case which raised by improper parameter passing.

test=develop
```
  2deac4e4
- T
  Merge pull request #17048 from luotao1/fix_runtime_cache_bug · d9cd9898
  由 Tao Luo 提交于 4月 24, 2019
```
fix runtime_context_cache bug when gpu model has an op runs only on cpu
```
  d9cd9898
- W
  specify the cuda arch name and bin to decrease the compile time for i… (#17020) · f5d6937f
  由 wopeizl 提交于 4月 24, 2019
```
1. specify the cuda arch name and bin to decrease the compile time for inference test=develop
2. simplify the script and add comments
3. remove the fluid process from cicheck
```
  f5d6937f
- X
  Merge pull request #17063 from PaddlePaddle/shanyi15-patch-1-1 · f7caf7d4
  由 XiaoguangHu 提交于 4月 24, 2019
```
update pip version in Readme to 1.4.1
```
  f7caf7d4
- C
  update pip version in Readme to 1.4.1 · fd6a1b5d
  由 Cheerego 提交于 4月 24, 2019
```
test=develop
```
  fd6a1b5d
- C
  use fast executor as default (#17044) · cc316816
  由 chengduo 提交于 4月 24, 2019
```
test=develop
```
  cc316816
- X
  Merge pull request #17042 from shanyi15/update_release_1.4 · 30f2f457
  由 XiaoguangHu 提交于 4月 24, 2019
```
update Readme and releasenote for 1.4.1
```
  30f2f457
23 4月, 2019 2 次提交
- C
  Add fuse momenutum ops (#16745) · a2be4b4d
  由 chengduo 提交于 4月 23, 2019
```
* Add fuse momenutum ops
```
  a2be4b4d
- G
  Merge pull request #17005 from wopeizl/fix_ncclwrapper_win1 · 03d469ad
  由 guru4elephant 提交于 4月 23, 2019
```
fix nccl wrapper on windows
```
  03d469ad

urhero / Paddle 与 Fork 源项目一致

urhero / Paddle
与 Fork 源项目一致