Commits · 696cf62699ba1e1c98f61f7345ac7060010eb29a · PaddlePaddle / Paddle

05 May, 2019 5 commits
- test=develop · 696cf626
  Committed by Zhen Wang on May 05, 2019
- Add MovingAverageAbsMaxScale operator which is only used for calculating the quantization scale. · ea72246f
  Committed by Zhen Wang on May 05, 2019
- Enhance concat op to support empty input. (#17015) · a72907bb
  Committed by jerrywgz on May 05, 2019
```
* enhance_concat, test=develop
```
- use two GPUs to run the exclusive test test=develop (#17187) · 83c4f772
  Committed by wopeizl on May 05, 2019
- Remove unnecessary set_devices (#17158) · 3c6ab799
  Committed by chengduo on May 05, 2019
```
* remove unnecessary set_devices
```
01 May, 2019 1 commit

remove async executor python api to fix document (#17174) · f938ccec

Committed by guru4elephant on May 01, 2019

* remove async executor python api
test=develop

* remove test_async_executor.py
add executor train_from_dataset demo
test=develop

* fix import bug
test=develop

30 Apr, 2019 7 commits
- Fix mem leak when converting Tensor to numpy array (#17182) · 5dfe2ab9
  Committed by Zeng Jinle on Apr 30, 2019
```
* fix mem leak when converting Tensor to numpy array
test=develop

* remove unused unittest,test=develop

* follow comments, test=develop

* fix dygraph bug,test=develop
```
- Fix a typo in gpu_info.cc (#17175) · e4a53324
  Committed by Huihuang Zheng on Apr 30, 2019
```
test=develop
```
- fix bn fuse vardesc and add model saver (#17143) · 79ed1c76
  Committed by tensor-tang on Apr 30, 2019
```
* fix bn fuse vardesc and add model saver

test=develop

* unify save model in test helper

test=develop

* fix mkdir on windows

test=develop

* remove magic number use bn bias var desc

test=develop
```
- Rewrite inplace pass and fix gc bug (#17126) · 4e1bc6e8
  Committed by Zeng Jinle on Apr 29, 2019
```
* fix op graph view
test=develop

* rewrite inplace pass and fix reference count pass bug
test=develop

* fix unittest failed
test=develop

* follow comments, test=develop
```
- fix reader default stream,test=develop (#17106) · 08773b60
  Committed by Zeng Jinle on Apr 29, 2019
- polish the label_smooth (#17138) · bc48453b
  Committed by xiaoting on Apr 30, 2019
```
* polish the label_smooth

test=develop

* polish code

test=develop
```
- fix assertion failure issue when test_analyzer_bert uses ngraph (#17148) · bf4b21fa
  Committed by Leo Zhao on Apr 30, 2019
```
resolve #17147
test=develop
```
29 Apr, 2019 3 commits
- cvm op feature (#17081) · deb510d4
  Committed by tangwei12 on Apr 29, 2019
```
cvm without LoD.
```
- 1. move the API check into CPU process (#17110) · 3acb3635
  Committed by wopeizl on Apr 29, 2019
```
* 1. move the API check into CPU process
2. adjust the check order
```
- Supplementary monitoring file reason explanation (#17131) · 92ce4452
  Committed by tianshuo78520a on Apr 29, 2019
28 Apr, 2019 2 commits

Refine dropout gpu memory (#17095) · 28d69d71

Committed by Zeng Jinle on Apr 28, 2019

* refine_dropout_mem,test=develop

* # This is a combination of 14 commits.
# The first commit's message is:
remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)

# This is the 2nd commit message:

Fleet unify distributed training (#16791)

* implement distributed transpiler with fleet
# This is the 3rd commit message:

ParallelDyGraph with GPU collective mode (#16827)

implement dygraph.parallel.DataParallel to hook reduce op.

# This is the 4th commit message:

Init mixed precision training interface (#16856)

* Init mixed precision training interface

* Add fp16 test script

test=develop

* All initializers support float16

test=develop

* Code cleanup & add more code annotations

test=develop

* Update API spec

test=develop

* Add usage example in doc

test=develop

# This is the 5th commit message:

fix reference_count_pass,test=develop (#17060)

test=develop
# This is the 6th commit message:

Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)

* Cache the information of linear interpolation in forward and use it in backward.
test=develop

* Fix cuda kernel.
test=develop

# This is the 7th commit message:

remove unnecessary prepare_data (#17080)

test=develop
# This is the 8th commit message:

fix interpolate cu. test=develop (#17101)

# This is the 9th commit message:

test=develop, double backward leaky_relu (#17067)

backward of backward: leaky_relu
# This is the 10th commit message:

fix fuse optimizer ops (#17102)

test=develop
# This is the 11th commit message:

truncated_gaussian_random supported in distributed training, test=develop (#17091)

# This is the 12th commit message:

 Detailed coordinate description for yolov3 loss (#17007)

* Detailed coordinate description for yolov3 loss

test=develop

* modified api.spec

test=develop

* modified loss name

* fix api.spec

test=develop

* polish description

test=develop

* modified api.spec

test=develop

# This is the 13th commit message:

fix test_weight_decay (#17109)

test=develop
# This is the 14th commit message:

Path flag (#17105)

* fix python/paddle/fluid/__init__.py detecting problems

Use CudnnWorkspaceHandle in exhaustive search (#17082) · b9494058

Committed by Huihuang Zheng on Apr 28, 2019

1. Use CudnnWorkspaceHandle in exhaustive search of conv_cudnn.
2. For Ops using CudnnWorkspaceHandle in exhaustive search, release their GPU memory after exhaustive search.

test=develop

27 Apr, 2019 1 commit
- Path flag (#17105) · 2192e7bb
  Committed by tianshuo78520a on Apr 27, 2019
```
* fix python/paddle/fluid/__init__.py detecting problems
```
26 Apr, 2019 5 commits
- Detailed coordinate description for yolov3 loss (#17007) · 7da7881c
  Committed by xiaoting on Apr 26, 2019
```
* Detailed coordinate description for yolov3 loss

test=develop

* modified api.spec

test=develop

* modified loss name

* fix api.spec

test=develop

* polish description

test=develop

* modified api.spec

test=develop
```
- fix fuse optimizer ops (#17102) · 794a1958
  Committed by chengduo on Apr 26, 2019
```
test=develop
```
- test=develop, double backward leaky_relu (#17067) · 258e000b
  Committed by ceci3 on Apr 26, 2019
```
backward of backward: leaky_relu
```
- fix interpolate cu. test=develop (#17101) · 10c487eb
  Committed by Kaipeng Deng on Apr 26, 2019
- remove unnecessary prepare_data (#17080) · aca60e9a
  Committed by Tao Luo on Apr 26, 2019
```
test=develop
```
25 Apr, 2019 4 commits

Speedup roi_perspective_transform op by caching the information of linear... · 55ce36e9

Committed by whs on Apr 25, 2019

Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)

* Cache the information of linear interpolation in forward and use it in backward.
test=develop

* Fix cuda kernel.
test=develop

fix reference_count_pass,test=develop (#17060) · 842ded14
Committed by Zeng Jinle on Apr 25, 2019
```
test=develop
```

Init mixed precision training interface (#16856) · beda7825

Committed by Yibing Liu on Apr 25, 2019

* Init mixed precision training interface

* Add fp16 test script

test=develop

* All initializers support float16

test=develop

* Code cleanup & add more code annotations

test=develop

* Update API spec

test=develop

* Add usage example in doc

test=develop

ParallelDyGraph with GPU collective mode (#16827) · 0b07eef1
Committed by Yan Xu on Apr 25, 2019
```
implement dygraph.parallel.DataParallel to hook reduce op.
```

24 Apr, 2019 2 commits
- specify the cuda arch name and bin to decrease the compile time for i… (#17020) · f5d6937f
  Committed by wopeizl on Apr 24, 2019
```
1. specify the cuda arch name and bin to decrease the compile time for inference test=develop
2. simplify the script and add comments
3. remove the fluid process from cicheck
```
- use fast executor as default (#17044) · cc316816
  Committed by chengduo on Apr 24, 2019
```
test=develop
```
23 Apr, 2019 5 commits

- Add fuse momenutum ops (#16745) · a2be4b4d
  Committed by chengduo on Apr 23, 2019
```
* Add fuse momenutum ops
```
- load persistables with selected rows, test=develop (#17047) · 13295d90
  Committed by tangwei12 on Apr 23, 2019
- fix runtime_context_cache bug when gpu model has an op runs only on cpu · 490e7462
  Committed by luotao1 on Apr 23, 2019
```
test=develop
```
- Make conv cudnn workspace size configurable (#17036) · 0c335dcd
  Committed by Zeng Jinle on Apr 23, 2019
```
* make_conv_cudnn_ws_size_configurable, test=develop

* change std::max to std::min
test=develop
```

Support backward of backward for Relu and add a new gradient checker by... · c1c2633a

Committed by qingqing01 on Apr 23, 2019

Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)

* Support backward of backward and a new gradient checker
* Rename decorators.py to decorator_helper.py, since Python on Windows CI has decorators package.

1. Add ReluDoubleGradMaker when register relu_grad.
2. Add a new gradient checker by comparing theoretical and numerical Jacobian.  Check double gradients by double_grad_check.
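
The idea of checking a gradient by comparing a theoretical Jacobian with a numerical one can be sketched as follows. This is a minimal pure-Python illustration for ReLU, not the checker added in this commit: every function name here (`relu_analytic_jacobian`, `numerical_jacobian`, `jacobians_match`) is hypothetical, and the step size and tolerance are assumed values.

```python
def relu(x):
    """Elementwise ReLU on a list of floats."""
    return [max(v, 0.0) for v in x]


def relu_analytic_jacobian(x):
    """Theoretical Jacobian of ReLU: diagonal, 1 where x > 0, else 0."""
    n = len(x)
    return [
        [(1.0 if x[i] > 0 else 0.0) if i == j else 0.0 for j in range(n)]
        for i in range(n)
    ]


def numerical_jacobian(f, x, eps=1e-6):
    """Central-difference estimate of df_i/dx_j (eps is an assumed step size)."""
    n = len(x)
    m = len(f(x))
    jac = [[0.0] * n for _ in range(m)]
    for j in range(n):
        x_plus = list(x)
        x_minus = list(x)
        x_plus[j] += eps
        x_minus[j] -= eps
        f_plus, f_minus = f(x_plus), f(x_minus)
        for i in range(m):
            jac[i][j] = (f_plus[i] - f_minus[i]) / (2 * eps)
    return jac


def jacobians_match(f, analytic, x, eps=1e-6, atol=1e-4):
    """Return True if the largest entrywise difference is within atol."""
    theoretical = analytic(x)
    numerical = numerical_jacobian(f, x, eps)
    worst = max(
        abs(t - n)
        for t_row, n_row in zip(theoretical, numerical)
        for t, n in zip(t_row, n_row)
    )
    return worst <= atol


if __name__ == "__main__":
    # Inputs are kept away from 0, where ReLU is not differentiable.
    print(jacobians_match(relu, relu_analytic_jacobian, [-1.5, 0.3, 2.0]))
```

A double-gradient check would apply the same comparison to the gradient function itself rather than to the forward function; that step is not shown here.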

22 Apr, 2019 5 commits
- fix bug in save, test=develop · 45136b1b
  Committed by tangwei12 on Apr 22, 2019
- Cmakelists fix (#17018) · 73a360b5
  Committed by tianshuo78520a on Apr 22, 2019
```
* fix cmakelist detecting problems
```
- add doc for memory_optimize, test=develop (#17010) · a770ce06
  Committed by liuwei1031 on Apr 22, 2019
```
* add doc for memory_optimize, test=develop

* update doc, test=develop

* doc update, test=develop
```
- add parallel build script to ci … (#16901) · d9991dcc
  Committed by wopeizl on Apr 22, 2019
```
* add parallel build script to ci test=develop
* 1. classify the test case as single card/two cards/multiple cards type
   2. run test case according to the run type
```
- fix potential hung in generate proposals, test=develop · b2df6de8
  Committed by jerrywgz on Apr 22, 2019
