Commits · cb74dac3816ed68d32bb8005252314b238470bc4 · 机器未来 / Paddle

30 Aug, 2019 · 1 commit

[Cherry-pick] Support memory eager deletion on recurrent OP (#19411) · cb74dac3

Committed by Huihuang Zheng on Aug 30, 2019

* Support memory eager deletion on recurrent OP (#17710)

Test PaddingRNN on V100 GPU device.

Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
                   
GPU memory (MiB): 6414 (this PR) vs 6837 (without this PR)
Speed (steps/s):  10.28 (this PR) vs 9.89 (without this PR)
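
To put the two measurements above in relative terms, the following is a small arithmetic sketch. It only reuses the four numbers quoted in the commit message; the helper name `relative_change` is made up for illustration.

```python
def relative_change(new, old):
    """Return the change from old to new as a percentage of old."""
    return (new - old) / old * 100.0

# Figures quoted in the commit message (PaddingRNN, large model, one V100).
mem_with, mem_without = 6414, 6837        # GPU memory in MiB
speed_with, speed_without = 10.28, 9.89   # steps per second

mem_change = relative_change(mem_with, mem_without)
speed_change = relative_change(speed_with, speed_without)

print(f"GPU memory: {mem_change:+.1f}%")  # GPU memory: -6.2%
print(f"Speed: {speed_change:+.1f}%")     # Speed: +3.9%
```

In other words, the reported run uses roughly 6% less GPU memory and is roughly 4% faster with the PR applied.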

* Fix random test_recurrent_op failure (#18718)

The change includes three things:

1. Set CPU_NUM to 1 in the tests, because otherwise the ParallelExecutor prints a warning that CPU_NUM is not set and falls back to the default of 1.

2. The old tests compared two RNNs, a hand-written simple RNN and the same RNN built by Paddle, but initialized the RNN weights separately with numpy random and Paddle random. Fixed by setting the weight and bias values explicitly.

3. Also set the numpy random seed in the tests. The difference between the two RNNs can now be smaller in the tests (rtol reduced from 0.1 and 0.2 to 0.01).
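
The three points above can be sketched in plain numpy. This is not the actual test_recurrent_op code: `rnn_loop` and `rnn_precomputed` are hypothetical names, and the second function merely stands in for the RNN that the real test builds with Paddle. The point is that both implementations receive the same explicitly set weights, the seed is fixed, and the comparison can then use a tight tolerance.

```python
import os
import numpy as np

os.environ["CPU_NUM"] = "1"   # set explicitly, as item 1 describes
np.random.seed(123)           # fixed seed, as item 3 describes

def rnn_loop(x, h0, w, u, b):
    """Hand-written simple RNN: h_t = tanh(x_t @ w + h_{t-1} @ u + b)."""
    h, outs = h0, []
    for x_t in x:  # x has shape (seq_len, batch, input_dim)
        h = np.tanh(x_t @ w + h @ u + b)
        outs.append(h)
    return np.stack(outs)

def rnn_precomputed(x, h0, w, u, b):
    """Second implementation (stand-in for the framework-built RNN)."""
    xw = np.einsum("tbi,ih->tbh", x, w) + b  # input projection for all steps
    h, outs = h0, []
    for t in range(x.shape[0]):
        h = np.tanh(xw[t] + h @ u)
        outs.append(h)
    return np.stack(outs)

seq_len, batch, input_dim, hidden = 5, 2, 3, 4
x = np.random.rand(seq_len, batch, input_dim)
h0 = np.zeros((batch, hidden))
# Both RNNs share these weight and bias values, as item 2 describes.
w = np.random.rand(input_dim, hidden)
u = np.random.rand(hidden, hidden)
b = np.random.rand(hidden)

out_a = rnn_loop(x, h0, w, u, b)
out_b = rnn_precomputed(x, h0, w, u, b)
np.testing.assert_allclose(out_a, out_b, rtol=0.01)
```

If the two implementations drew their weights from separate random generators, the comparison would only pass with a much looser tolerance, which matches the flakiness the commit message describes.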

29 Aug, 2019 · 4 commits

[Cherry pick] Support feed single persistable variable to PE (#19435) · a7a4b72b
Committed by chengduo on Aug 29, 2019
```
* update executor feed
```
test=release/1.5, fix multiple Layers parameter missing error in dygraph mode (#19491) · 5860cc47
Committed by Jiabin Yang on Aug 29, 2019
```
This PR cherry-picks the fix for the multiple Layers parameter missing error in dygraph mode; the original PR is #18968
```

Distributed training cherry-pick for Release 1.5 (#19486) · 416922e2

Committed by tangwei12 on Aug 29, 2019

* fix bug in Class MultiSlotDataGenerator's function _gen_str, test=develop (#18222)
* fix some bug when merge sparse embedding parameters, test=develop (#18223)
* fix communicator with pyreader (#18350)
* delete AllocatorFacade destructor  (#18606)
* fix distribute transpiler GRPC error code 4, RPC Deadline (#18984)
* merge pr #18441

Fix arg do_model_average in param_attr (#19448) · 0edeb838
Committed by Yibing Liu on Aug 29, 2019
```
test=release/1.5
```
28 Aug, 2019 · 1 commit
- [Cherry pick] Remove unnecessary op when trainable is false (#19434) · 9048229b
  Committed by chengduo on Aug 28, 2019
```
* fix optimizer bug
test=develop
```
27 Aug, 2019 · 2 commits

test=release/1.5, fix problem that get_attr method can't using default mode... · 5b3d33bd

Committed by Jiabin Yang on Aug 27, 2019

test=release/1.5, fix problem that get_attr method can't using default mode when we call has_attr in dygraph (#19328) (#19414)

* add default getItem

* test=develop, fix has_attr disabled error in Layer

* test=develop, fix GroupNorm and deepcf bug on attrs

Fix depthwise conv gpu kernel bug (#18582) (#19392) · 07e7ebeb
Committed by LielinJiang on Aug 27, 2019
```
* fix depthwise conv gpu kernel bug, test=develop
* add more depthwise conv test, test=develop
```
26 Aug, 2019 · 4 commits

Make roi_perspective_transform op return mask and transform matrix,test=release/1.5 (#19391) · ec64f44f
Committed by LielinJiang on Aug 26, 2019
```
* make_roi_perspective_transform_op_return_mask_and_matrix

* make_roi_perspective_transform_op_return_mask_and_matrix
```
update parallel.py (#19371) · 1460648a
Committed by chengduo on Aug 26, 2019
```
test=release/1.5
```
CHERRY PICK FROM 18941, 18860, 19213: Fix Mask RCNN bug AND Paddle-TRT fp16 support (#19378) · 6fbd224e

Committed by Zhaolong Xing on Aug 26, 2019

* CHERRY_PICK 18941, 18860: TRT fp16 support.

test=release/1.5

* CHERRY_PICK 19213: Fix BUG: Mask RCNN inference diff When using AnalysisPredictor.
    1. fix affine channel fuse pass.
    2. fix condition block op.
    3. fix merge lod tensor op bug.
    4. fix memory optim cause by reset lod op.

    test=release/1.5

Fusion: seqpool_cvm_concat, test=release/1.5 (#19381) · fae79811
Committed by 石晓伟 on Aug 26, 2019
```
* add fusion_seqpool_cvm_concat test=develop

* simplify pass, test=develop

* fix code style, test=develop
```
21 Aug, 2019 · 2 commits

[Cherry Pick] Add error info during compile (#19300) · c737116c
Committed by chengduo on Aug 21, 2019
```
* Add call stack info during runtime and compile time
test=develop
```

[Cherry Pick] Bug fix and speedup dygraph multi-cards on v1.5 (#19298) · 71168dad

Committed by chengduo on Aug 21, 2019

* add warning info for CPU_NUM
test=develop

* update dygraph parallel.py
test=develop

* prune the feed op in compiler
test=release/1.5

* remove compile from PE
test=develop

* test CUDAPinnedPlace in reader
test=release/1.5

20 Aug, 2019 · 1 commit
- [Cherry pick] Fix register op without gradient (#19272) · 305bd25b
  Committed by chengduo on Aug 20, 2019
```
* fix REGISTER_OP_WITHOUT_GRADIENT
test=develop
```
16 Aug, 2019 · 2 commits
- merge from develop: add tensorrt support for win test=develop (#19172) · 1fd0ca82
  Committed by wopeizl on Aug 16, 2019
```
* merge from develop: add tensorrt support for win test=develop
```
- cherry pick #18761 (#19199) · 5a86891f
  Committed by silingtong123 on Aug 16, 2019
```
* fix warpctc dynamic library not found issue on mac and windows platform
```
29 Jul, 2019 · 2 commits
- [Cherry pick] Fix backward error (#18835) · cc3ba765
  Committed by chengduo on Jul 29, 2019
```
* fix backward bug
```
- fix affine_channel no_need buffer bug, test=release/1.5 (#18849) · 46c5345f
  Committed by Zeng Jinle on Jul 29, 2019
08 Jul, 2019 · 2 commits
- test=release/1.5, cherry-pick hide not_support for dygraph (#18528) · 7c73a68f
  Committed by Jiabin Yang on Jul 08, 2019
```
* test=release/1.5, cherry-pick hide not_support for dygraph

* test=release/1.5, cherry-pick hide not_support for dygraph
```
- cherry-pick Fix topk cannot handle 1D vector bug (#18466) · 856536b9
  Committed by zhaoyuchen2018 on Jul 08, 2019
```
Add path to handle 1D vector
```
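
The idea of a separate path for 1D input can be illustrated in pure Python. This is only a sketch of the concept and makes no claim about how the Paddle top-k kernel is written: the function `topk` below is hypothetical and simply treats a flat list as a single row.

```python
def topk(values, k):
    """Return (top values, indices) per row; a flat 1D list is treated as one row."""
    is_1d = not values or not isinstance(values[0], (list, tuple))
    rows = [values] if is_1d else values
    out_vals, out_idx = [], []
    for row in rows:
        # Sort indices by value, largest first; ties keep the lower index first.
        order = sorted(range(len(row)), key=lambda i: -row[i])[:k]
        out_idx.append(order)
        out_vals.append([row[i] for i in order])
    # Unwrap the single row again so 1D input yields 1D output.
    return (out_vals[0], out_idx[0]) if is_1d else (out_vals, out_idx)

print(topk([3, 9, 1, 7], 2))            # ([9, 7], [1, 3])
print(topk([[3, 9, 1], [8, 2, 5]], 2))  # ([[9, 3], [8, 5]], [[1, 0], [0, 2]])
```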
05 Jul, 2019 · 3 commits
- checkerrpick Make fuse_all_reduce_op_pass support mix_precision test=develop test=release (#18490) · 3232618a
  Committed by gongweibao on Jul 05, 2019
- cherry pick core remove pycpuinfo (#18505) · 24107006
  Committed by tensor-tang on Jul 05, 2019
```
test=release/1.5
```
- unaligned error in some examples(#18486) · 1413336a
  Committed by xsrobin on Jul 05, 2019
02 Jul, 2019 · 1 commit
- Cherrypick add import to examples lack of it (#18440) · 61b91926
  Committed by xsrobin on Jul 02, 2019
```
* test=develop

* test=develop
```
01 Jul, 2019 · 3 commits
- cherry pick fix mac ci random fail (#18437) · 5a9513b4
  Committed by tensor-tang on Jul 01, 2019
```
* fix mac ci random fail
* use platform instead

test=release/1.5
```
- replace mnist dataset url, test=develop (#18431) · 4bbcc2d6
  Committed by xiaoting on Jul 01, 2019
```
replace mnist dataset url
```
- cherry-pick: update api format (#18413) (#18421) · 55538c56
  Committed by hutuxian on Jul 01, 2019
30 Jun, 2019 · 1 commit
- cherry pick fix py-cpuinfo mac random fail (#18416) · 49884564
  Committed by tensor-tang on Jun 30, 2019
```
* fix py-cpuinfo mac random fail
* differentiate version on windows

test=release/1.5
```
29 Jun, 2019 · 2 commits
- init custom black white list (#18377) (#18417) · 5b540a19
  Committed by Yibing Liu on Jun 29, 2019
```
test=release/1.5
```
- [cherry-pick] Update lamb optimizer (#18333) (#18380) · 880fb833
  Committed by Yibing Liu on Jun 29, 2019
```
* Update lamb optimizer (#18333)

* Update lamb optimizer

* Regenerate api spec

test=release/1.5

* Give an experimental warning

test=release/1.5
```
28 Jun, 2019 · 4 commits
- Simplify multi_box_head API in detection.py and remove assign op. (#18310) (#18388) · 5b103c24
  Committed by qingqing01 on Jun 28, 2019
```
* Simplify multi_box_head API in detection.py and remove assign op.
```
- cherry pick, fix dygraph api doc, test=release/1.5 · 4ae7ea0a
  Committed by lujun on Jun 28, 2019
```
BackwardStrategy
dygraph.nn
dygraph.checkpoint
```
- add cuda_is_available (#18357) · 3cd78f6e
  Committed by chengduo on Jun 28, 2019
```
*  add cuda_is_available
test=release/1.5
```
- test=develop, disable basic gru related ut (#18329) (#18387) · b5556f2d
  Committed by Hongyu Liu on Jun 28, 2019
27 Jun, 2019 · 1 commit

Cherry pick Fix Bug-prone code of PE (#18355) · b09ba8a7

Committed by chengduo on Jun 27, 2019

* update pe reduce config
test=release/1.5

*  drop the local_exe_scopes of the previous parallel_executor
test=release/1.5

26 Jun, 2019 · 1 commit
- Fix checkpoint of Light-NAS (#18332) · e0cb6712
  Committed by whs on Jun 26, 2019
```
Socket can't be pickled.

test=release/1.5
```
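
The note "Socket can't be pickled" refers to a general Python limitation: `pickle` raises `TypeError` on socket objects. One common way to checkpoint an object that owns a socket is to drop the socket from the pickled state. The sketch below shows that general technique and is not taken from the Light-NAS code; the `Controller` class and its attributes are hypothetical.

```python
import pickle
import socket

class Controller:
    """Hypothetical object that owns a socket plus checkpointable state."""

    def __init__(self, step=0):
        self.step = step
        self.sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)

    def __getstate__(self):
        # Copy the instance dict and drop the socket, which cannot be pickled.
        state = self.__dict__.copy()
        state.pop("sock", None)
        return state

    def __setstate__(self, state):
        self.__dict__.update(state)
        self.sock = None  # the caller reconnects after loading a checkpoint

ctrl = Controller(step=42)

# Pickling the raw socket fails with TypeError.
try:
    pickle.dumps(ctrl.sock)
    raw_socket_pickled = True
except TypeError:
    raw_socket_pickled = False

# Pickling the owner works because __getstate__ leaves the socket out.
restored = pickle.loads(pickle.dumps(ctrl))
ctrl.sock.close()
```

After loading, `restored.step` carries the saved value and `restored.sock` is `None` until the connection is re-established.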
25 Jun, 2019 · 3 commits

Cherry pick install check (#18326) · cf4533d0

Committed by Jiabin Yang on Jun 25, 2019

* test=release/1.5, add multigpu install check

* test=develop, refine code to use cuda_devices


Sequence mask support tensor (#18249) (#18318) · c8d00cb2

Committed by Hongyu Liu on Jun 25, 2019

* sequence mask support max length tensor input; test=develop

* add rnn_impl.py; test=develop

* add basic gru lstm unittest; test=develop

* fix api spec; test=develop

* fix sequence_mask op bug;
test=develop
test=document_preview

* change +-*x to elementwise_op; test=develop

* add mkl flag; test=develop

* fix rnn impl bug; test=develop

* update api spec; test=develop

* fix doc bug; test=develop

* fix lstm bugs; test=develop
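
For readers unfamiliar with the operation this commit extends, a sequence mask marks which positions of each padded sequence are valid. The pure-Python sketch below illustrates the semantics only; it is not the Paddle op, and the function signature is an assumption chosen for the example. The `maxlen` argument plays the role of the maximum length that, per the commit message, can now be supplied as a tensor.

```python
def sequence_mask(lengths, maxlen=None):
    """Return a 0/1 mask with one row per sequence; entry [i][j] is 1 when j < lengths[i]."""
    if maxlen is None:
        maxlen = max(lengths)  # default: longest sequence in the batch
    return [[1 if j < n else 0 for j in range(maxlen)] for n in lengths]

print(sequence_mask([2, 3, 1]))
# [[1, 1, 0], [1, 1, 1], [1, 0, 0]]
print(sequence_mask([2, 3, 1], maxlen=4))
# [[1, 1, 0, 0], [1, 1, 1, 0], [1, 0, 0, 0]]
```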


cherry-pick from #17935 (#18051) · 5cd4bbfe

Committed by Guo Sheng on Jun 25, 2019

test=release/1.5

* Fix the GetExpectedKernelType of add_position_encoding_op.

* Fix the doc of lstm_unit outputs in nn.py.
