提交 · cb74dac3816ed68d32bb8005252314b238470bc4 · Crayon鑫 / Paddle

30 8月, 2019 1 次提交

[Cherry-pick] Support memory eager deletion on recurrent OP (#19411) · cb74dac3

由 Huihuang Zheng 提交于 8月 30, 2019

* Support memory eager deletion on recurrent OP (#17710)

Test PaddingRNN on V100 GPU device.

Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
                   
GPU memory (MiB):   6414 (this PR)     vs   6837 (without this PR)
Speed (steps/s):         10.28 (this PR)    vs    9.89 (without this PR)

* Fix random test_recurrent_op failure (#18718)

The change includes 3 things:

1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1.

2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values.

3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests.

cb74dac3

28 3月, 2019 1 次提交
- S
  revert some loop op revision · 4c8254e3
  由 sneaxiy 提交于 3月 27, 2019
```
test=develop
```
  4c8254e3
27 3月, 2019 1 次提交
- S
  fix grad desc maker · 63651c19
  由 sneaxiy 提交于 3月 27, 2019
```
test=develop
```
  63651c19
05 3月, 2019 1 次提交
- S
  enhance gc · 597dc65e
  由 sneaxiy 提交于 3月 05, 2019
```
test=develop
```
  597dc65e
14 12月, 2018 1 次提交
- Y
  
  Fea/fuse conv elementwise add fuse (#14669) · a985949b
  由 Yan Chunwei 提交于 12月 14, 2018
  
  a985949b
16 11月, 2018 1 次提交

Refine operator cmake (#14413) · a2d9b344

由 Wu Yi 提交于 11月 16, 2018

* wip simplify operator framework

* wip

* wip

* done test=develop

* clean test=develop

* fix test=develop

* fix deps test=develop

* fix cpu build test=develop

* fix tensorrt build test=develop

* fix tests test=develop

* fix test=develop

* fix cpu build test=develop

a2d9b344

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致