1. 30 Aug, 2019 1 commit
    • H
      [Cherry-pick] Support memory eager deletion on recurrent OP (#19411) · cb74dac3
      Huihuang Zheng committed
      * Support memory eager deletion on recurrent OP (#17710)
      
      Test PaddingRNN on V100 GPU device.
      
      Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
                         
      GPU memory (MiB):   6414 (this PR)     vs   6837 (without this PR)
      Speed (steps/s):         10.28 (this PR)    vs    9.89 (without this PR)
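
The relative gains implied by the two rows above can be worked out directly. This is only arithmetic on the numbers reported in the commit message; the variable names are illustrative.

```python
# Figures copied from the commit message (PaddingRNN, large model, one V100).
mem_with_pr, mem_without_pr = 6414, 6837        # GPU memory in MiB
speed_with_pr, speed_without_pr = 10.28, 9.89   # steps per second

# Absolute and relative memory saving.
mem_saved_mib = mem_without_pr - mem_with_pr
mem_saved_pct = 100.0 * mem_saved_mib / mem_without_pr

# Relative throughput gain.
speedup_pct = 100.0 * (speed_with_pr - speed_without_pr) / speed_without_pr

print(f"memory saved: {mem_saved_mib} MiB ({mem_saved_pct:.1f}%)")
print(f"speed gain:   {speedup_pct:.1f}%")
```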
      
      * Fix random test_recurrent_op failure (#18718)
      
      The change includes 3 things:
      
      1. Set CPU_NUM to 1 in the tests, because otherwise the ParallelExecutor prints a warning that CPU_NUM is not set and falls back to the default of 1.
      
      2. The old tests compared two RNNs, a hand-written simple RNN and the same RNN built with Paddle, but initialized their weights separately, one from numpy's random generator and one from Paddle's. Fixed by explicitly setting the same weight and bias values in both.
      
      3. Also set the numpy random seed in the tests. The allowed difference between the two RNNs can now be smaller (rtol reduced from 0.1 and 0.2 to 0.01).
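
The three points above can be illustrated with a NumPy-only sketch. It is not the actual `test_recurrent_op` code: both RNN functions, the shapes, and the seed value are hypothetical stand-ins, and the second function merely plays the role of the Paddle-built RNN. What it shows is the pattern: set `CPU_NUM` explicitly, seed the generator, feed both implementations the same weights and bias, then compare with a tight `rtol`.

```python
import os
import numpy as np

# Point 1: set CPU_NUM explicitly so no "not set" warning is triggered.
os.environ["CPU_NUM"] = "1"

def rnn_handwritten(x, w_x, w_h, b, h0):
    """Hand-written simple RNN: h_t = tanh(x_t W_x + h_{t-1} W_h + b)."""
    h, outputs = h0, []
    for x_t in x:
        h = np.tanh(x_t @ w_x + h @ w_h + b)
        outputs.append(h)
    return np.stack(outputs)

def rnn_reference(x, w_x, w_h, b, h0):
    """Second implementation of the same RNN (stand-in for the framework one)."""
    w = np.concatenate([w_x, w_h], axis=0)  # fused weight matrix
    h, outputs = h0, []
    for x_t in x:
        h = np.tanh(np.concatenate([x_t, h], axis=-1) @ w + b)
        outputs.append(h)
    return np.stack(outputs)

# Point 3: fix the seed so every run sees the same data (seed is arbitrary).
np.random.seed(123)
seq_len, batch, input_dim, hidden_dim = 5, 2, 3, 4
x = np.random.uniform(-0.1, 0.1, (seq_len, batch, input_dim))

# Point 2: create weights and bias once and hand the SAME arrays to both RNNs.
w_x = np.random.uniform(-0.1, 0.1, (input_dim, hidden_dim))
w_h = np.random.uniform(-0.1, 0.1, (hidden_dim, hidden_dim))
b = np.random.uniform(-0.1, 0.1, (hidden_dim,))
h0 = np.zeros((batch, hidden_dim))

out_a = rnn_handwritten(x, w_x, w_h, b, h0)
out_b = rnn_reference(x, w_x, w_h, b, h0)

# With shared parameters a tight tolerance such as rtol=0.01 is safe.
np.testing.assert_allclose(out_a, out_b, rtol=0.01)
```

Had each implementation drawn its own random weights, the outputs would differ by far more than any reasonable tolerance, which is why the old tests needed loose bounds and still failed at random.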
  2. 29 Aug, 2019 4 commits
  3. 28 Aug, 2019 1 commit
  4. 27 Aug, 2019 2 commits
  5. 26 Aug, 2019 4 commits
  6. 21 Aug, 2019 2 commits
  7. 20 Aug, 2019 1 commit
  8. 16 Aug, 2019 2 commits
  9. 29 Jul, 2019 2 commits
  10. 08 Jul, 2019 2 commits
  11. 05 Jul, 2019 3 commits
  12. 02 Jul, 2019 1 commit
  13. 01 Jul, 2019 3 commits
  14. 30 Jun, 2019 1 commit
  15. 29 Jun, 2019 2 commits
  16. 28 Jun, 2019 4 commits
  17. 27 Jun, 2019 1 commit
  18. 26 Jun, 2019 1 commit
  19. 25 Jun, 2019 3 commits
    • J
      Cherry pick install check (#18326) · cf4533d0
      Jiabin Yang committed
      * test=release/1.5, add multi-GPU install check
      
      * test=develop, refine code to use cuda_devices
    • H
      Sequence mask support tensor (#18249) (#18318) · c8d00cb2
      Hongyu Liu committed
      * sequence mask supports max length tensor input; test=develop
      
      * add rnn_impl.py; test=develop
      
      * add basic gru lstm unittest; test=develop
      
      * fix api spec; test=develop
      
      * fix sequence_mask op bug;
      test=develop
      test=document_preview
      
      * change +-*x to elementwise_op; test=develop
      
      * add mkl flag; test=develop
      
      * fix rnn impl bug; test=develop
      
      * update api spec; test=develop
      
      * fix doc bug; test=develop
      
      * fix lstm bugs; test=develop
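
For readers unfamiliar with the operation this commit extends, here is a NumPy sketch of sequence-mask semantics: position `j` of row `i` is 1 when `j < lengths[i]`. It is an illustration only, not Paddle's implementation; the function name, signature, and the way a one-element array is accepted for `maxlen` are assumptions made to mirror the "max length tensor input" idea.

```python
import numpy as np

def sequence_mask(lengths, maxlen=None, dtype="int64"):
    """Hypothetical NumPy sketch: mask[i, j] = 1 if j < lengths[i], else 0."""
    lengths = np.asarray(lengths)
    if maxlen is None:
        # Default to the longest sequence in the batch.
        maxlen = int(lengths.max())
    else:
        # Accept either a Python int or a one-element array ("tensor" input).
        maxlen = int(np.asarray(maxlen).reshape(-1)[0])
    positions = np.arange(maxlen)                      # shape: (maxlen,)
    return (positions < lengths[..., None]).astype(dtype)

# maxlen supplied as a one-element array rather than a plain integer.
mask = sequence_mask([3, 1, 2], maxlen=np.array([4]))
print(mask)
```

Each row of `mask` has as many leading ones as the corresponding length, padded with zeros up to `maxlen`.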
    • G
      cherry-pick from #17935 (#18051) · 5cd4bbfe
      Guo Sheng committed
      test=release/1.5
      
      * Fix the GetExpectedKernelType of add_position_encoding_op.
      
      * Fix the doc of lstm_unit outputs in nn.py.