1. 30 8月, 2019 1 次提交
    • H
      [Cherry-pick] Support memory eager deletion on recurrent OP (#19411) · cb74dac3
      Huihuang Zheng 提交于
      * Support memory eager deletion on recurrent OP (#17710)
      
      Test PaddingRNN on V100 GPU device.
      
      Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
                         
      GPU memory (MiB):   6414 (this PR)     vs   6837 (without this PR)
      Speed (steps/s):         10.28 (this PR)    vs    9.89 (without this PR)
      
      * Fix random test_recurrent_op failure (#18718)
      
      The change includes 3 things:
      
      1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1.
      
      2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values.
      
      3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests.
      cb74dac3
  2. 29 8月, 2019 3 次提交
  3. 28 8月, 2019 1 次提交
  4. 27 8月, 2019 2 次提交
  5. 26 8月, 2019 3 次提交
  6. 21 8月, 2019 2 次提交
  7. 20 8月, 2019 1 次提交
  8. 29 7月, 2019 2 次提交
  9. 08 7月, 2019 2 次提交
  10. 05 7月, 2019 1 次提交
  11. 29 6月, 2019 1 次提交
  12. 28 6月, 2019 1 次提交
  13. 25 6月, 2019 2 次提交
    • H
      Sequence mask support tensor (#18249) (#18318) · c8d00cb2
      Hongyu Liu 提交于
      * sequnce mask support max length tensor input; test=develop
      
      * add rnn_impl.py; test=develop
      
      * add basic gru lstm unittest; test=develop
      
      * fix api spec; test=develop
      
      * fix sequence_mask op bug;
      test=develop
      test=document_preview
      
      * change +-*x to elmentwise_op; test=develop
      
      * add mkl flag; test=develop
      
      * fix rnn impl bug; test=develop
      
      * update api spec; test=develop
      
      * fix doc bug; test=develop
      
      * fix lstm bugs; test=develop
      c8d00cb2
    • J
      Revert "Cherry pick install check for multi gpu" (#18312) · f6432604
      Jiabin Yang 提交于
      * Revert "Cherry pick install check for multi gpu (#18245)"
      
      This reverts commit d0219002.
      
      * test=release/1.5, ci start
      f6432604
  14. 24 6月, 2019 1 次提交
  15. 21 6月, 2019 1 次提交
    • J
      Cherry pick install check for multi gpu (#18245) · d0219002
      Jiabin Yang 提交于
      * test=develop, add add_multi_gpu_install_check (#18157)
      
      * test=develop, add add_multi_gpu_install_check
      
      * test=develop, refine warning doc
      
      * test=develop, refine warning doc
      
      * test=develop, refine warning doc
      
      * test=develop, support multi cpu
      
      * test=release/1.5, cherry-picked from develop
      d0219002
  16. 20 6月, 2019 2 次提交
  17. 19 6月, 2019 4 次提交
    • T
      Release/1.5 cherry pick (#18139) · 598addf1
      tangwei12 提交于
      * fix save/load in fleet (#17675)
      
      * fix save/load in Fleet
      * add UT framework of Fleet (#18058)
      
      * add paddle cloud role maker for customized usage, note this is only for industrial users that have cloud environment pre-configuration (#18121)
      
      add paddle cloud role maker for specific cloud usage. This pr will simplifies user's configuration in distributed training.
      
      * assign role_maker before use (#18137)
      598addf1
    • F
      Cherry pick retinanet_target_assign_op(#17893), sigmoid_focal_loss_op(#17895)... · 3305045c
      FlyingQianMM 提交于
      Cherry pick retinanet_target_assign_op(#17893), sigmoid_focal_loss_op(#17895) and retinanet_detection_output_op(#17896) for supporting retinanet (#18141)
      
      * test=release/1.5
      Fix conflicts in test_layers.py when adding target assign operator for supporting retinanet. Cherry pick #17893
      
      * test=release/1.5
      Add sigmoid focal loss operator for supporting retinanet. Cherry pick #17895
      
      * test=release/1.5
      Add detection output operator for supporting retinanet. Cherry pick #17896
      
      * test=release/1.5
      fix wrong code style in test_layers.py when cherry pick retinanet_target_assign #17893
      
      * test=release/1.5
      Fix type error of std::pow in sigmoid_focal_loss. Cherry pick #17895
      3305045c
    • K
      [cherry-pick] Fix logging to release/1.5 (#18026) · 7c7afef7
      Kaipeng Deng 提交于
      * fix logging unable. test=develop
      
      * unset sys.stdout for stream handler. test=develop
      
      * fix newly add basicConfig. test=develop
      
      * fix import error. test=release/1.5
      7c7afef7
    • C
      [Cherry Pick] Not init nccl when rank is 1 (#18170) · 041bc72c
      chengduo 提交于
      * remove nccl dep when the number of GPU is 1
      test=develop
      
      * use multi card run syncBN
      test=release/1.5
      041bc72c
  18. 18 6月, 2019 2 次提交
  19. 15 6月, 2019 2 次提交
  20. 14 6月, 2019 1 次提交
  21. 13 6月, 2019 2 次提交
  22. 12 6月, 2019 1 次提交
    • H
      Cherry-pick: fix random CI failure. (#17976) · 21554bcb
      Huihuang Zheng 提交于
      * Cherry-pick fix random Python3 CI failure.
      
      In some tests, SWEs used "print('xxx').format('xxx')". The syntax
      is only supported in Python2, not python3. However, since those
      lines are related to data download, if the CI machines already have
      the data, it passes CI tests. That causes random failure.
      
      * Cherry-pick: disable CUDNN case of test_warpctc_op
      
      test=release
      21554bcb
  23. 10 6月, 2019 2 次提交