1. 30 8月, 2019 1 次提交
    • H
      [Cherry-pick] Support memory eager deletion on recurrent OP (#19411) · cb74dac3
      Huihuang Zheng 提交于
      * Support memory eager deletion on recurrent OP (#17710)
      
      Test PaddingRNN on V100 GPU device.
      
      Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
                         
      GPU memory (MiB):   6414 (this PR)     vs   6837 (without this PR)
      Speed (steps/s):         10.28 (this PR)    vs    9.89 (without this PR)
      
      * Fix random test_recurrent_op failure (#18718)
      
      The change includes 3 things:
      
      1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1.
      
      2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values.
      
      3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests.
      cb74dac3
  2. 29 8月, 2019 1 次提交
    • T
      Distributed training cherry-pick for Release 1.5 (#19486) · 416922e2
      tangwei12 提交于
      * fix bug in Class MultiSlotDataGenerator's function _gen_str, test=develop (#18222)
      * fix some bug when merge sparse embedding parameters, test=develop (#18223)
      * fix communicator with pyreader (#18350)
      * delete AllocatorFacade destructor  (#18606)
      * fix distribute transpiler GRPC error code 4, RPC Deadline (#18984)
      * merge pr #18441
      416922e2
  3. 27 8月, 2019 1 次提交
  4. 26 8月, 2019 4 次提交
  5. 21 8月, 2019 1 次提交
  6. 20 8月, 2019 1 次提交
  7. 15 8月, 2019 1 次提交
  8. 29 7月, 2019 2 次提交
  9. 26 7月, 2019 1 次提交
  10. 25 7月, 2019 3 次提交
  11. 08 7月, 2019 2 次提交
  12. 05 7月, 2019 1 次提交
  13. 29 6月, 2019 1 次提交
  14. 28 6月, 2019 2 次提交
    • Q
      Simplify multi_box_head API in detection.py and remove assign op. (#18310) (#18388) · 5b103c24
      qingqing01 提交于
      * Simplify multi_box_head API in detection.py and remove assign op.
      5b103c24
    • Update the Anakin interfaces for content-dnn and MLU, test=release/1.5 (#18028) · 924e53b7
      石晓伟 提交于
      * Update the Anakin interfaces for content-dnn and MLU (#17890)
      
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      
      * modify the access level of anakin engine (#18015)
      
      test=develop
      
      * fix ci test cmake test=develop
      924e53b7
  15. 26 6月, 2019 1 次提交
  16. 25 6月, 2019 4 次提交
  17. 24 6月, 2019 2 次提交
  18. 20 6月, 2019 2 次提交
  19. 19 6月, 2019 2 次提交
    • T
      Release/1.5 cherry pick (#18139) · 598addf1
      tangwei12 提交于
      * fix save/load in fleet (#17675)
      
      * fix save/load in Fleet
      * add UT framework of Fleet (#18058)
      
      * add paddle cloud role maker for customized usage, note this is only for industrial users that have cloud environment pre-configuration (#18121)
      
      add paddle cloud role maker for specific cloud usage. This pr will simplifies user's configuration in distributed training.
      
      * assign role_maker before use (#18137)
      598addf1
    • F
      Cherry pick retinanet_target_assign_op(#17893), sigmoid_focal_loss_op(#17895)... · 3305045c
      FlyingQianMM 提交于
      Cherry pick retinanet_target_assign_op(#17893), sigmoid_focal_loss_op(#17895) and retinanet_detection_output_op(#17896) for supporting retinanet (#18141)
      
      * test=release/1.5
      Fix conflicts in test_layers.py when adding target assign operator for supporting retinanet. Cherry pick #17893
      
      * test=release/1.5
      Add sigmoid focal loss operator for supporting retinanet. Cherry pick #17895
      
      * test=release/1.5
      Add detection output operator for supporting retinanet. Cherry pick #17896
      
      * test=release/1.5
      fix wrong code style in test_layers.py when cherry pick retinanet_target_assign #17893
      
      * test=release/1.5
      Fix type error of std::pow in sigmoid_focal_loss. Cherry pick #17895
      3305045c
  20. 18 6月, 2019 2 次提交
  21. 15 6月, 2019 2 次提交
  22. 13 6月, 2019 3 次提交