1. 30 8月, 2019 1 次提交
    • H
      [Cherry-pick] Support memory eager deletion on recurrent OP (#19411) · cb74dac3
      Huihuang Zheng 提交于
      * Support memory eager deletion on recurrent OP (#17710)
      
      Test PaddingRNN on V100 GPU device.
      
      Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
                         
      GPU memory (MiB):   6414 (this PR)     vs   6837 (without this PR)
      Speed (steps/s):         10.28 (this PR)    vs    9.89 (without this PR)
      
      * Fix random test_recurrent_op failure (#18718)
      
      The change includes 3 things:
      
      1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1.
      
      2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values.
      
      3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests.
      cb74dac3
  2. 30 5月, 2019 1 次提交
  3. 19 5月, 2019 1 次提交
  4. 16 5月, 2019 1 次提交
  5. 12 4月, 2019 1 次提交
  6. 28 3月, 2019 1 次提交
  7. 27 3月, 2019 1 次提交
  8. 11 3月, 2019 1 次提交
  9. 06 3月, 2019 2 次提交
  10. 12 12月, 2018 1 次提交
  11. 26 11月, 2018 1 次提交
  12. 08 11月, 2018 1 次提交
  13. 21 6月, 2018 2 次提交
  14. 19 6月, 2018 1 次提交
  15. 08 5月, 2018 1 次提交
    • Y
      Clean OpProtoAndCheckerMaker · 0e78cb69
      Yu Yang 提交于
      Do not use ctor
      
      * Reduce line of codes.
      * We can use virtual function for Maker now.
      * The implementation does not care what maker holds, it is easier to
      refactor later.
      0e78cb69
  16. 25 4月, 2018 1 次提交
  17. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  18. 12 2月, 2018 1 次提交
  19. 10 2月, 2018 2 次提交
  20. 09 2月, 2018 1 次提交
  21. 09 1月, 2018 1 次提交
  22. 27 12月, 2017 4 次提交
  23. 26 12月, 2017 2 次提交
  24. 24 12月, 2017 1 次提交
    • D
      Feature/operator run place (#6783) · 735eba29
      dzhwinter 提交于
      * "change operator interface"
      
      * "move devicepool to device_context"
      
      * "fix operator test"
      
      * "fix op_registry Run interface"
      
      * "net op passed. Need to fix nccl multi-Context"
      
      * "add nccl group function"
      
      * "add nccl group function"
      
      * "fix gpu count exceed 32 error"
      
      * "fix recurrent op, nccl op"
      
      * "change the other operators interface with Place"
      
      * "fix typo"
      
      * "fix pybind"
      
      * "fix device in python side"
      
      * "fix pybind failed"
      
      * "add init for test"
      
      * "fix CI"
      735eba29
  25. 22 12月, 2017 1 次提交
  26. 21 12月, 2017 1 次提交
  27. 20 12月, 2017 1 次提交
  28. 19 12月, 2017 2 次提交
  29. 14 12月, 2017 1 次提交
  30. 11 12月, 2017 1 次提交
    • Y
      Fix gcc4.9 (#6442) · 95924686
      Yiqun Liu 提交于
      * Fix compiling error of gcc4.9.
      
      * Refine the check of cxx compiler flags in api/CMakeLists.txt.
      95924686
  31. 04 12月, 2017 1 次提交
    • Y
      While op forward for sentimental analysis (#6140) · d5e32794
      Yu Yang 提交于
      * Add DataFeeder
      
      A v2 API like data feeder for book demos.
      We can feed data directly from reader.
      
      * Fix CI
      
      * Add an unittest for while/rnn op forward
      
      * Add unittest for raw while op backward
      
      * Fix CI
      d5e32794
  32. 26 11月, 2017 1 次提交
    • D
      Feature/copytensor (#5455) · 45062fe5
      dzhwinter 提交于
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
      45062fe5