1. 12 12月, 2019 1 次提交
    • T
      memory leak for cpu (#21174) · 9ad940fd
      tangwei12 提交于
      * add fake init for the trainer, fix large memory hold in the trainer
      * do not merge recv vars from a remote endpoint, test=develop
      * add recv and save op, merge slice var in one op, save memory
      * remove hsigmoid with pull sparse, test=develop
      9ad940fd
  2. 19 9月, 2019 1 次提交
  3. 10 9月, 2019 1 次提交
    • W
      merge empty lod tensor, test=develop (#19228) · 25dcd74d
      wangguanzhong 提交于
      * merge_empty_lod_tensor, test=develop
      
      * fix multiclass_nms, test=develop
      
      * refine API.spec, test=develop
      
      * add unittest case for fetch, test=develop
      
      * add lod tensor test, test=develop
      
      * return index for multiclass_nms, test=develop
      
      * add api for multiclass_nms2
      
      * update API.spc, test=develop
      
      * refine api doc, test=develop
      
      * fix test_detection.py, test=develop
      
      * polish code, test=develop
      
      * add more unittest case, test=develop
      25dcd74d
  4. 19 8月, 2019 1 次提交
  5. 24 5月, 2019 1 次提交
  6. 20 4月, 2019 1 次提交
  7. 24 1月, 2019 1 次提交
    • Y
      Add the CUDA kernel for beam_search op (#15020) · 3008fa12
      Yiqun Liu 提交于
      * Refine the beam_search op and test.
      
      * A basic CUDA implementation of beam_search for small batch_size.
      
      * Implement CUDA kernel for beam_search_op.
      
      * Use multiple CUDA threads in the same block to select the top beam.
      
      * Update the python api of beam_search op.
      
      * Enable extend function in CPU kernel of beam_search op.
      
      * Unify the CUDA codes.
      test=develop
      
      * Unify the CPU kernel of beam_search op.
      
      * Ensure the seletced items of beam_search_op's CPU kernel sorted by scores.
      
      * Update the description of beam_search in API.spec.
      
      * Enable the use of CUDA kernel in beam_search op.
      
      * Exclude the beam_search's CUDA unittest when there is no CUDA gpu, and delete some debuging statements.
      test=develop
      
      * Follow comments.
      test=develop
      
      * Call the CPU kernel for beam_search op when batch_size > 4.
      test=develop
      
      * Remove the except of is_empty op in PrepareData.
      test=develop
      3008fa12
  8. 21 12月, 2018 1 次提交
  9. 12 12月, 2018 1 次提交
  10. 26 11月, 2018 1 次提交
  11. 24 11月, 2018 1 次提交
  12. 02 11月, 2018 1 次提交
  13. 10 9月, 2018 1 次提交
  14. 25 8月, 2018 1 次提交
  15. 24 8月, 2018 2 次提交
  16. 17 7月, 2018 1 次提交
  17. 02 7月, 2018 1 次提交
  18. 30 6月, 2018 1 次提交
  19. 29 6月, 2018 1 次提交
  20. 28 6月, 2018 1 次提交
  21. 22 6月, 2018 1 次提交
  22. 15 6月, 2018 1 次提交
    • K
      Modify Pybind LoDTensor API according to length-based LoD (#11106) · 417fcf4f
      Kexin Zhao 提交于
      * add lod_tensor util and modify pybind
      
      * refind pybind LoDTensor API and modify LoDTensor and DataFeeder test
      
      * fix test error
      
      * fix detection map op test
      
      * fix reorder_lod_tensor test
      
      * fix seq_concat_op
      
      * fix chunk evel op test
      
      * fix target assign op
      
      * fix warp ctc op
      
      * address comments step 1: reverse reset_lod op
      
      * step 2: modify op test
      
      * add warning message
      
      * remove has_valid_lod
      
      * add back has_valid_lod
      
      * address comments
      
      * add exception catching trial
      417fcf4f
  23. 11 4月, 2018 1 次提交
  24. 07 3月, 2018 1 次提交
  25. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  26. 12 2月, 2018 2 次提交
  27. 10 2月, 2018 2 次提交
  28. 31 1月, 2018 1 次提交
    • D
      Fix/lod (#7714) · ae7d1c1f
      dzhwinter 提交于
      * "Need to re-design LoD "
      
      * "add lod design"
      
      * "fix lod gpu ptr pointer"
      
      * "removed commented code"
      
      * "fix CI"
      
      * "remove set lod in pybind"
      
      * "fix style check"
      
      * "fix CI"
      
      * "fix long type template error"
      
      * "pybind reorder to use Place"
      
      * "fix ci"
      
      * "fix ci"
      
      * fix ci
      
      * "sperate as a new file"
      
      * "fix CI"
      
      * "fix ci"
      
      * small fix
      
      * "add test"
      
      * "fix adam op"
      
      * "fix lstmp op"
      
      * "fix adam op"
      
      * "follow comments"
      
      * "fix ci"
      ae7d1c1f
  29. 23 1月, 2018 1 次提交
    • Q
      Nmt model (#7340) · e7d44a20
      Qiao Longfei 提交于
      neural machine translation model support beam search with while op
      e7d44a20
  30. 18 1月, 2018 3 次提交
  31. 17 1月, 2018 1 次提交
  32. 12 1月, 2018 2 次提交
  33. 11 1月, 2018 1 次提交
  34. 10 1月, 2018 1 次提交