1. 20 Mar, 2018 1 commit
    • S
      CMake refine for HIP support. · e50205e7
      sabreshao committed
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external modules, such as eigen, may need a HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
  2. 16 Mar, 2018 1 commit
    • S
      Demonstration of cmake refine for HIP support. · 45c988d8
      sabreshao committed
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external modules, such as eigen, may need a HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
  3. 13 Mar, 2018 1 commit
    • Q
      Repair nccl op test (#8575) · 7287630e
      QI JUN committed
      * fix nccl op unit test
      
      * fix build error
      
      * format code
      
      * refine nccl related unit test
      
      * fix build error
      
      * add setGPUData
      
      * clean up
      
      * follow comments
      
      * rm test_nccl.cu
      
      * follow comment
      
      * rm wait
  4. 08 Mar, 2018 1 commit
    • T
      compile and install the static library of fluid inference (#7827) · 6f50dee4
      Tao Luo committed
      * compile and install the static library of fluid inference
      
      * fix dynload_cuda not in CPU mode
      
      * update shared library and adjust the deploy of openblas
      
      * adjust the deploy of openblas
      
      * * auto add all fluid modules for static library
      * use libprotobuf.a instead of libprotobuf-lite.a for profiler
      
      * use set_property to set the global variable instead of ENV
      
      * add gpu depends of fluid modules, auto add inference_lib_dist depends
      
      * change the condition of openblas_lib, and fix a typo
  5. 06 Mar, 2018 2 commits
  6. 05 Mar, 2018 1 commit
  7. 01 Mar, 2018 2 commits
  8. 28 Feb, 2018 1 commit
  9. 26 Feb, 2018 1 commit
  10. 15 Feb, 2018 1 commit
  11. 14 Feb, 2018 1 commit
  12. 12 Feb, 2018 3 commits
  13. 08 Feb, 2018 2 commits
  14. 07 Feb, 2018 1 commit
  15. 06 Feb, 2018 2 commits
  16. 05 Feb, 2018 1 commit
  17. 01 Feb, 2018 1 commit
  18. 30 Jan, 2018 2 commits
  19. 27 Jan, 2018 1 commit
  20. 25 Jan, 2018 1 commit
  21. 23 Jan, 2018 1 commit
  22. 22 Jan, 2018 2 commits
  23. 21 Jan, 2018 1 commit
    • D
      "fix decode bug" (#7711) · e983cc90
      dzhwinter committed
      * "fix decode bug"
      
      * "follow comment"
      
      * "fix error"
      
      * "fix hook bug"
      
      * fix based comment
      
      * fix copyright
      
      * fix based on comment
  24. 20 Jan, 2018 1 commit
  25. 19 Jan, 2018 1 commit
  26. 16 Jan, 2018 2 commits
  27. 15 Jan, 2018 2 commits
  28. 09 Jan, 2018 1 commit
    • Y
      Port WarpCTC Operator (#5107) · b5fda272
      Yiqun Liu committed
      * Add Seq2BatchFunctor, which will be used in WarpCTCOp.
      
      * Implement WrapCTCFunctor and WrapCTCKernel.
      
      * Add unittest of warpctc_op.
      
      * Modify the check_output interface in the Python unittest framework to allow checking a subset of outputs.
      
      * Use absolute offset lod in warpctc_op and related functors.
      
      * Refine the comments of warpctc_op.
      
      * The new python unittest supports checking a subset of the outputs, so revoke the previous change.
      
      * Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.
      
      * Update to the newest codes.
      
      * Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions from the functors.
  29. 06 Jan, 2018 1 commit
  30. 05 Jan, 2018 1 commit