1. 22 3月, 2018 1 次提交
  2. 20 3月, 2018 1 次提交
    • S
      CMake refine for HIP support. · e50205e7
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      e50205e7
  3. 16 3月, 2018 2 次提交
    • S
      Demostration of cmake refine for HIP support. · 45c988d8
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      45c988d8
    • Y
      Single GPU ParallelExecutor complete · 6f0dfd89
      Yu Yang 提交于
      6f0dfd89
  4. 13 3月, 2018 1 次提交
  5. 08 3月, 2018 2 次提交
    • L
      add MKL for fluid static and shared library · 5030681c
      Luo Tao 提交于
      5030681c
    • T
      compile and install the static library of fluid inference (#7827) · 6f50dee4
      Tao Luo 提交于
      * compile and install the static library of fluid inference
      
      * fix dynload_cuda not in CPU mode
      
      * update shared library and adjust the deploy of openblas
      
      * adjust the deploy of openblas
      
      * * auto add all fluid modules for static library
      * use libprotobuf.a instead of libprotobuf-lite.a for profiler
      
      * use set_property to set the global varible instead of ENV
      
      * add gpu depends of fluid modules, auto add inference_lib_dist depends
      
      * change the condition of openblas_lib, and fix a typo
      6f50dee4
  6. 06 3月, 2018 2 次提交
  7. 05 3月, 2018 1 次提交
  8. 01 3月, 2018 1 次提交
  9. 15 2月, 2018 1 次提交
  10. 14 2月, 2018 1 次提交
  11. 12 2月, 2018 2 次提交
  12. 07 2月, 2018 1 次提交
  13. 06 2月, 2018 1 次提交
  14. 05 2月, 2018 1 次提交
  15. 30 1月, 2018 1 次提交
  16. 27 1月, 2018 1 次提交
  17. 25 1月, 2018 1 次提交
  18. 22 1月, 2018 2 次提交
  19. 20 1月, 2018 1 次提交
  20. 19 1月, 2018 1 次提交
  21. 16 1月, 2018 2 次提交
  22. 15 1月, 2018 1 次提交
  23. 09 1月, 2018 1 次提交
    • Y
      Port WarpCTC Operator (#5107) · b5fda272
      Yiqun Liu 提交于
      * Add Seq2BatchFunctor, which will be used in WarpCTCOp.
      
      * Implement WrapCTCFunctor and WrapCTCKernel.
      
      * Add unittest of warpctc_op.
      
      * Modify the check_output inferface in python unittest framework to allow check a subset of outputs.
      
      * Use absolute offset lod in warpctc_op and related functors.
      
      * Refine the comments of warpctc_op.
      
      * The new python unittest supports checking a subset of the outputs, so revoke the previous change.
      
      * Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.
      
      * Update to the newest codes.
      
      * Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functos.
      b5fda272
  24. 06 1月, 2018 1 次提交
  25. 05 1月, 2018 1 次提交
  26. 04 1月, 2018 1 次提交
  27. 28 12月, 2017 1 次提交
  28. 21 12月, 2017 1 次提交
  29. 12 12月, 2017 1 次提交
  30. 08 12月, 2017 1 次提交
  31. 06 12月, 2017 1 次提交
  32. 05 12月, 2017 1 次提交
  33. 01 12月, 2017 2 次提交