1. 20 3月, 2018 1 次提交
    • S
      CMake refine for HIP support. · e50205e7
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      e50205e7
  2. 16 3月, 2018 3 次提交
  3. 15 3月, 2018 1 次提交
    • D
      [Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53
      dzhwinter 提交于
      * "add softmax cudnn functor support"
      
      * "add testing"
      
      * "refine cmakelist"
      
      * "sequence softmax forward speed up"
      
      * "add softmax grad"
      
      * "fix sequence softmax test"
      
      * "add double precision'
      
      * "fix softmax test"
      
      * "add softmax cudnn support"
      
      * "fix softmax cudnn test"
      
      * "add softmax to nn.py"
      
      * "fix compile bug"
      
      * "refine cmakelist"
      
      * "fix ci"
      
      * "fix based on comment"
      
      * "fix based on comments"
      
      * "fix ci"
      128adf53
  4. 12 3月, 2018 1 次提交
  5. 10 3月, 2018 3 次提交
  6. 09 3月, 2018 1 次提交
    • K
      Add float16 GEMM math function on GPU (#8695) · 90215b78
      kexinzhao 提交于
      * test cpu float16 data transform
      
      * add isnan etc
      
      * small fix
      
      * fix containsNAN test error
      
      * add data_type transform GPU test
      
      * add float16 GPU example
      
      * fix error
      
      * fix GPU test error
      
      * initial commit
      
      * fix error
      
      * small fix
      
      * add more gemm fp16 tests
      
      * fix error
      
      * add utility function
      90215b78
  7. 07 3月, 2018 3 次提交
  8. 05 3月, 2018 2 次提交
  9. 03 3月, 2018 1 次提交
  10. 02 3月, 2018 2 次提交
  11. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  12. 12 2月, 2018 1 次提交
  13. 10 2月, 2018 2 次提交