1. 23 Mar, 2018 2 commits
  2. 21 Mar, 2018 1 commit
  3. 20 Mar, 2018 2 commits
    • CMake refine for HIP support. · e50205e7
      Committed by sabreshao
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external modules, such as Eigen, may need a HIP port.
      4. Add macros hip_library, hip_binary, and hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
    • add math_function to softmax's dep list · 9eae086e
      Committed by Xi Chen
  4. 17 Mar, 2018 2 commits
    • initial commit · 39c676e2
      Committed by Kexin Zhao
    • Fix compilation for gcc5.4 · ab3543e3
      Committed by xuwei06
      The error is:
      
      paddle/fluid/operators/math/concat.cc:47:72: error: invalid initialization of non-const reference of type 'paddle::platform::CPUPlace&' from an rvalue of type 'paddle::platform::CPUPlace'
         auto& cpu_place = boost::get<platform::CPUPlace>(context.GetPlace());
      
      cpu_place should not be declared as a non-const reference: the initializer is an rvalue, which cannot bind to one.
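The rule behind this error can be reproduced in a few lines. The following is a minimal sketch, not the actual Paddle code: `CPUPlace` and `Context` here are hypothetical stand-ins, and the `boost::get` call on the variant is left out so the example depends only on standard C++. It assumes only what the error message states, namely that the initializer is an rvalue of type `CPUPlace`.

```cpp
// Hypothetical stand-ins for the paddle::platform types; NOT the real definitions.
struct CPUPlace {
  int id;
};

struct Context {
  CPUPlace place;
  // Returns by value, so every call produces a temporary (an rvalue).
  CPUPlace GetPlace() const { return place; }
};

// Fixed form: take a copy instead of a non-const reference.
int GetCpuPlaceIdByCopy(const Context& context) {
  // auto& cpu_place = context.GetPlace();  // ill-formed: a non-const lvalue
  //                                        // reference cannot bind to an rvalue
  auto cpu_place = context.GetPlace();      // OK: cpu_place is a copy
  return cpu_place.id;
}

// Alternative: a const reference may bind to a temporary, and doing so
// extends the temporary's lifetime to that of the reference.
int GetCpuPlaceIdByConstRef(const Context& context) {
  const auto& cpu_place = context.GetPlace();
  return cpu_place.id;
}
```

Copying is a reasonable choice when the object is small, as a place tag presumably is; the const-reference form is shown only for comparison.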
  5. 16 Mar, 2018 4 commits
  6. 15 Mar, 2018 1 commit
    • [Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53
      Committed by dzhwinter
      * "add softmax cudnn functor support"
      
      * "add testing"
      
      * "refine cmakelist"
      
      * "sequence softmax forward speed up"
      
      * "add softmax grad"
      
      * "fix sequence softmax test"
      
      * "add double precision"
      
      * "fix softmax test"
      
      * "add softmax cudnn support"
      
      * "fix softmax cudnn test"
      
      * "add softmax to nn.py"
      
      * "fix compile bug"
      
      * "refine cmakelist"
      
      * "fix ci"
      
      * "fix based on comment"
      
      * "fix based on comments"
      
      * "fix ci"
  7. 12 Mar, 2018 1 commit
  8. 10 Mar, 2018 3 commits
  9. 09 Mar, 2018 1 commit
    • Add float16 GEMM math function on GPU (#8695) · 90215b78
      Committed by kexinzhao
      * test cpu float16 data transform
      
      * add isnan etc
      
      * small fix
      
      * fix containsNAN test error
      
      * add data_type transform GPU test
      
      * add float16 GPU example
      
      * fix error
      
      * fix GPU test error
      
      * initial commit
      
      * fix error
      
      * small fix
      
      * add more gemm fp16 tests
      
      * fix error
      
      * add utility function
  10. 07 Mar, 2018 3 commits
  11. 05 Mar, 2018 2 commits
  12. 03 Mar, 2018 1 commit
  13. 02 Mar, 2018 2 commits
  14. 15 Feb, 2018 1 commit
    • Update tensor_util.h (#8422) · cfffb1a3
      Committed by Yi Wang
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
  15. 12 Feb, 2018 1 commit
  16. 10 Feb, 2018 2 commits