1. 03 6月, 2019 1 次提交
  2. 20 2月, 2019 1 次提交
  3. 27 11月, 2018 1 次提交
  4. 26 11月, 2018 1 次提交
  5. 23 11月, 2018 1 次提交
    • S
      Fix cmake for AMDGPU platform (#13801) · 61c5f13f
      sabreshao 提交于
      * HIP cmake.
      Enable whole archieve build for pybind library.
      
      Disable two warning.
      
      Rollback to C++11.
      
      Link RCCL to WA gpu kernel loading issue.
      
      Update eigen to fix build failure.
      
      Add more include directories.
      
      Fix O3 build failure.
      
      Update eigen.
      
      fix tensor_util_test segment fault issue
      
      add more macro check in hip.cmake.
      we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future.
      
      Fix rocRAND load.
      
      Update eigen to fix gru_unit_op and reduce_op.
      
      Add HIP support to testing.
      
      Update eigen to support int16 and int8 in arg min and arg max.
      
      * add rocprim as cub library used by nv implementation
      
      * Reduce build time in rocprim.
      
      * Add rocprim introduction, remove useless cmake code.
      
      * Remove useless flags and format cmake file.
      61c5f13f
  6. 16 11月, 2018 1 次提交
  7. 15 11月, 2018 1 次提交
  8. 07 11月, 2018 2 次提交
  9. 05 11月, 2018 1 次提交
  10. 08 10月, 2018 1 次提交
  11. 17 5月, 2018 1 次提交
  12. 30 4月, 2018 1 次提交
  13. 20 3月, 2018 1 次提交
    • S
      CMake refine for HIP support. · e50205e7
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      e50205e7
  14. 16 3月, 2018 1 次提交
    • S
      Demostration of cmake refine for HIP support. · 45c988d8
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      45c988d8
  15. 06 2月, 2018 1 次提交
  16. 30 1月, 2018 1 次提交
  17. 16 1月, 2018 1 次提交
  18. 28 12月, 2017 1 次提交
  19. 25 10月, 2017 1 次提交
    • Q
      CPU Batch Norm Op (#4964) · ee998a9c
      Qiao Longfei 提交于
      * init batch norm op
      
      * prepare input output
      
      * compute mean_out var_out save_mean save_var on CPU
      
      * active is test
      
      * use eigen to do computation
      
      * complete batch norm forward
      
      * set default momentum to 0.9
      
      * add batch norm grad op in CPU
      
      * add tensor_format and NHWC support, add python test
      
      * add test training
      
      * add batch norm gradient test
      
      * improve comment, fix foward Python UnitTest
      
      * add gradient test
      
      * fix eigen warning
      
      * follow name style
      
      * fix a bug
      
      * change float to T
      
      * add simple forward test
      
      * test with different place
      
      * add backward test
      
      * refine python test
      
      * remove old python test code
      
      * code clean
      
      * follow code style
      
      * update comment
      ee998a9c
  20. 13 10月, 2017 1 次提交
  21. 28 7月, 2017 1 次提交
  22. 04 7月, 2017 3 次提交
  23. 29 6月, 2017 1 次提交
  24. 27 6月, 2017 2 次提交