1. 25 1月, 2021 1 次提交
  2. 16 12月, 2020 1 次提交
    • Y
      添加rocm平台支持代码 (#29342) · 76738504
      Y_Xuan 提交于
      * 添加rocm平台支持代码
      
      * 修改一些问题
      
      * 修改一些歧义并添加备注
      
      * 修改代码格式
      
      * 解决冲突后的代码修改
      
      * 修改operators.cmake
      
      * 修改格式
      
      * 修正错误
      
      * 统一接口
      
      * 修改日期
      76738504
  3. 03 11月, 2020 1 次提交
  4. 26 10月, 2020 1 次提交
  5. 03 7月, 2020 1 次提交
  6. 19 6月, 2020 1 次提交
  7. 18 4月, 2020 1 次提交
    • Z
      Update eigen (#23203) · b89dd86f
      Zhang Ting 提交于
      * update eigen, test=develop
      
      * remove patches, test=develop
      
      * add definition of -fabi-version, test=develop
      
      * add patch for TensorBlock.h, test=develop
      
      * test windows, test=develop
      
      * only update eigen for Linux, test=develop
      
      * add code comments, test=develop
      b89dd86f
  8. 11 4月, 2020 1 次提交
  9. 09 1月, 2020 1 次提交
  10. 12 12月, 2019 1 次提交
  11. 04 12月, 2019 1 次提交
  12. 30 11月, 2019 1 次提交
  13. 25 11月, 2019 1 次提交
  14. 11 11月, 2019 1 次提交
  15. 08 11月, 2019 1 次提交
  16. 14 8月, 2019 1 次提交
  17. 18 7月, 2019 1 次提交
    • J
      Downgrade gcc to 4.8 (#18614) · 898237c1
      Jiabin Yang 提交于
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, downgrade gcc to 4.8 for latest-dev
      
      * test=develop, downgrade gcc to 4.8 for latest-dev
      
      * test=develop, modify cmake to renew all third_party
      
      * test=develop, invoke ci
      
      * test=develop, invoke ci
      
      * test=develop, complie python with wide-unicode
      
      * test=deveop, refine env settings
      
      * test=deveop, refine env settings
      898237c1
  18. 03 6月, 2019 1 次提交
  19. 20 2月, 2019 1 次提交
  20. 27 11月, 2018 1 次提交
  21. 26 11月, 2018 1 次提交
  22. 23 11月, 2018 1 次提交
    • S
      Fix cmake for AMDGPU platform (#13801) · 61c5f13f
      sabreshao 提交于
      * HIP cmake.
      Enable whole archieve build for pybind library.
      
      Disable two warning.
      
      Rollback to C++11.
      
      Link RCCL to WA gpu kernel loading issue.
      
      Update eigen to fix build failure.
      
      Add more include directories.
      
      Fix O3 build failure.
      
      Update eigen.
      
      fix tensor_util_test segment fault issue
      
      add more macro check in hip.cmake.
      we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future.
      
      Fix rocRAND load.
      
      Update eigen to fix gru_unit_op and reduce_op.
      
      Add HIP support to testing.
      
      Update eigen to support int16 and int8 in arg min and arg max.
      
      * add rocprim as cub library used by nv implementation
      
      * Reduce build time in rocprim.
      
      * Add rocprim introduction, remove useless cmake code.
      
      * Remove useless flags and format cmake file.
      61c5f13f
  23. 16 11月, 2018 1 次提交
  24. 15 11月, 2018 1 次提交
  25. 07 11月, 2018 2 次提交
  26. 05 11月, 2018 1 次提交
  27. 08 10月, 2018 1 次提交
  28. 17 5月, 2018 1 次提交
  29. 30 4月, 2018 1 次提交
  30. 20 3月, 2018 1 次提交
    • S
      CMake refine for HIP support. · e50205e7
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      e50205e7
  31. 16 3月, 2018 1 次提交
    • S
      Demostration of cmake refine for HIP support. · 45c988d8
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      45c988d8
  32. 06 2月, 2018 1 次提交
  33. 30 1月, 2018 1 次提交
  34. 16 1月, 2018 1 次提交
  35. 28 12月, 2017 1 次提交
  36. 25 10月, 2017 1 次提交
    • Q
      CPU Batch Norm Op (#4964) · ee998a9c
      Qiao Longfei 提交于
      * init batch norm op
      
      * prepare input output
      
      * compute mean_out var_out save_mean save_var on CPU
      
      * active is test
      
      * use eigen to do computation
      
      * complete batch norm forward
      
      * set default momentum to 0.9
      
      * add batch norm grad op in CPU
      
      * add tensor_format and NHWC support, add python test
      
      * add test training
      
      * add batch norm gradient test
      
      * improve comment, fix foward Python UnitTest
      
      * add gradient test
      
      * fix eigen warning
      
      * follow name style
      
      * fix a bug
      
      * change float to T
      
      * add simple forward test
      
      * test with different place
      
      * add backward test
      
      * refine python test
      
      * remove old python test code
      
      * code clean
      
      * follow code style
      
      * update comment
      ee998a9c
  37. 13 10月, 2017 1 次提交
  38. 28 7月, 2017 1 次提交
  39. 04 7月, 2017 1 次提交