1. 03 7月, 2020 1 次提交
  2. 19 6月, 2020 1 次提交
  3. 18 4月, 2020 1 次提交
    • Z
      Update eigen (#23203) · b89dd86f
      Zhang Ting 提交于
      * update eigen, test=develop
      
      * remove patches, test=develop
      
      * add definition of -fabi-version, test=develop
      
      * add patch for TensorBlock.h, test=develop
      
      * test windows, test=develop
      
      * only update eigen for Linux, test=develop
      
      * add code comments, test=develop
      b89dd86f
  4. 11 4月, 2020 1 次提交
  5. 09 1月, 2020 1 次提交
  6. 12 12月, 2019 1 次提交
  7. 04 12月, 2019 1 次提交
  8. 30 11月, 2019 1 次提交
  9. 25 11月, 2019 1 次提交
  10. 11 11月, 2019 1 次提交
  11. 08 11月, 2019 1 次提交
  12. 14 8月, 2019 1 次提交
  13. 18 7月, 2019 1 次提交
    • J
      Downgrade gcc to 4.8 (#18614) · 898237c1
      Jiabin Yang 提交于
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, downgrade gcc to 4.8 for latest-dev
      
      * test=develop, downgrade gcc to 4.8 for latest-dev
      
      * test=develop, modify cmake to renew all third_party
      
      * test=develop, invoke ci
      
      * test=develop, invoke ci
      
      * test=develop, complie python with wide-unicode
      
      * test=deveop, refine env settings
      
      * test=deveop, refine env settings
      898237c1
  14. 03 6月, 2019 1 次提交
  15. 20 2月, 2019 1 次提交
  16. 27 11月, 2018 1 次提交
  17. 26 11月, 2018 1 次提交
  18. 23 11月, 2018 1 次提交
    • S
      Fix cmake for AMDGPU platform (#13801) · 61c5f13f
      sabreshao 提交于
      * HIP cmake.
      Enable whole archieve build for pybind library.
      
      Disable two warning.
      
      Rollback to C++11.
      
      Link RCCL to WA gpu kernel loading issue.
      
      Update eigen to fix build failure.
      
      Add more include directories.
      
      Fix O3 build failure.
      
      Update eigen.
      
      fix tensor_util_test segment fault issue
      
      add more macro check in hip.cmake.
      we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future.
      
      Fix rocRAND load.
      
      Update eigen to fix gru_unit_op and reduce_op.
      
      Add HIP support to testing.
      
      Update eigen to support int16 and int8 in arg min and arg max.
      
      * add rocprim as cub library used by nv implementation
      
      * Reduce build time in rocprim.
      
      * Add rocprim introduction, remove useless cmake code.
      
      * Remove useless flags and format cmake file.
      61c5f13f
  19. 16 11月, 2018 1 次提交
  20. 15 11月, 2018 1 次提交
  21. 07 11月, 2018 2 次提交
  22. 05 11月, 2018 1 次提交
  23. 08 10月, 2018 1 次提交
  24. 17 5月, 2018 1 次提交
  25. 30 4月, 2018 1 次提交
  26. 20 3月, 2018 1 次提交
    • S
      CMake refine for HIP support. · e50205e7
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      e50205e7
  27. 16 3月, 2018 1 次提交
    • S
      Demostration of cmake refine for HIP support. · 45c988d8
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      45c988d8
  28. 06 2月, 2018 1 次提交
  29. 30 1月, 2018 1 次提交
  30. 16 1月, 2018 1 次提交
  31. 28 12月, 2017 1 次提交
  32. 25 10月, 2017 1 次提交
    • Q
      CPU Batch Norm Op (#4964) · ee998a9c
      Qiao Longfei 提交于
      * init batch norm op
      
      * prepare input output
      
      * compute mean_out var_out save_mean save_var on CPU
      
      * active is test
      
      * use eigen to do computation
      
      * complete batch norm forward
      
      * set default momentum to 0.9
      
      * add batch norm grad op in CPU
      
      * add tensor_format and NHWC support, add python test
      
      * add test training
      
      * add batch norm gradient test
      
      * improve comment, fix foward Python UnitTest
      
      * add gradient test
      
      * fix eigen warning
      
      * follow name style
      
      * fix a bug
      
      * change float to T
      
      * add simple forward test
      
      * test with different place
      
      * add backward test
      
      * refine python test
      
      * remove old python test code
      
      * code clean
      
      * follow code style
      
      * update comment
      ee998a9c
  33. 13 10月, 2017 1 次提交
  34. 28 7月, 2017 1 次提交
  35. 04 7月, 2017 3 次提交
  36. 29 6月, 2017 1 次提交
  37. 27 6月, 2017 1 次提交