- 27 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 26 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 23 11月, 2018 1 次提交
-
-
由 sabreshao 提交于
* HIP cmake. Enable whole archieve build for pybind library. Disable two warning. Rollback to C++11. Link RCCL to WA gpu kernel loading issue. Update eigen to fix build failure. Add more include directories. Fix O3 build failure. Update eigen. fix tensor_util_test segment fault issue add more macro check in hip.cmake. we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future. Fix rocRAND load. Update eigen to fix gru_unit_op and reduce_op. Add HIP support to testing. Update eigen to support int16 and int8 in arg min and arg max. * add rocprim as cub library used by nv implementation * Reduce build time in rocprim. * Add rocprim introduction, remove useless cmake code. * Remove useless flags and format cmake file.
-
- 16 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
test=develop
-
- 15 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 07 11月, 2018 2 次提交
- 05 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 08 10月, 2018 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 17 5月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 30 4月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "re-commit " * "picked up" * "fix ci" * "fix pdb hang up issue in cuda 9"
-
- 20 3月, 2018 1 次提交
-
-
由 sabreshao 提交于
1. Add option WITH_AMD_GPU. 2. Add cmake/hip.cmake for HIP toolchain. 3. Some external module such as eigen may need HIP port. 4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake. 5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
-
- 16 3月, 2018 1 次提交
-
-
由 sabreshao 提交于
1. Add option WITH_AMD_GPU. 2. Add cmake/hip.cmake for HIP toolchain. 3. Some external module such as eigen may need HIP port. 4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake. 5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
-
- 06 2月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 30 1月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 16 1月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 28 12月, 2017 1 次提交
-
-
由 Liu Yiqun 提交于
-
- 25 10月, 2017 1 次提交
-
-
由 Qiao Longfei 提交于
* init batch norm op * prepare input output * compute mean_out var_out save_mean save_var on CPU * active is test * use eigen to do computation * complete batch norm forward * set default momentum to 0.9 * add batch norm grad op in CPU * add tensor_format and NHWC support, add python test * add test training * add batch norm gradient test * improve comment, fix foward Python UnitTest * add gradient test * fix eigen warning * follow name style * fix a bug * change float to T * add simple forward test * test with different place * add backward test * refine python test * remove old python test code * code clean * follow code style * update comment
-
- 13 10月, 2017 1 次提交
-
-
由 helinwang 提交于
-
- 28 7月, 2017 1 次提交
-
-
由 qijun 提交于
-
- 04 7月, 2017 3 次提交
- 29 6月, 2017 1 次提交
-
-
由 qijun 提交于
-
- 27 6月, 2017 2 次提交