- 25 1月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 16 12月, 2020 1 次提交
-
-
由 Y_Xuan 提交于
* 添加rocm平台支持代码 * 修改一些问题 * 修改一些歧义并添加备注 * 修改代码格式 * 解决冲突后的代码修改 * 修改operators.cmake * 修改格式 * 修正错误 * 统一接口 * 修改日期
-
- 03 11月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 26 10月, 2020 1 次提交
-
-
由 XiaoguangHu 提交于
-
- 03 7月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
-
- 19 6月, 2020 1 次提交
-
-
由 T8T9 提交于
-
- 18 4月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* update eigen, test=develop * remove patches, test=develop * add definition of -fabi-version, test=develop * add patch for TensorBlock.h, test=develop * test windows, test=develop * only update eigen for Linux, test=develop * add code comments, test=develop
-
- 11 4月, 2020 1 次提交
-
-
由 zhangchunle 提交于
-
- 09 1月, 2020 1 次提交
-
-
由 zhouwei25 提交于
tweak the interface of cache_third_party function - expose the SOURCE_DIR for each external library (#21899)
-
- 12 12月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 04 12月, 2019 1 次提交
-
-
由 silingtong123 提交于
* modify the repo address of eigen and warpctc * fix the eigen not work on windows * fix the eigen and warpctc can't recompile
-
- 30 11月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 25 11月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 11 11月, 2019 1 次提交
-
-
由 Michał Gallus 提交于
test=develop
-
- 08 11月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 14 8月, 2019 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 18 7月, 2019 1 次提交
-
-
由 Jiabin Yang 提交于
* test=develop, fix docker with paddle nccl problem * test=develop, downgrade gcc to 4.8 for latest-dev * test=develop, downgrade gcc to 4.8 for latest-dev * test=develop, modify cmake to renew all third_party * test=develop, invoke ci * test=develop, invoke ci * test=develop, complie python with wide-unicode * test=deveop, refine env settings * test=deveop, refine env settings
-
- 03 6月, 2019 1 次提交
-
-
由 wopeizl 提交于
* add support for cuda9 on windows test=develop * use different git address for cuda9 compatible on windows
-
- 20 2月, 2019 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 27 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 26 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 23 11月, 2018 1 次提交
-
-
由 sabreshao 提交于
* HIP cmake. Enable whole archieve build for pybind library. Disable two warning. Rollback to C++11. Link RCCL to WA gpu kernel loading issue. Update eigen to fix build failure. Add more include directories. Fix O3 build failure. Update eigen. fix tensor_util_test segment fault issue add more macro check in hip.cmake. we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future. Fix rocRAND load. Update eigen to fix gru_unit_op and reduce_op. Add HIP support to testing. Update eigen to support int16 and int8 in arg min and arg max. * add rocprim as cub library used by nv implementation * Reduce build time in rocprim. * Add rocprim introduction, remove useless cmake code. * Remove useless flags and format cmake file.
-
- 16 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
test=develop
-
- 15 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 07 11月, 2018 2 次提交
- 05 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 08 10月, 2018 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 17 5月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 30 4月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "re-commit " * "picked up" * "fix ci" * "fix pdb hang up issue in cuda 9"
-
- 20 3月, 2018 1 次提交
-
-
由 sabreshao 提交于
1. Add option WITH_AMD_GPU. 2. Add cmake/hip.cmake for HIP toolchain. 3. Some external module such as eigen may need HIP port. 4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake. 5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
-
- 16 3月, 2018 1 次提交
-
-
由 sabreshao 提交于
1. Add option WITH_AMD_GPU. 2. Add cmake/hip.cmake for HIP toolchain. 3. Some external module such as eigen may need HIP port. 4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake. 5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
-
- 06 2月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 30 1月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 16 1月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 28 12月, 2017 1 次提交
-
-
由 Liu Yiqun 提交于
-
- 25 10月, 2017 1 次提交
-
-
由 Qiao Longfei 提交于
* init batch norm op * prepare input output * compute mean_out var_out save_mean save_var on CPU * active is test * use eigen to do computation * complete batch norm forward * set default momentum to 0.9 * add batch norm grad op in CPU * add tensor_format and NHWC support, add python test * add test training * add batch norm gradient test * improve comment, fix foward Python UnitTest * add gradient test * fix eigen warning * follow name style * fix a bug * change float to T * add simple forward test * test with different place * add backward test * refine python test * remove old python test code * code clean * follow code style * update comment
-
- 13 10月, 2017 1 次提交
-
-
由 helinwang 提交于
-
- 28 7月, 2017 1 次提交
-
-
由 qijun 提交于
-
- 04 7月, 2017 1 次提交
-
-
由 liaogang 提交于
-