- 27 11月, 2018 6 次提交
-
-
由 Qiao Longfei 提交于
-
由 Qiao Longfei 提交于
test=develop
-
由 Qiao Longfei 提交于
-
由 Qiao Longfei 提交于
test=develop
-
由 dengkaipeng 提交于
-
由 dengkaipeng 提交于
-
- 26 11月, 2018 10 次提交
-
-
由 tensor-tang 提交于
test=develop
-
由 qingqing01 提交于
* Transpose-Flatten-Concat fusion operator. * Add unit testing and fix bug.
-
由 Yan Chunwei 提交于
-
由 tangwei12 提交于
* fix mkdir conflict * fix load/save lookup tables test=develop * add lookup_table_utils * fix load optimize vars on pserver * delete lookup table utils * fix save and load lookup tables * fix load optimizer var * fix load optimizer var, test=develop * fix python 3 style, test=develop * move lookup_table_utils to contrib utils
-
由 qingqing01 提交于
* Remove the memory copy for feeding data in C++ inference API * Fix compling dependence * Fix compling in ONLY_CPU mode
-
由 peizhilin 提交于
-
由 peizhilin 提交于
-
由 Yiqun Liu 提交于
test=develop
-
由 superjomn 提交于
test=develop
-
由 dzhwinter 提交于
* test=develop remove code. * test=develop
-
- 25 11月, 2018 4 次提交
-
-
由 Yan Chunwei 提交于
-
由 Yan Chunwei 提交于
-
由 gongweibao 提交于
-
由 minqiyang 提交于
test=develop
-
- 24 11月, 2018 6 次提交
- 23 11月, 2018 12 次提交
-
-
由 qingqing01 提交于
-
由 chengduozh 提交于
test=develop
-
由 JiabinYang 提交于
-
由 luotao1 提交于
test=develop
-
由 luotao1 提交于
test=develop
-
由 luotao1 提交于
-
由 luotao1 提交于
-
由 peizhilin 提交于
fix code style
-
由 sabreshao 提交于
* HIP cmake. Enable whole archieve build for pybind library. Disable two warning. Rollback to C++11. Link RCCL to WA gpu kernel loading issue. Update eigen to fix build failure. Add more include directories. Fix O3 build failure. Update eigen. fix tensor_util_test segment fault issue add more macro check in hip.cmake. we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future. Fix rocRAND load. Update eigen to fix gru_unit_op and reduce_op. Add HIP support to testing. Update eigen to support int16 and int8 in arg min and arg max. * add rocprim as cub library used by nv implementation * Reduce build time in rocprim. * Add rocprim introduction, remove useless cmake code. * Remove useless flags and format cmake file.
-
由 qingqing01 提交于
* CUDA kernel for density_prior_box_op. * Support flatten to 2D.
-
由 tensor-tang 提交于
test=develop
-
由 peizhilin 提交于
-
- 22 11月, 2018 2 次提交
-
-
由 chengduo 提交于
* refine cublase test=develop * code refine * refine cublas * add GEMME_EX * add enable_cublas_tensor_op_math doc and add cublasCall test=develop * fix CublasCall for cuda version test=develop * fix error test=develop * fix GEMM_EX to be compatible with gcc 4.8 test=develop * add GEMM_EX test=develop * to compatiable with gcc4.8 test=develop
-
由 peizhilin 提交于
test=develop
-