- 17 12月, 2018 2 次提交
- 14 12月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 11 12月, 2018 2 次提交
- 10 12月, 2018 2 次提交
- 06 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 05 12月, 2018 3 次提交
-
-
由 tensor-tang 提交于
test=develop
-
由 sneaxiy 提交于
test=develop
-
由 liuhongyu 提交于
-
- 04 12月, 2018 3 次提交
-
-
由 Wu Yi 提交于
* wip multi process multi gpu dist training * workable for p2p * update test=develop * change back env name test=develop * fix alloc init * fix cpu build test=devlop * fix mac tests test=develop * refine code * refine test=develop
-
由 liuhongyu 提交于
-
由 ZongwuYang 提交于
Fix the bug that profiler cannot trace the nccl allreduce operator
-
- 03 12月, 2018 4 次提交
-
-
由 sneaxiy 提交于
test=develop
-
由 sneaxiy 提交于
-
由 Yihua Xu 提交于
-
由 Yibing Liu 提交于
-
- 29 11月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 27 11月, 2018 6 次提交
-
-
由 Clementine 提交于
-
由 Michal Gallus 提交于
test=develop
-
由 Jacek Czaja 提交于
-
由 liuhongyu 提交于
-
由 peizhilin 提交于
-
由 Jacek Czaja 提交于
test=develop - Added new header for MKLDNN reuse functionality - Extended conv2d_transpose GetExpectedKernelType for MKL-DNN supporrt - Buildable conv transpose mkldnn and conv mkldnn using conv template - Conv2d transpose roughlt implemented and buildable - Added modifications conv2d transpose MKLDNN unit tests - Fix to UT of conv2d transpose mkldnn op - Wrong type of MKLDNN primitive was chosen for conv2d transpose - HAcks for conv2d transpose - UT enalbed - Replaced copying loop with memcpy - Draft of passing lambda into AcquireMemory - Made reorder (IOHW->OIHW) to be called only once
-
- 26 11月, 2018 2 次提交
- 23 11月, 2018 3 次提交
-
-
由 chengduozh 提交于
test=develop
-
由 luotao1 提交于
-
由 peizhilin 提交于
fix code style
-
- 22 11月, 2018 4 次提交
-
-
由 chengduo 提交于
* refine cublase test=develop * code refine * refine cublas * add GEMME_EX * add enable_cublas_tensor_op_math doc and add cublasCall test=develop * fix CublasCall for cuda version test=develop * fix error test=develop * fix GEMM_EX to be compatible with gcc 4.8 test=develop * add GEMM_EX test=develop * to compatiable with gcc4.8 test=develop
-
由 peizhilin 提交于
test=develop
-
由 peizhilin 提交于
-
由 wopeizl 提交于
* add recordio support * disable the openblas multi-thread on windows since no support adjust the python script * code style * code style test=develop * add create_recordio_file_reader back * fix code style test=develop * fix the gtest.cmake on windows * fix cc_test on windows * fix the win build test=develop * remove fused compile support on windows test=develop * add the jit support test=develop * add the jit support, test=develop * add the jit support, test=develop * add the jit back fix compile error on windows * rollback test=develop * test case fix * disable DSO by default on windows * exclude warpctc_op on windows * exclude the dynload_warpctc out on windows test=develop * fix the scripts error test=develop * disable avx on windows by default test=develop * re-organize the cmake file * disable mkl on windows by default * add warp_ctc back * fix the dependency * fix the dependency * fix the build issue on windows * remove unsupported flag on windows * code style * code style test=develop * fix issue * add profiler, parallel_executor back * clean up the pre-definitions on windows * fix build issue * test=develop
-
- 21 11月, 2018 2 次提交
- 20 11月, 2018 2 次提交
- 19 11月, 2018 1 次提交
-
-
由 qingqing01 提交于
* Convolution fusion operator. * Clean code test=develop
-
- 18 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-