- 14 6月, 2018 15 次提交
-
-
由 Tao Luo 提交于
fix compiler error in contrib/tape
-
由 Luo Tao 提交于
-
由 Luo Tao 提交于
-
由 tensor-tang 提交于
Add an interface to set the number of threads for math function, and set the default value to 1 for inference.
-
由 fengjiayi 提交于
fix a small error
-
由 Qiyang Min 提交于
* 1. Create buddy allocator in each places before NcclBcast the variables 2. Check the memory usage of ALL gpus rather than the first one * 1. Make NCCLGroupGuard guards only the ncclBcast part, which avoid ncclGroupEnd blocking the exception throwing 2. NOTE the usage of NCCLGroupGuard * Remove the memory usage check of gpus * Fix code style
-
由 Qiao Longfei 提交于
outvar must be create in local scope for prefetch
-
由 Xin Pan 提交于
Remove cuptiFinalize.
-
由 Yan Chunwei 提交于
-
由 whs 提交于
* Add mean_iou op. * Add unitest for mean iou op. * Add optional collections of confusion matrix and mean_iou. * Fix cuda kernel. * Refine code. 1. Merge computing in GPU to two kernel. 2. Use wrong array and correct array instead of confusion matrix. * Add python api and fix cuda kernel. * Fix comments. * Small fix. * Small fix.
-
由 qiaolongfei 提交于
-
由 Xin Pan 提交于
In cupti samples, only cuptiFlush is used. I can't find any places calling cuptiFinalize and this API can error out as not_implemented in some cuda installation.
-
由 Yang Yang(Tony) 提交于
* fix #11416 * make sgd check tape has been backwarded_ * add error message
-
由 Yang Yang(Tony) 提交于
* change dynamic to tape * update readme link
-
由 Yang Yang(Tony) 提交于
-
- 13 6月, 2018 25 次提交
-
-
由 qiaolongfei 提交于
-
由 fengjiayi 提交于
-
由 Tao Luo 提交于
add apis for learning_rate_scheduler
-
由 weixing02 提交于
-
由 Qiao Longfei 提交于
Add merge_ids_op
-
由 weixing 提交于
fix doc generating error
-
由 weixing 提交于
fix errors
-
由 qiaolongfei 提交于
-
由 qiaolongfei 提交于
-
由 tensor-tang 提交于
MKLDNN layout: Support for convolution operator
-
由 qiaolongfei 提交于
-
由 Wu Yi 提交于
Fix nccl dist train bug
-
-
由 weixing02 提交于
-
由 chengduo 提交于
Change drop_last defalut value
-
由 Shan Yi 提交于
add url of cuda9.0_cudnn7_avx_mkl
-
由 weixing02 提交于
-
由 Luo Tao 提交于
-
-
由 yi.wu 提交于
-
由 Yancey 提交于
* expose h0 in dynamic_lstm * update by comment * update by comment * h0 to H0
-
由 weixing 提交于
add detection api
-
由 Qiao Longfei 提交于
fix build on mac
-
由 tensor-tang 提交于
MKLDNN layout: Support for batch norm operator
-
由 weixing02 提交于
-