- 15 Jun, 2018 (1 commit)
  - Committed by fengjiayi
- 14 Jun, 2018 (8 commits)
  - Committed by chengduoZH
  - Committed by tensor-tang
  - Committed by fengjiayi
  - Committed by Qiyang Min
    * 1. Create a buddy allocator in each place before NcclBcast of the variables.
      2. Check the memory usage of ALL GPUs rather than only the first one.
    * 1. Make NCCLGroupGuard guard only the ncclBcast part, which prevents ncclGroupEnd from blocking exception throwing.
      2. Note the usage of NCCLGroupGuard.
    * Remove the memory usage check of GPUs.
    * Fix code style.
  - Committed by whs
    * Add mean_iou op.
    * Add unit test for mean_iou op.
    * Add optional collections of confusion matrix and mean_iou.
    * Fix CUDA kernel.
    * Refine code.
      1. Merge the GPU computation into two kernels.
      2. Use a wrong array and a correct array instead of a confusion matrix.
    * Add Python API and fix CUDA kernel.
    * Fix comments.
    * Small fix.
    * Small fix.
  - Committed by qiaolongfei
  - Committed by Xin Pan
    In the CUPTI samples, only cuptiFlush is used. I can't find any place that calls cuptiFinalize, and this API can error out as not_implemented in some CUDA installations.
  - Committed by Yang Yang(Tony)
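The mean_iou commit by whs on 14 Jun mentions using a "wrong array and correct array instead of confusion matrix". The following is a minimal pure-Python sketch of how mean IoU can be computed from two such per-class counters. It is an illustration only, not the operator's actual kernel: the function name `mean_iou_from_counts`, its signature, and the rule of skipping classes that never appear are all assumptions made for this example.

```python
def mean_iou_from_counts(predictions, labels, num_classes):
    """Hypothetical helper: mean IoU from per-class wrong/correct counters."""
    wrong = [0] * num_classes    # per class: false positives + false negatives
    correct = [0] * num_classes  # per class: true positives

    for pred, label in zip(predictions, labels):
        if pred == label:
            correct[pred] += 1
        else:
            # A mismatch is a false positive for the predicted class
            # and a false negative for the labelled class.
            wrong[pred] += 1
            wrong[label] += 1

    # IoU per class = TP / (TP + FP + FN); classes that never occur are
    # skipped (an assumption of this sketch).
    ious = [
        correct[c] / (correct[c] + wrong[c])
        for c in range(num_classes)
        if correct[c] + wrong[c] > 0
    ]
    mean = sum(ious) / len(ious) if ious else 0.0
    return mean, wrong, correct


miou, wrong, correct = mean_iou_from_counts(
    predictions=[0, 1, 1, 2], labels=[0, 1, 2, 2], num_classes=3
)
print(miou, wrong, correct)
```

Keeping two length-`num_classes` counters avoids materialising a full `num_classes x num_classes` confusion matrix, which is presumably the motivation behind that refinement.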
- 13 Jun, 2018 (7 commits)
  - Committed by qiaolongfei
  - Committed by qiaolongfei
  - Committed by qiaolongfei
  - Committed by qiaolongfei
  - Committed by yi.wu
  - Committed by qiaolongfei
  - Committed by qiaolongfei
- 12 Jun, 2018 (7 commits)
  - Committed by qiaolongfei
  - Committed by Xin Pan
  - Committed by tensor-tang
  - Committed by Wu Yi
    * wip
    * use executor.complete to end trainer
    * fix build
    * fix build with distribute off
    * fix typo
    * fix cmake typo
    * fix build
  - Committed by Luo Tao
  - Committed by qiaolongfei
  - Committed by qingqing01
    * Add normalization operator.
      1. Refine the raw norm_op and make it more general, so that it supports normalizing a Tensor along any axis.
      2. There was a bug in the l2_normalize API: it lacked sqrt after `reduce_sum`.
      3. Use norm_op to refine the l2_normalize API.
      4. Fix bug in test_normalization_wrapper.py.
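The normalization commit by qingqing01 on 12 Jun reports that l2_normalize lacked a sqrt after `reduce_sum`. A small sketch of why that step matters, in plain Python on a 1-D list: dividing by the sum of squares alone does not yield a unit-length vector. The function name `l2_normalize_1d` and the `epsilon` guard are assumptions for this illustration and do not reproduce the framework's API, which per the commit works along an arbitrary axis of a Tensor.

```python
import math


def l2_normalize_1d(values, epsilon=1e-12):
    """Hypothetical helper: scale a 1-D list so that its L2 norm is ~1."""
    sum_of_squares = sum(v * v for v in values)  # the reduce_sum step
    # The square root is the step the commit message describes as missing.
    norm = math.sqrt(sum_of_squares + epsilon)   # epsilon avoids division by zero
    return [v / norm for v in values]


vector = [3.0, 4.0]
normalized = l2_normalize_1d(vector)

# For contrast: dividing by the sum of squares without the sqrt.
sum_sq = sum(v * v for v in vector)
without_sqrt = [v / sum_sq for v in vector]

length = math.sqrt(sum(v * v for v in normalized))
length_without_sqrt = math.sqrt(sum(v * v for v in without_sqrt))
print(normalized, length)
print(without_sqrt, length_without_sqrt)
```

With the sqrt in place the result has length close to 1, while the variant without it has length 1/5 for this input.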
- 11 Jun, 2018 (17 commits)
  - Committed by whs
    * Add slice op.
    * Remove `using` from header file and fix doc.
    * Fix doc.
    * Small fix.
  - Committed by qiaolongfei
  - Committed by Luo Tao
  - Committed by qiaolongfei
  - Committed by gongweibao
  - Committed by Xin Pan
    The status is updated in the Process() thread and can be checked in another HandleRequest() thread.
  - Committed by qiaolongfei
  - Committed by Luo Tao
  - Committed by qiaolongfei
  - Committed by dzhwinter
    * "add inplace attribute"
    * "register inplace attribute"
    * "change se-next model for memory-reuse"
    * "fix typo"
    * repick
    * fix merge conflict
    * "fix stupid error"
  - Committed by qiaolongfei
  - Committed by gongweibao
  - Committed by qiaolongfei
  - Committed by chengduoZH
    Replace use_event with use_cuda: use_event means the program is running with CUDA, so use_cuda may be more intuitive.
  - Committed by qiaolongfei
  - Committed by gongweibao
  - Committed by chengduoZH