- 15 6月, 2018 4 次提交
-
-
由 tensor-tang 提交于
-
由 Yan Chunwei 提交于
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
- 14 6月, 2018 15 次提交
-
-
由 gongweibao 提交于
-
由 Yan Chunwei 提交于
-
由 chengduoZH 提交于
-
由 yuyang18 提交于
-
由 yuyang18 提交于
-
由 Luo Tao 提交于
-
由 tensor-tang 提交于
-
由 Qiyang Min 提交于
* 1. Create buddy allocator in each places before NcclBcast the variables 2. Check the memory usage of ALL gpus rather than the first one * 1. Make NCCLGroupGuard guards only the ncclBcast part, which avoid ncclGroupEnd blocking the exception throwing 2. NOTE the usage of NCCLGroupGuard * Remove the memory usage check of gpus * Fix code style
-
由 Yan Chunwei 提交于
-
由 whs 提交于
* Add mean_iou op. * Add unitest for mean iou op. * Add optional collections of confusion matrix and mean_iou. * Fix cuda kernel. * Refine code. 1. Merge computing in GPU to two kernel. 2. Use wrong array and correct array instead of confusion matrix. * Add python api and fix cuda kernel. * Fix comments. * Small fix. * Small fix.
-
由 qiaolongfei 提交于
-
由 Xin Pan 提交于
In cupti samples, only cuptiFlush is used. I can't find any places calling cuptiFinalize and this API can error out as not_implemented in some cuda installation.
-
由 Yang Yang(Tony) 提交于
* fix #11416 * make sgd check tape has been backwarded_ * add error message
-
由 Yang Yang(Tony) 提交于
* change dynamic to tape * update readme link
-
由 Yang Yang(Tony) 提交于
-
- 13 6月, 2018 10 次提交
-
-
由 qiaolongfei 提交于
-
由 yuyang18 提交于
-
由 yuyang18 提交于
-
由 qiaolongfei 提交于
-
由 qiaolongfei 提交于
-
由 qiaolongfei 提交于
-
由 yuyang18 提交于
-
由 yi.wu 提交于
-
由 qiaolongfei 提交于
-
由 qiaolongfei 提交于
-
- 12 6月, 2018 11 次提交
-
-
由 qiaolongfei 提交于
-
由 Xin Pan 提交于
-
由 yuyang18 提交于
* less_than * cumsum * multiplex * open_recordio_file
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
由 Wu Yi 提交于
* wip * use executor.complete to end trainer * fix build * fix build with distribute off * fix typo * fix cmake typo * fix build
-
由 Luo Tao 提交于
-
由 qiaolongfei 提交于
-
由 qingqing01 提交于
* Add normalization operator. 1. Refine the raw norm_op and let it more general to support to normalize Tensor along any axis. 2. There is a bug in l2_normalize API, which lacks sqrt after `reduce_sum`. 3. Use norm_op to refine the l2_normalize API. 4. Fix bug in test_normalization_wrapper.py.
-