- 19 6月, 2018 4 次提交
-
-
由 mozga-intel 提交于
-
由 Qiyang Min 提交于
* Add sub_blocks of lr_decay_op to pserver_prog after distribute_transpiler * Remove unused logs and logics * 1. Add ops to new block (considering the nested block condition) 2. Follow the original hierarchy of blocks 3. Change the function's name and remove debug lines
-
由 tensor-tang 提交于
-
由 gongweibao 提交于
-
- 18 6月, 2018 2 次提交
-
-
由 mozga-intel 提交于
-
由 Yan Chunwei 提交于
-
- 17 6月, 2018 4 次提交
-
-
由 yuyang18 提交于
-
由 gongweibao 提交于
-
由 qiaolongfei 提交于
-
由 qiaolongfei 提交于
-
- 16 6月, 2018 2 次提交
-
-
由 qiaolongfei 提交于
-
由 tensor-tang 提交于
-
- 15 6月, 2018 17 次提交
-
-
由 Kexin Zhao 提交于
* add lod_tensor util and modify pybind * refind pybind LoDTensor API and modify LoDTensor and DataFeeder test * fix test error * fix detection map op test * fix reorder_lod_tensor test * fix seq_concat_op * fix chunk evel op test * fix target assign op * fix warp ctc op * address comments step 1: reverse reset_lod op * step 2: modify op test * add warning message * remove has_valid_lod * add back has_valid_lod * address comments * add exception catching trial
-
由 gongweibao 提交于
-
由 Yibing Liu 提交于
-
由 qiaolongfei 提交于
-
由 qingqing01 提交于
* Update some doc about layers' API. * Fix format. * Fix example bug in random_data_generator. * Fix example bug in dropout. * Follow comments and some small fix for some examples.
-
由 qiaolongfei 提交于
-
由 chengduoZH 提交于
-
由 Yibing Liu 提交于
-
由 dzhwinter 提交于
-
由 Yibing Liu 提交于
-
由 Luo Tao 提交于
-
由 Yan Chunwei 提交于
-
由 dzhwinter 提交于
-
由 yi.wu 提交于
-
由 fengjiayi 提交于
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
- 14 6月, 2018 11 次提交
-
-
由 dzhwinter 提交于
-
由 chengduoZH 提交于
-
由 yi.wu 提交于
-
由 yuyang18 提交于
-
由 yuyang18 提交于
-
由 tensor-tang 提交于
-
由 fengjiayi 提交于
-
由 Qiyang Min 提交于
* 1. Create buddy allocator in each places before NcclBcast the variables 2. Check the memory usage of ALL gpus rather than the first one * 1. Make NCCLGroupGuard guards only the ncclBcast part, which avoid ncclGroupEnd blocking the exception throwing 2. NOTE the usage of NCCLGroupGuard * Remove the memory usage check of gpus * Fix code style
-
由 whs 提交于
* Add mean_iou op. * Add unitest for mean iou op. * Add optional collections of confusion matrix and mean_iou. * Fix cuda kernel. * Refine code. 1. Merge computing in GPU to two kernel. 2. Use wrong array and correct array instead of confusion matrix. * Add python api and fix cuda kernel. * Fix comments. * Small fix. * Small fix.
-
由 qiaolongfei 提交于
-
由 Xin Pan 提交于
In cupti samples, only cuptiFlush is used. I can't find any places calling cuptiFinalize and this API can error out as not_implemented in some cuda installation.
-