- 28 Jun, 2018 1 commit
-
-
Committed by Haichao Zhang
* Add fill_zeros_array op. This op is used for zero-filling an LoDTensorArray.
* Merge fill_zeros_array_op with fill_zeros_like_op
* Add unit test for fill_zeros_like for array
-
- 26 Jun, 2018 6 commits
-
-
Committed by yi.wu
-
Committed by yi.wu
-
Committed by yi.wu
-
Committed by yi.wu
-
Committed by chengduo
-
Committed by Tomasz Patejko
* elementwise_add with bcast: Brian's implementation added, with default bcasts
* elementwise_add with bcast: GetExpectedKernelType added to elementwise_op
* elementwise_add with bcast: use_mkldnn attribute added
* elementwise_add with bcast: changes after review and some formatting
* elementwise_add with bcast: changes after style check
* elementwise_add with bcast: changes after style check cont.
* elementwise_add with bcast: MKLDNN unittests added
* elementwise_add with bcast: original unittests with use_mkldnn flag
* elementwise_add with bcast: handling of MKLDNN format corrected
* elementwise_add with bcast: setting MKLDNN format turned into lambda
* elementwise_add with bcast: MKLDNN format setting turned into separate function
* elementwise_add with bcast: condition for choosing MKLDNN simplified
* elementwise_add with bcast: fix for MKLDNN format set incorrectly in bcasts
* elementwise_add with bcast: changes in unittests for broadcasts
* elementwise_add with bcast: fixes in unittests regarding dimensions
* elementwise_add with bcast: bring back correct format setting in mklml grad path
* elementwise_add with bcast: fixed compilation error
-
- 25 Jun, 2018 1 commit
-
-
Committed by yi.wu
-
- 22 Jun, 2018 4 commits
-
-
Committed by Yancey1989
-
Committed by Yancey1989
-
Committed by chengduo
-
Committed by Kexin Zhao
-
- 21 Jun, 2018 6 commits
-
-
Committed by chengduo
* Fix Parallel Exe (VarHandle's version)
* Fix broadcast
* Enhance ParallelExecutor stability
-
Committed by fengjiayi
-
Committed by chengduoZH
-
Committed by Xin Pan
small thread-safety fix and doc improvements.
-
Committed by chengduoZH
-
Committed by chengduoZH
-
- 20 Jun, 2018 3 commits
-
-
Committed by Xin Pan
-
Committed by Yancey1989
-
Committed by Yancey1989
-
- 19 Jun, 2018 1 commit
-
-
Committed by Qiyang Min
* Add sub_blocks of lr_decay_op to pserver_prog after distribute_transpiler
* Remove unused logs and logic
* 1. Add ops to new block (considering the nested block condition)
  2. Follow the original hierarchy of blocks
  3. Change the function's name and remove debug lines
-
- 16 Jun, 2018 2 commits
-
-
Committed by qiaolongfei
-
Committed by qiaolongfei
-
- 15 Jun, 2018 1 commit
-
-
Committed by Kexin Zhao
* add lod_tensor util and modify pybind
* refine pybind LoDTensor API and modify LoDTensor and DataFeeder test
* fix test error
* fix detection map op test
* fix reorder_lod_tensor test
* fix seq_concat_op
* fix chunk eval op test
* fix target assign op
* fix warp ctc op
* address comments step 1: reverse reset_lod op
* step 2: modify op test
* add warning message
* remove has_valid_lod
* add back has_valid_lod
* address comments
* add exception catching trial
-
- 14 Jun, 2018 3 commits
-
-
Committed by tensor-tang
-
Committed by Qiyang Min
* 1. Create buddy allocator in each place before NcclBcast the variables
  2. Check the memory usage of ALL gpus rather than the first one
* 1. Make NCCLGroupGuard guard only the ncclBcast part, which avoids ncclGroupEnd blocking the exception throwing
  2. Note the usage of NCCLGroupGuard
* Remove the memory usage check of gpus
* Fix code style
-
Committed by Yang Yang(Tony)
-
- 13 Jun, 2018 3 commits
-
-
Committed by qiaolongfei
-
Committed by qiaolongfei
-
Committed by qiaolongfei
-
- 12 Jun, 2018 4 commits
-
-
Committed by Yancey1989
-
Committed by tensor-tang
-
Committed by Yancey1989
-
Committed by Wu Yi
* wip
* use executor.complete to end trainer
* fix build
* fix build with distribute off
* fix typo
* fix cmake typo
* fix build
-
- 11 Jun, 2018 5 commits
-
-
Committed by qiaolongfei
-
Committed by dzhwinter
* "add inplace attribute"
* "register inplace attribute"
* "change se-next model for memory-reuse"
* "fix typo"
* repick
* fix merge conflict
* "fix stupid error"
-
Committed by gongweibao
-
Committed by chengduoZH
Replace use_event with use_cuda: use_event means the program runs with CUDA, so use_cuda may be more intuitive.
-
Committed by gongweibao
-