- 27 Jul, 2021 1 commit
-
-
Committed by wuhuachaocoding
-
- 31 Mar, 2021 1 commit
-
-
Committed by Thunderbrook
* fix split core
* format
-
- 11 May, 2020 1 commit
-
-
Committed by Chen Weihang
* add new macro BOOST_GET_SAFELY & unittests, test=develop
* add different macro type, test=develop
* fix get macro type in executor, test=develop
* four macro part change backup
* using one macro for all case, test=develop
* revert attribute change, test=develop
* change to three func to solve gcc4.8 bug, test=develop
* polish some details, test=develop
-
- 11 Sep, 2019 1 commit
-
-
Committed by Huihuang Zheng
TemporaryAllocator is a singleton used to allocate memory for cuDNN. Because it is a singleton, removing it improves memory usage. TemporaryAllocator is replaced by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for a stream and so avoid the singleton. Also adds data_feed_proto to operator to fix CI for CPU compilation.
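The stream-callback idea in this commit message can be illustrated with a minimal sketch. This is not the code from the commit: `FakeStream`, `StreamScopedAllocation`, and `LiveAfterScope` are hypothetical names, and the stream is simulated on the host so the example runs without a GPU. In real CUDA code the callback would presumably be registered through something like `cudaStreamAddCallback`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <functional>
#include <queue>
#include <utility>

// Hypothetical stand-in for a CUDA stream: queued callbacks run in FIFO order
// when the stream is synchronized.
class FakeStream {
 public:
  void AddCallback(std::function<void()> cb) { callbacks_.push(std::move(cb)); }

  // Simulates the stream reaching (and running) every queued callback.
  void Synchronize() {
    while (!callbacks_.empty()) {
      callbacks_.front()();
      callbacks_.pop();
    }
  }

 private:
  std::queue<std::function<void()>> callbacks_;
};

// Hypothetical allocation tied to one stream. Destroying the object does not
// free the buffer immediately; it enqueues a callback that frees it once the
// stream has processed everything queued before it. No singleton is involved.
class StreamScopedAllocation {
 public:
  StreamScopedAllocation(FakeStream* stream, std::size_t bytes, int* live_count)
      : stream_(stream), ptr_(std::malloc(bytes)), live_count_(live_count) {
    ++*live_count_;
  }

  ~StreamScopedAllocation() {
    void* ptr = ptr_;
    int* live = live_count_;
    stream_->AddCallback([ptr, live] {
      std::free(ptr);
      --*live;
    });
  }

  StreamScopedAllocation(const StreamScopedAllocation&) = delete;
  StreamScopedAllocation& operator=(const StreamScopedAllocation&) = delete;

  void* ptr() const { return ptr_; }

 private:
  FakeStream* stream_;
  void* ptr_;
  int* live_count_;
};

// Returns how many buffers are still alive after the allocation goes out of
// scope, optionally after synchronizing the simulated stream.
int LiveAfterScope(bool synchronize) {
  FakeStream stream;
  int live = 0;
  {
    StreamScopedAllocation scratch(&stream, 256, &live);
    // ... work that uses scratch.ptr() would be queued on the stream here ...
  }
  if (synchronize) stream.Synchronize();
  int result = live;
  stream.Synchronize();  // Drain any pending callbacks so nothing leaks.
  return result;
}
```

In this sketch the buffer outlives its owning object until the stream drains, which is the property that lets temporary memory be released per stream rather than through a global allocator.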
-
- 12 Jun, 2019 1 commit
-
-
Committed by Yiqun Liu
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979) test=develop
-
- 29 May, 2019 1 commit
-
-
Committed by Yiqun Liu
Optimize the concat and split kernel for special cases when the number of inputs/outputs is 2 (#17415)
* Optimize the concat and split kernel for special cases that the number of inputs/outputs is 2. test=develop
* Refine codes. test=develop
* Correct the condition. test=develop
* Move the define of tmp_data outside the if statement.
* Print the cudnn minor version. test=develop
* Fix the case when in_num/o_num is 1 in concat/split op. test=develop
* Remove const_cast. test=develop
-
- 25 Dec, 2018 1 commit
-
-
Committed by chengduo
* refine tensor test=develop
* refine tensor test=develop
* fix device_context log test=develop
-
- 21 Dec, 2018 2 commits
-
-
Committed by chengduo
* Add Temporary Allocator
* add Temporary Allocator to DeviceContext test=develop
* code refine test=develop
* fix mean_iou test=develop
* Add DeviceTemporaryAllocator test=develop
* fix conv_op bug test=develop
* small fix test=develop
* code refine test=develop
* log refine test=develop
* fix unit test test=develop
* move double check
* refine concat_and_split test=develop
* add limit_of_temporary_allocation test=develop
* fix name test=develop
-
Committed by minqiyang
test=develop
-
- 20 Dec, 2018 1 commit
-
-
Committed by minqiyang
-
- 23 Oct, 2018 1 commit
-
-
Committed by chengduo
* speedup split_op test=develop
* speedup split_op test=develop
* rename ConcatGrad to Split
* refine concat and split test=develop
* fix compile error
-
- 18 Sep, 2018 1 commit
-
-
Committed by chengduoZH
-
- 17 Sep, 2018 1 commit
-
-
Committed by chengduoZH
-
- 27 Aug, 2018 1 commit
-
-
Committed by dzhwinter
-
- 22 Aug, 2018 1 commit
-
-
Committed by Yu Yang
-
- 19 Jun, 2018 2 commits
-
-
Committed by qiaolongfei
-
Committed by qingqing01
* Make the CUDA kernel of concat correct and fix unit tests.
-
- 17 Jun, 2018 1 commit
-
-
Committed by qiaolongfei
-
- 02 May, 2018 1 commit
-
-
Committed by Abhinav Arora
* Fix CPPLint issues in lstm_cpu_kernel.h
* Fix CPPLint issues in math/math_function_test
* Fix CPPLint issues in math/math_function_test
* Fix CPPLint issues in math/concat.cc
* Fix CPPLint issues in math/concat.cc
* Fix CPPLint issues in math/concat.cc
* Fix CPPLint issues in math/gru_cpu_kernel
* Fix CPPLint issues in math/selected_rows_functor_test.cu
* Fix compile error
* Fix compile error
-
- 30 Apr, 2018 1 commit
-
-
Committed by dzhwinter
* "re-commit"
* "picked up"
* "fix ci"
* "fix pdb hang up issue in cuda 9"
-
- 23 Mar, 2018 2 commits
-
-
Committed by chengduoZH
-
Committed by chengduoZH
-
- 05 Mar, 2018 2 commits
-
-
Committed by chengduoZH
-
Committed by chengduoZH
-
- 03 Mar, 2018 1 commit
-
-
Committed by chengduoZH
-
- 02 Mar, 2018 1 commit
-
-
Committed by chengduoZH
-