- 13 Jul, 2021 1 commit
-
-
Committed by Haohongxiang
* Fix gather_op by adding OutOfRangeCheck for param[Index]
* Code Optimization
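The kind of out-of-range check this commit message refers to can be sketched roughly as follows. This is an illustration only, not the code from the commit: `GatherRowsChecked` is a hypothetical helper, and the flat row-major layout and the use of exceptions are assumptions made for the example.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper (not Paddle's kernel): gathers whole rows from a
// row-major [num_rows x row_width] matrix stored in a flat vector and
// rejects any index that falls outside [0, num_rows).
std::vector<float> GatherRowsChecked(const std::vector<float>& input,
                                     std::int64_t row_width,
                                     const std::vector<std::int64_t>& index) {
  if (row_width <= 0 ||
      input.size() % static_cast<std::size_t>(row_width) != 0) {
    throw std::invalid_argument("input size must be a multiple of row_width");
  }
  const std::int64_t num_rows =
      static_cast<std::int64_t>(input.size()) / row_width;

  std::vector<float> output;
  output.reserve(index.size() * static_cast<std::size_t>(row_width));
  for (std::size_t i = 0; i < index.size(); ++i) {
    const std::int64_t row = index[i];
    // The range check: fail loudly instead of reading past the buffer.
    if (row < 0 || row >= num_rows) {
      throw std::out_of_range("index[" + std::to_string(i) + "] = " +
                              std::to_string(row) + " is outside [0, " +
                              std::to_string(num_rows) + ")");
    }
    const auto first = input.begin() + row * row_width;
    output.insert(output.end(), first, first + row_width);
  }
  return output;
}
```

With a 3 x 2 input, indices `{2, 0}` copy the third and first rows, while an index of `3` raises `std::out_of_range` rather than reading unrelated memory.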
-
- 11 Jun, 2021 1 commit
-
-
Committed by ShenLiang
* fix gather shape bug
* fix None
* fix topo
-
- 10 Dec, 2020 1 commit
-
-
Committed by ShenLiang
-
- 09 Nov, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 23 Aug, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 11 Jul, 2020 1 commit
-
-
Committed by Chen Weihang
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop
* replace old macro & for condition, test=develop
* polish details, test=develop
-
- 14 May, 2020 1 commit
-
-
Committed by ShenLiang
-
- 11 May, 2020 1 commit
-
-
Committed by Chen Weihang
* add new macro BOOST_GET_SAFELY & unittests, test=develop
* add different macro type, test=develop
* fix get macro type in executor, test=develop
* four macro part change backup
* using one macro for all case, test=develop
* revert attribute change, test=develop
* change to three func to solve gcc4.8 bug, test=develop
* polish some details, test=develop
-
- 11 Sep, 2019 1 commit
-
-
Committed by Huihuang Zheng
TemporaryAllocator is a singleton used to allocate memory for cuDNN. Because it is a singleton, we can delete it for better memory performance. We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for the stream and so avoid the singleton. Also added data_feed_proto to operator to fix CI in CPU compilation.
-
- 30 Aug, 2019 1 commit
-
-
Committed by ShenLiang
* fixed the code for coverage
* fixed the document, test=document_preview test=develop
-
- 14 Aug, 2019 1 commit
-
-
Committed by chengduo
* fix gather op bug test=develop
-
- 25 May, 2019 1 commit
-
-
Committed by hutuxian
* gather_op support int64_t index by adding a template typename
* add UT and rename typename test=develop
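Supporting both 32-bit and 64-bit indices through an extra template typename, as this commit message describes, might look like the sketch below. `GatherElements` and its parameter names are hypothetical and are not taken from the Paddle source. The `static_assert` restricting the index type is an assumption added for illustration.

```cpp
#include <cstddef>
#include <cstdint>
#include <type_traits>
#include <vector>

// Hypothetical sketch: the index element type is a second template
// parameter, so one implementation serves int32_t and int64_t indices.
template <typename T, typename IndexT>
std::vector<T> GatherElements(const std::vector<T>& input,
                              const std::vector<IndexT>& index) {
  static_assert(std::is_same<IndexT, std::int32_t>::value ||
                    std::is_same<IndexT, std::int64_t>::value,
                "index type must be int32_t or int64_t");
  std::vector<T> output;
  output.reserve(index.size());
  for (const IndexT i : index) {
    // vector::at throws std::out_of_range for an invalid position; a
    // negative index becomes a huge unsigned value and is rejected too.
    output.push_back(input.at(static_cast<std::size_t>(i)));
  }
  return output;
}
```

The compiler deduces `IndexT` from the index vector, so callers pass either `std::vector<std::int32_t>` or `std::vector<std::int64_t>` without any other change.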
-
- 26 Mar, 2019 1 commit
-
-
Committed by Xin Pan
test=develop
-
- 12 Nov, 2018 1 commit
-
-
Committed by Yibing Liu
* Add int type support for stack_op
* Improve gather op to support index with shape N x 1 test=develop
* Fix stack_op kernel's registry test=develop
-
- 12 Feb, 2018 1 commit
-
-
Committed by qingqing01
-
- 10 Feb, 2018 2 commits
- 26 Dec, 2017 1 commit
-
-
Committed by Luo Tao
-
- 12 Dec, 2017 1 commit
-
-
Committed by QI JUN
The main fixes are:
- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 04 Oct, 2017 1 commit
-
-
Committed by zchen0211
-
- 03 Oct, 2017 1 commit
-
-
Committed by zchen0211
-
- 29 Sep, 2017 2 commits