- 20 Nov, 2019 1 commit
-
-
Committed by zhaoyuchen2018
* Fix topk compile failure on Windows
* Use explicit cast when assigning data
-
- 14 Nov, 2019 1 commit
-
-
Committed by zhaoyuchen2018
* Improve topk performance. Computing topk over 200000 input values cost 1 s before optimization and 0.0028 s after.
* Refine return value.
* Add CUDA util functions.
* Fix ComputeBlockSize bug & refine comments.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
- 30 Aug, 2019 1 commit
-
-
Committed by Tao Luo
test=develop
-
- 26 Dec, 2018 2 commits
-
-
Committed by whs
* Make topk op support variable k. test=develop
* Fix tensor type. test=develop
-
Committed by Wu Yi
* wip
* wip
* wip
* wip for test
* add fp16 tests test=develop
* fix cpu build test=develop
* fix test=develop
* fix py3 tests test=develop
* fix lr_scheduler dtype test=develop
* fix test=dvelop
* test fix ci compile test=develop
* fix build and merge test=develop
* fallback momentumop change to general test=develop
* make fp16 lr schedule simple test=develop
* fix ut test=develop
* fix tests test=develop
* remove fp16 learning rate cast test=develop
-
- 20 Dec, 2018 2 commits
-
-
Committed by typhoonzero
This reverts commit 3d750f9c.
-
Committed by Wu Yi
* wip
* wip
* wip
* wip for test
* add fp16 tests test=develop
* fix cpu build test=develop
* fix test=develop
* fix py3 tests test=develop
* fix lr_scheduler dtype test=develop
* fix test=dvelop
* test fix ci compile test=develop
* fix build and merge test=develop
* fallback momentumop change to general test=develop
-
- 29 Oct, 2018 1 commit
-
-
Committed by dzhwinter
-
- 26 Oct, 2018 2 commits
- 24 Oct, 2018 1 commit
-
-
Committed by qingqing01
1. Fix CUDA kernel when height is larger than 2048.
2. Support input with more than 2 dimensions.
3. Fix unit test when k is larger than 1.
4. Enhance unit testing.
test=develop
-
- 08 Oct, 2018 1 commit
-
-
Committed by qingqing01
-
- 17 Aug, 2018 1 commit
-
- 16 Aug, 2018 1 commit
-
-
Committed by dzhwinter
* "cherry picked operators changes"
* "remove duplicated code"
* "add constant setter"
* "add get expected kernel"
* "fix ci"
* "add fill constant"
-
- 04 May, 2018 1 commit
-
-
Committed by chengduoZH
-
- 03 May, 2018 1 commit
-
-
Committed by dzhwinter
* "fix double type error"
* "fix ci"
* "softmax fp64"
* "fix momentum"
* "fix ci"
-
- 02 May, 2018 1 commit
-
-
Committed by chengduoZH
-
- 13 Apr, 2018 1 commit
-
-
Committed by Abhinav Arora
* Fix CPPLint errors in operators
* Fix prior box op
* Fix Prior Box op
* Fix top_k_op.cu
* Fix pool mkmldnn
* Fix pool mkmldnn
-
- 12 Feb, 2018 1 commit
-
-
Committed by qingqing01
-
- 10 Feb, 2018 2 commits
- 26 Dec, 2017 1 commit
-
-
Committed by Luo Tao
-
- 25 Dec, 2017 1 commit
-
-
Committed by dzhwinter
-
- 12 Dec, 2017 1 commit
-
-
Committed by QI JUN
The main changes are:
- take `DeviceContext`, instead of `Place`, as the template parameter of math functors and OpKernel
- remove the `eigen_device` interface from the base class `DeviceContext`
- remove the `GetEigenDevice` interface from `ExecutionContext` and the base class `DeviceContext`
- remove the unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 31 Oct, 2017 1 commit
-
-
Committed by fengjiayi
* Fix Type error
* Fix error
* Fix top_k_op GPU code data type
-
- 28 Sep, 2017 1 commit
-
-
Committed by Yu Yang
-
- 23 Sep, 2017 1 commit
-
-
Committed by caoying03
-
- 07 Sep, 2017 1 commit
-
-
Committed by 武毅
* init add
* add topk op
* some update
* fix style check
* add test py file
* update top k cuda kernel
* follow comments
* remove debug print
* fix casting error
* fix casting error
* fix casting error
* fix rename bug...
* fix travis
-