lite/kernels/cuda/fetch_compute.cc · f6461e395e83e23a94b4ddd07bf22f39c55455e6 · PaddlePaddle / Paddle-Lite

Committed by Wilber on Mar 17, 2020

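- Add a CUDA C++ demo.
- Detection models usually end with multiclass_nms, which is a host kernel. If the fetch kernel were a CUDA kernel, a useless io_copy (host->cuda) would be inserted at that point. For this reason, the CUDA fetch kernel is commented out and the host fetch kernel is used by default. Note the implicit behavior this introduces: after every predictor run, the output data is copied from CUDA to the CPU by default.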
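
The reasoning in the second item can be illustrated with a toy model. This is a minimal sketch, not Paddle-Lite's actual pass: the `Target`, `Kernel`, `InsertIoCopies`, and `CountCopies` names are hypothetical, and it assumes a copy is inserted wherever two consecutive kernels run on different targets.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy model (hypothetical, not Paddle-Lite code): each kernel runs on one target.
enum class Target { kHost, kCUDA };

struct Kernel {
  std::string name;
  Target target;
};

// Insert an io_copy wherever two consecutive kernels run on different targets.
std::vector<Kernel> InsertIoCopies(const std::vector<Kernel>& kernels) {
  assert(!kernels.empty());
  std::vector<Kernel> out;
  for (std::size_t i = 0; i < kernels.size(); ++i) {
    if (i > 0 && kernels[i - 1].target != kernels[i].target) {
      const bool to_cuda = kernels[i].target == Target::kCUDA;
      out.push_back({to_cuda ? "io_copy(host->cuda)" : "io_copy(cuda->host)",
                     kernels[i].target});
    }
    out.push_back(kernels[i]);
  }
  return out;
}

// Count the io_copy kernels in a kernel sequence.
std::size_t CountCopies(const std::vector<Kernel>& kernels) {
  std::size_t n = 0;
  for (const Kernel& k : kernels) {
    if (k.name.rfind("io_copy", 0) == 0) ++n;  // name starts with "io_copy"
  }
  return n;
}
```

In this model, a sequence of a CUDA kernel, a host `multiclass_nms`, and a CUDA fetch gets two copies, the second being the useless host->cuda one. With a host fetch, only the cuda->host copy before `multiclass_nms` remains. For a model whose last kernel runs on CUDA, the host fetch forces a cuda->host copy, which matches the implicit behavior the commit message describes.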


fetch_compute.cc 2.7 KB
