Created by: jiweibo
https://github.com/PaddlePaddle/Paddle-Lite/pull/2166
ci 代理挂掉,关掉此pr,重提Pr,见- 修复了yolobox cuda实现的bug,使得输出与fluid对齐
- 添加了@NHZIX忘记添加的conv_op_cache_cudnn文件
yolobox中关于anchor data memcpy from Host to Device的过程,MemcpyAsync没有收益,延用了之前的写法
yolov3模型,bakckbone耗时48ms,backbone with head耗时53.1ms