Forked from PaddlePaddle / PaddleDetection
Use DeviceContext, not Place, to get the stream
* With unit tests
* Also complete `memcpy`