Forked from PaddlePaddle / Paddle
* Get the default calc stream from the execution context instead of the global device context pool.
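
The change above can be pictured with a minimal, self-contained sketch. Every type and function name here (`Stream`, `DeviceContext`, `GlobalDeviceContextPool`, `ExecutionContext`, and the two lookup helpers) is a hypothetical stand-in and not Paddle's actual API; the sketch only contrasts resolving the stream through a process-wide pool keyed by device with reading it from the context handed to the running op. It assumes, for illustration, that the two can differ.

```cpp
#include <map>
#include <memory>

// NOTE: all names in this block are hypothetical stand-ins for illustration,
// not Paddle's real classes or signatures.

struct Stream {
  int id;
};

class DeviceContext {
 public:
  explicit DeviceContext(int stream_id) : stream_{stream_id} {}
  const Stream& stream() const { return stream_; }

 private:
  Stream stream_;
};

// Process-wide pool holding one default context per device id.
class GlobalDeviceContextPool {
 public:
  static GlobalDeviceContextPool& Instance() {
    static GlobalDeviceContextPool pool;
    return pool;
  }

  // Lazily creates the default context for a device; its stream id is
  // device_id * 100 so it is easy to tell apart in the example.
  const DeviceContext& Get(int device_id) {
    auto it = contexts_.find(device_id);
    if (it == contexts_.end()) {
      it = contexts_
               .emplace(device_id,
                        std::make_unique<DeviceContext>(device_id * 100))
               .first;
    }
    return *it->second;
  }

 private:
  std::map<int, std::unique_ptr<DeviceContext>> contexts_;
};

// Per-invocation context passed to an op; it carries the device context
// the op is actually running on.
struct ExecutionContext {
  int device_id;
  const DeviceContext* dev_ctx;

  const DeviceContext& device_context() const { return *dev_ctx; }
};

// Before: look the stream up through the global pool, ignoring whichever
// context the op was launched with.
inline const Stream& CalcStreamFromGlobalPool(const ExecutionContext& ctx) {
  return GlobalDeviceContextPool::Instance().Get(ctx.device_id).stream();
}

// After: take the stream from the execution context itself.
inline const Stream& CalcStreamFromExecutionContext(
    const ExecutionContext& ctx) {
  return ctx.device_context().stream();
}
```

In this sketch, if an op is launched with a context whose stream is not the pool default for that device, only `CalcStreamFromExecutionContext` returns the stream the op is really using; the pool-based lookup returns the device default.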