Created by: zhiqiu
On CUDA place, elementwise_pow fails when input integers, for example,
elementwise_pow
This PR fixes that.