Created by: heavengate
fix elementwise_mod floating-point kernel
use fmod(b + fmod(a, b), b)
to calculate a stable mod value that is non-negative when b is positive
same as PyTorch: https://github.com/pytorch/pytorch/blob/f3b15727c58ab9d0010a03fd90c124f0727826e3/torch/csrc/jit/register_prim_ops.cpp#L2844
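
A minimal sketch of the formula above, written in Python for illustration only (the actual kernel is not shown here). `stable_mod` is a hypothetical name; `math.fmod` is assumed to behave like C `fmod`, whose result takes the sign of the dividend a.

```python
import math


def stable_mod(a: float, b: float) -> float:
    """Hypothetical helper illustrating fmod(b + fmod(a, b), b).

    The inner fmod reduces a into (-|b|, |b|); adding b and reducing
    again moves the result to the side of zero that matches b's sign.
    """
    return math.fmod(b + math.fmod(a, b), b)


# Plain fmod keeps the sign of the dividend, so a negative a gives a
# negative remainder; the two-step form does not.
plain = math.fmod(-1.0, 3.0)
fixed = stable_mod(-1.0, 3.0)
print(plain, fixed)
```

With a negative divisor the result is non-positive instead, for example `stable_mod(1.0, -3.0)`, which matches the sign convention of Python's own `%` operator.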