PaddlePaddle / Paddle

Add FP16 support for mul op

Created by: kexinzhao

Add an FP16 compute kernel to the mul op so that it can call the FP16 GEMM math function, which uses the cuBLAS FP16 kernel on GPU.
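For readers unfamiliar with what an FP16 GEMM implies numerically, here is a minimal pure-Python sketch. It is not Paddle's kernel and does not call cuBLAS; the helper names `to_fp16` and `gemm_fp16` are hypothetical, and accumulating in a wider type before rounding the result is an assumption made for illustration, not a statement about how the cuBLAS FP16 kernel accumulates.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value.

    Uses the stdlib struct format 'e' (binary16); hypothetical helper.
    """
    return struct.unpack("e", struct.pack("e", x))[0]


def gemm_fp16(a, b):
    """Naive matrix multiply C = A @ B with FP16-rounded inputs and outputs.

    Assumption: products are accumulated in a Python float and only the
    final sum is rounded to FP16. Real GPU kernels may behave differently.
    """
    m, k, n = len(a), len(b), len(b[0])
    a16 = [[to_fp16(v) for v in row] for row in a]
    b16 = [[to_fp16(v) for v in row] for row in b]
    out = []
    for i in range(m):
        row = []
        for j in range(n):
            acc = 0.0
            for p in range(k):
                acc += a16[i][p] * b16[p][j]
            row.append(to_fp16(acc))
        out.append(row)
    return out


if __name__ == "__main__":
    # Small integers are exactly representable in FP16, so this matches FP32.
    print(gemm_fp16([[1.0, 2.0], [3.0, 4.0]], [[5.0, 6.0], [7.0, 8.0]]))
    # 0.1 is not exactly representable; FP16 keeps roughly 3 decimal digits.
    print(to_fp16(0.1))
```

The sketch only shows why FP16 results can differ slightly from FP32 ones: values are stored with a 10-bit mantissa, so inputs such as 0.1 are rounded before the multiply.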