Created by: kexinzhao
fix #9175 (closed)
Only added and verified float16 kernel for the inference mode of cudnn batch norm kernel, which is needed to run vgg/resnet inference.
OpTest.np_dtype_to_fluid_dtype is to change the dtype of a numpy array from float16 to uint16 so that it can correctly bind with paddle float16 in tensor_py.h.