Created by: kexinzhao
As the first stage of the float16 inference work is coming to an end, we need a design doc to explain what we have done.