* first commit * add fp16 ctest files for compare op * add cpu register of float16 for compare ops
* Move compare OPs to phi * Fix bug * Use BroadcastKernel and ElementwiseKernel in phi