Fork自 PaddlePaddle / Paddle
* add conv_op_npu and test * add more tests * clean headers & support fp16 * update