* use BatchMatMul
* replace TensorCopy with ShareDataWith
* remove check fp16 grad
* fix format
* add grad_check
* fix grad check