Fork自 PaddlePaddle / Paddle
* fix c_split bug * fix utest * add c_embedding for tensorparallel
* add c_concat op