* auto parallel sharding base * chmod * add unitest * set unitest cmake dist label * revise code according to rewiew * chmod