Created by: sandyhouse
set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op move the assertion (input.shape[0] must be divisible by nranks) from compiler time to run time