* reformat hybrid_parallel_util.py by black
* add fused_allreduce_gradients_with_group
* add scale
* fix ci