“5c4dfdebcb12d17b8fe3090b874a496ea38dfcf4”上不存在“paddle/operators/op_documentation/name_convention.md”
* add gradient merge for DistributedFusedLamb * use master acc gradient * fix CI ut * polish * remove math_function_impl.h change * fix test_update_loss_scaling_op.py * try to fix XPU/NPU CI * add gm ut