* Add fused_gate_attention API. (#53432) * Add PADDLE_THROW in take_along_axis kernel when the datatype of index is wrong. (#53556)