• L
    Refactor the organization of layer_norm cuda impl. (#34883) · 7f5eb533
    Li Min 提交于
    Refactor the organization of layer_norm cuda impl so that it can be reused in fused attention op.
    
        Extract the layer_norm cuda impl form layer_norm_op.cu to layer_norm_kernel.cu.h.
        Define fused/attention_layer_norm.h, which can be used in fused attention op in next PR.
    7f5eb533
layer_norm_kernel.cu.h 33.3 KB