    [Paddle-TRT] GroupNorm int8 nchw32 fake kernel (#50146) · d93c63a0
    zhoutianzi666 committed
    * add fmha_flashattention oss plugin
    
    * add fmhca
    
    * add oss fmhca
    
    * code reconstruct and add ut
    
    * code style refine
    
    * fix ut and enforce check
    
    * refine trt version check
    
    * refine compile
    
    * fix compile
    
    * fix cross ut
    
    * code refine
    
    * use runtime trt version check
    
    * bug fix and code refine
    
    * compile fix
    
    * merge develop
    
    * add GN QDQ kernel
    
    * support GN int8 fake kernel
    
    * add with_int8
    
    * add GN int8 fake kernel
    
    * add GN int8 UT
    
    * add version > 8000 in GN int8 UT
    
    * add some check in .cu
    
    * add stdlib.h in UT
    
    * little change in .cu
    
    * remove rand_r, use rand
    
    * remove use rand
    
    * setAxis(1)
    
    * when int8 is on, allow fallback to fp16
    
    ---------
    Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>
engine.h 30.5 KB