[Paddle-TRT] GroupNorm int8 nchw32 fake kernel (#50146)
* add fmha_flashattention oss plugin
* add fmhca
* add oss fmhca
* code reconstruct and add ut
* code style refine
* fix ut and enforce check
* refine trt version check
* refine compile
* fix compile
* fix cross ut
* code refine
* use runtime trt version check
* bug fix and code refine
* compile fix
* merge develop
* add GN QDQ kernel
* support GN int8 fake kernel
* add with_int8
* add GN int8 fake kernel
* add GN int8 UT
* add version > 8000 in GN int8 UT
* add some check in .cu
* add stdlib.h in UT
* little change in .cu
* remove rand_r, use rand
* remove use of rand
* setAxis(1)
* when int8 is on, allow fallback to fp16
---------
Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>