fix(quant): fix quant compute error
while filter weight is quanted elementwise, use maximum of weight_scale instead of mininum to keep quant weight in range of int8
Showing
想要评论请 注册 或 登录
while filter weight is quanted elementwise, use maximum of weight_scale instead of mininum to keep quant weight in range of int8