Created by: lidanqing-intel
Change ComputeINT8 to template version so that a lot of if-else for checking dst_dt could be ommitted. @jczaja
In this version, ComputeFP32 and ComputeINT8 are still seperate. As which design will be used hasn't been decided, so I will open another PR for combined one in a moment. @wojtuss