Created by: Sand3r-
PR types: New features
PR changes: OPs
Describe:
- PR introduces oneDNN forward kernels for:
- INT8 elementwise_add
- INT8 elementwise_mul
- FP32 elementwise_mul
- It also fixes 2 bugs:
- When using ShareBufferWith (inplace operators), the type wasn't assigned to the tensor to which the buffer was shared with. This resulted in hard to detect memory crashes when attempting to test inplace elementwise int8 operators (size was allocated in ints, while fetch_op required size allocated for floats).
- OpTester has can now succesfully output uint8 results to the console. Previously arrays of uint8 data were treated as encoded byte strings instead of arrays of numbers.
- At the same time, the following has been removed:
- An old elementwise_mul kernel that was crafted for a special case in ResNeXt-50. This special case (and many others) should now be fully covered inside of the new kernel
- It's unit tests
PRs to be merged before this one:
#24853 #24854 #24855