Add AXPY oneDNN handler (#33632)
* Add oneDNN AXPY handler.
* Add fallback for small tensors.
* Fix ifdefs
* Remove unnecessary namespace prefixes and add missing headers.
* Guard handler_axpy with proper ifdefs.
* Compilation of this function is possible only when Paddle is not build
with CUDA nor HIP.
* Move AXPY handler code to separate files.
* Use oneDNN AXPY handler in SGD op.
* Use axpy handler only when Paddle is built with oneDNN.
* Add test for SUM BF16 with big rows.
* Fix SFINAE rules for elementwise_add_to.
* Add test case for SGD with big rows.
* update
* update
Co-authored-by: NAdam Osewski <adam.osewski@intel.com>
Showing
想要评论请 注册 或 登录