• L
    Add AXPY oneDNN handler (#33632) · 773aabc7
    lidanqing 提交于
    * Add oneDNN AXPY handler.
    
    * Add fallback for small tensors.
    
    * Fix ifdefs
    
    * Remove unnecessary namespace prefixes and add missing headers.
    
    * Guard handler_axpy with proper ifdefs.
    
    * Compilation of this function is possible only when Paddle is not build
    with CUDA nor HIP.
    
    * Move AXPY handler code to separate files.
    
    * Use oneDNN AXPY handler in SGD op.
    
    * Use axpy handler only when Paddle is built with oneDNN.
    
    * Add test for SUM BF16 with big rows.
    
    * Fix SFINAE rules for elementwise_add_to.
    
    * Add test case for SGD with big rows.
    
    * update
    
    * update
    Co-authored-by: NAdam Osewski <adam.osewski@intel.com>
    773aabc7
selected_rows_functor.cc 25.9 KB