1. 30 6月, 2021 1 次提交
    • J
      Added matmul_v2 BF16/FP32 FWD kernel (#33750) · 24783c84
      jakpiase 提交于
      * added matmul_v2 bf16/fp32 FWD kernel
      
      added matmul_v2 bf16/fp32 FWD kernel
      
      * added formatting
      
      * removed some tests due to timeout in CI
      
      * refactored tests
      
      * merged tests classes into one file
      
      * minor change
      
      * removed test guard for CUDA
      
      * remove skipIf
      
      * changes after review
      
      * formated one file
      
      * minor change
      
      * added skipping UT in CUDA place
      24783c84
  2. 24 6月, 2021 1 次提交
  3. 23 6月, 2021 1 次提交
    • J
      Added split op bf16/fp32 oneDNN kernel (#33584) · 68106509
      jakpiase 提交于
      * base changes for split op
      
      * 90% of split functionality added
      
      * full fp32 functionality
      
      * added bf16 test
      
      * added submemory caching
      
      * added bf test to static mode whitelist
      
      * minor change
      
      * enabled split op for inference
      
      * minor fix
      
      * minor fix
      68106509
  4. 21 6月, 2021 1 次提交
    • L
      Add AXPY oneDNN handler (#33632) · 773aabc7
      lidanqing 提交于
      * Add oneDNN AXPY handler.
      
      * Add fallback for small tensors.
      
      * Fix ifdefs
      
      * Remove unnecessary namespace prefixes and add missing headers.
      
      * Guard handler_axpy with proper ifdefs.
      
      * Compilation of this function is possible only when Paddle is not build
      with CUDA nor HIP.
      
      * Move AXPY handler code to separate files.
      
      * Use oneDNN AXPY handler in SGD op.
      
      * Use axpy handler only when Paddle is built with oneDNN.
      
      * Add test for SUM BF16 with big rows.
      
      * Fix SFINAE rules for elementwise_add_to.
      
      * Add test case for SGD with big rows.
      
      * update
      
      * update
      Co-authored-by: NAdam Osewski <adam.osewski@intel.com>
      773aabc7
  5. 16 6月, 2021 1 次提交
  6. 27 5月, 2021 1 次提交
  7. 26 5月, 2021 1 次提交
  8. 25 5月, 2021 1 次提交
  9. 22 5月, 2021 1 次提交
    • J
      Added oneDNN matmul grad BF16/FP32 kernel (#32968) · e2a3a6f7
      jakpiase 提交于
      * added support for most matmul cases
      
      * added more functionality
      
      * full functionality of matmul op, fp32 only
      
      * added bf16 tests and functionality
      
      * added formatting
      
      * changes after review
      
      * minor change
      
      * added reviewers suggestions
      e2a3a6f7
  10. 19 5月, 2021 1 次提交
  11. 14 5月, 2021 1 次提交
  12. 28 4月, 2021 1 次提交
  13. 21 4月, 2021 1 次提交
  14. 24 3月, 2021 1 次提交
  15. 09 3月, 2021 1 次提交
  16. 25 2月, 2021 1 次提交
  17. 23 2月, 2021 2 次提交
  18. 18 2月, 2021 1 次提交
    • J
      Add Conv Transpose BF16 (#30877) · caf9d398
      joanna.wozna.intel 提交于
      * Add conv transpose BF16
      
      * Share function GetWeightsTz
      
      * Adjust to review and fix op compatibility
      
      * Add bias to unique handler name
      
      * Remove errors related to paddle enforce
      
      * Add conv2d_transpose to bf16 list and kernel refator
      caf9d398
  19. 04 2月, 2021 1 次提交
  20. 28 1月, 2021 1 次提交
  21. 25 1月, 2021 1 次提交
  22. 20 1月, 2021 1 次提交
  23. 12 1月, 2021 1 次提交
  24. 11 1月, 2021 1 次提交
  25. 09 1月, 2021 1 次提交
  26. 31 12月, 2020 1 次提交
  27. 24 12月, 2020 1 次提交
  28. 23 12月, 2020 1 次提交
  29. 14 12月, 2020 2 次提交
  30. 08 12月, 2020 1 次提交
    • J
      added internal and external reorders to profiler (#29443) · 57a4f16d
      jakpiase 提交于
      * added external reorder to profiler
      
      * added external and internal reorders to profiler
      
      * added internal and external reorder to profiler
      
      * added formatting to int/ext reorder commit
      
      * removed unnecessary comment
      57a4f16d
  31. 30 11月, 2020 1 次提交
  32. 26 11月, 2020 1 次提交
  33. 23 11月, 2020 1 次提交
  34. 20 11月, 2020 1 次提交
  35. 17 11月, 2020 1 次提交
  36. 06 11月, 2020 1 次提交
  37. 05 11月, 2020 1 次提交
  38. 03 11月, 2020 1 次提交