1. 08 10月, 2021 1 次提交
  2. 07 10月, 2021 1 次提交
  3. 05 10月, 2021 1 次提交
    • J
      Added concat BF16/FP32 BWD OneDNN kernel (#35889) · dc4d5719
      jakpiase 提交于
      * tmp
      
      * added concat BF16/FP32 BWD oneDNN kernel
      
      * minor change
      
      * minor change
      
      * fix for CI
      
      * added formatting
      
      * Reverted deleting static keyword
      
      * added reviewers suggestions
      
      * reverted deleting concat bf16 test file
      
      * fixed concat tests
      dc4d5719
  4. 27 9月, 2021 1 次提交
  5. 24 9月, 2021 1 次提交
  6. 21 9月, 2021 1 次提交
    • A
      Reuse OneDNN handler for SGD and SUM for SelectedRows input tensors. (#35510) · 799f3861
      Adam Osewski 提交于
      * Create stateful OneDNNAXPYHandler object.
      
      This makes it possible to call it multiple times without recreating the
      oneDNN primitives every time.
      
      * Prepare SGDOpKernel to reuse its implementation from OneDNN kernel.
      
      * OneDNN SGD kernel.
      
      * Update call to use new OneDNNAXPYHandler object api.
      
      * Setup seed in proper place.
      
      * Enable OneDNN kernel only for single case.
      
      * For dense param and sparse grad.
      
      * Small refactor.
      
      * Enable oneDNN by op attr or by cmd line flag.
      
      * Use int64_t type for number of elements.
      
      * Support dense param and grad from OneDNN kernel.
      
      * Enable SGD OneDNN kernel when use MP BF16 optimizer.
      
      * Force non-copyable/movable OneDNNAXPYHandler.
      
      * Reuse OneDNNAXPYHandler for spare tensors in SUM op.
      
      * Fix SFINAE rules.
      
      * Remove recording event inside AXPY.
      
      * Get rid of internal primitive caching.
      
      * Stop use PP cache mechanims to store mem and primitive obj.
      * Handler obj store and reuse needed desc & prim
      
      * Do not derive from MKLDNNHandlerT
      799f3861
  7. 18 9月, 2021 1 次提交
    • J
      [oneDNN] Disable caching of Reorder operation (#35664) · e4c2a854
      Jacek Czaja 提交于
      * - REorder disabling caching
      
      * - compilation fix
      
      * - another compilation fix
      
      * - another compilation fix
      
      * - compilation fix
      
      * - Fix
      
      * - yet another compilation fix
      
      * - suppresingly another compilation fix
      
      * - lint
      
      * - fix after review
      
      * - fix
      e4c2a854
  8. 15 9月, 2021 1 次提交
  9. 13 9月, 2021 1 次提交
  10. 10 9月, 2021 1 次提交
  11. 07 9月, 2021 2 次提交
    • J
      [oneDNN] Disable cache matmul v1 & refactoring (#35331) · e9ae8dd0
      Jacek Czaja 提交于
      * - refactoring progressing
      
      - Fix
      
      - compilation fix
      
      - another compilation fix
      
      - refactoring
      
      * - fix
      
      * - compilation fix
      
      * - compilation fix
      
      * - missing set_format
      
      * - compilation fix
      
      * - reverted setting memeory format
      
      * - Brought back format
      
      * - Fix
      
      * - fixes after review
      
      * CI rerun
      
      * CI rerun
      e9ae8dd0
    • J
      Fix for reshape2 oneDNN op (#35455) · 36cdb6e2
      jakpiase 提交于
      * fix for reshape2
      
      * added reviewers sugestions
      36cdb6e2
  12. 01 9月, 2021 1 次提交
    • J
      Added slice BF16/FP32 FWD/BWD kernels (#34332) · 070cab11
      jakpiase 提交于
      * aded slice FWD FP32
      
      * added tests for slice FWD FP32
      
      * added slice bwd
      
      * added bf16 tests
      
      * CI fix
      
      * CI fix
      
      * added reason to skip_if
      
      * minor change
      
      * temporary fix for failing test
      
      * temporary fix
      
      * changes after review
      
      * CI rerun
      070cab11
  13. 26 8月, 2021 1 次提交
    • J
      [oneDNN] disable caching oneDNN primitives in matmul v2, Reduce grad and... · 31f0221f
      Jacek Czaja 提交于
      [oneDNN] disable caching oneDNN primitives in  matmul v2, Reduce grad and elementwise_add grad, expand_v2 (#35132)
      
      * - grad caching disabled of matmul_v1
      
      - compilation fix
      
      - compilation fix
      
      * - reduction removed
      
      * - Matmul v2 disabled caching
      
      * Draft of further changes
      
      * - workaround for reducegrad
      
      * - fixes to UT
      
      * - fix to compilation
      
      * - another fix
      
      * - fix
      31f0221f
  14. 25 8月, 2021 1 次提交
  15. 24 8月, 2021 1 次提交
    • J
      [oneDNN] Concat refactoring and disabling caching (#35002) · d9c0f09b
      Jacek Czaja 提交于
      * - concat refactoring draft
      
      * - cmpilation fixes
      
      * - yet another compilation fix
      
      * - fix
      
      * - compilation fix
      
      * - fixes to compilation
      
      * - another compilation fix
      
      * - fix
      
      * - Added overloaded AcquirePrimitiveDesc for concat
      
      * - fix
      
      * - reserve introduced
      
      * - UT fixes
      
      * - test concat int8 improved
      
      * - fixes
      
      * - fix to crash
      
      * - lint fixes
      
      * - fixes after review
      
      * - some other fixes from review
      d9c0f09b
  16. 23 8月, 2021 1 次提交
  17. 17 8月, 2021 2 次提交
    • C
      Copy boost optional to Paddle (#34780) · 9be41447
      chentianyu03 提交于
      * copy boost optional.hpp to paddle
      
      * copy boost optional.hpp to paddle
      
      * move directions
      
      * del fluid/utils
      
      * modify .hpp to .h
      
      * move directions
      
      * modify to paddle::optional
      
      * add modification description
      
      * format code stype for the files in paddle/utils
      
      * format code stype
      9be41447
    • J
      [oneDNN ] disabling more ops caching (#34830) · f1c1d9e0
      Jacek Czaja 提交于
      * - disabled caching of layer norm
      
      - fix in compilation
      
      - compilation fix
      
      - transpose caching disabled
      
      - compilation fix
      
      - more compilation fixes
      
      - sum caching disabled
      
      - compilation fix
      
      * - LRN with disabled cache
      
      * lint fixes
      f1c1d9e0
  18. 16 8月, 2021 1 次提交
    • J
      [oneDNN] Fix to 34554 (same as previous PR but should build with GPU) (#34859) · 9cb65653
      Jacek Czaja 提交于
      * - Added softmax without caching
      
      * - Binary is no longer manually cached
      
      * - Activation onednn caching removed
      
      * - Removed manual caching of activation
      
      * - modified UT
      
      * - fix
      
      * - fix
      
      * - fixes to building
      
      * - fix
      
      * - fix
      
      * - fix to UT
      
      * - Faulty UT workaround
      
      * - approval workaround
      
      * - Fixes after review
      
      * - compilation fixes
      
      * - more lint fixes
      
      * - more fixes after review
      
      * - fixes after another round of review
      
      * - hopefully compilation fix
      
      - compilation fix
      9cb65653
  19. 12 8月, 2021 1 次提交
  20. 11 8月, 2021 1 次提交
    • J
      [oneDNN] Fix to issue #34554 (#34623) · 0a5c99e8
      Jacek Czaja 提交于
      * - Added softmax without caching
      
      * - Binary is no longer manually cached
      
      * - Activation onednn caching removed
      
      * - Removed manual caching of activation
      
      * - modified UT
      
      * - fix
      
      * - fix
      
      * - fixes to building
      
      * - fix
      
      * - fix
      
      * - fix to UT
      
      * - Faulty UT workaround
      
      * - approval workaround
      
      * - Fixes after review
      
      * - compilation fixes
      
      * - more lint fixes
      
      * - more fixes after review
      
      * - fixes after another round of review
      0a5c99e8
  21. 30 7月, 2021 3 次提交
    • J
      Added matmul_v2 BF16/FP32 BWD kernel (#34192) · 0be71571
      jakpiase 提交于
      * test version of matmul_v2
      
      * added matmul_v2 grad kernel
      
      * minor changes
      
      * minor changes
      
      * minor change for CI approval
      
      * CI fix
      
      * CI fix
      
      * trigger CI
      
      * changes after review, not working yet
      
      * moved ops to anonymous namespaces
      
      * changes after review
      0be71571
    • J
      Added reshape, reshape2, squeeze and squeeze2 BF16/FP32 FWD/BWD kernels (#34219) · 22c4c189
      jakpiase 提交于
      * test version of matmul_v2
      
      * added matmul_v2 grad kernel
      
      * minor changes
      
      * minor changes
      
      * minor change for CI approval
      
      * CI fix
      
      * CI fix
      
      * added squeeze and squeeze2 kernels
      
      * CI fix
      
      * CI fix
      
      * CI fix
      
      * disabled tests when compiled with cuda
      
      * added setting format_tag by strides
      
      * added sigmoid BF16 FWD/BWD and gelu BF16 BWD
      
      * changes after review
      
      * Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD"
      
      This reverts commit 6e3f76720b545abfcff9f6052b46b73a1e745cae.
      
      * Revert "Merge branch 'matmul_v2_grad' into squeeze2_op"
      
      This reverts commit 06fcf67843a4a7884eccdf67a02a03575e1d4cb8, reversing
      changes made to 6e3f76720b545abfcff9f6052b46b73a1e745cae.
      
      * minor change
      
      * added reshape1/2 kernels
      
      * moved some functions into private block
      
      * CI fix
      
      * CI fix
      
      * CI fix
      22c4c189
    • J
      Added expand_v2 BF16/FP32 FWD/BWD kernels (#34284) · 41c4f723
      jakpiase 提交于
      * added expand_v2 bf16/fp32 kernel
      
      * minor change
      
      * CI fix
      
      * added missing test file
      
      * added formatting
      
      * reduced binary size
      
      * CI fix
      41c4f723
  22. 22 7月, 2021 1 次提交
  23. 19 7月, 2021 1 次提交
  24. 07 7月, 2021 1 次提交
  25. 30 6月, 2021 1 次提交
    • J
      Added matmul_v2 BF16/FP32 FWD kernel (#33750) · 24783c84
      jakpiase 提交于
      * added matmul_v2 bf16/fp32 FWD kernel
      
      added matmul_v2 bf16/fp32 FWD kernel
      
      * added formatting
      
      * removed some tests due to timeout in CI
      
      * refactored tests
      
      * merged tests classes into one file
      
      * minor change
      
      * removed test guard for CUDA
      
      * remove skipIf
      
      * changes after review
      
      * formated one file
      
      * minor change
      
      * added skipping UT in CUDA place
      24783c84
  26. 24 6月, 2021 1 次提交
  27. 23 6月, 2021 1 次提交
    • J
      Added split op bf16/fp32 oneDNN kernel (#33584) · 68106509
      jakpiase 提交于
      * base changes for split op
      
      * 90% of split functionality added
      
      * full fp32 functionality
      
      * added bf16 test
      
      * added submemory caching
      
      * added bf test to static mode whitelist
      
      * minor change
      
      * enabled split op for inference
      
      * minor fix
      
      * minor fix
      68106509
  28. 21 6月, 2021 1 次提交
    • L
      Add AXPY oneDNN handler (#33632) · 773aabc7
      lidanqing 提交于
      * Add oneDNN AXPY handler.
      
      * Add fallback for small tensors.
      
      * Fix ifdefs
      
      * Remove unnecessary namespace prefixes and add missing headers.
      
      * Guard handler_axpy with proper ifdefs.
      
      * Compilation of this function is possible only when Paddle is not build
      with CUDA nor HIP.
      
      * Move AXPY handler code to separate files.
      
      * Use oneDNN AXPY handler in SGD op.
      
      * Use axpy handler only when Paddle is built with oneDNN.
      
      * Add test for SUM BF16 with big rows.
      
      * Fix SFINAE rules for elementwise_add_to.
      
      * Add test case for SGD with big rows.
      
      * update
      
      * update
      Co-authored-by: NAdam Osewski <adam.osewski@intel.com>
      773aabc7
  29. 16 6月, 2021 1 次提交
  30. 27 5月, 2021 1 次提交
  31. 26 5月, 2021 1 次提交
  32. 25 5月, 2021 1 次提交
  33. 22 5月, 2021 1 次提交
    • J
      Added oneDNN matmul grad BF16/FP32 kernel (#32968) · e2a3a6f7
      jakpiase 提交于
      * added support for most matmul cases
      
      * added more functionality
      
      * full functionality of matmul op, fp32 only
      
      * added bf16 tests and functionality
      
      * added formatting
      
      * changes after review
      
      * minor change
      
      * added reviewers suggestions
      e2a3a6f7
  34. 19 5月, 2021 1 次提交
  35. 14 5月, 2021 1 次提交
  36. 28 4月, 2021 1 次提交