1. 05 11月, 2021 1 次提交
    • J
      Disable pool&conv_transpose&quantize caching (#36695) · db6c00c4
      Jacek Czaja 提交于
      * - WIP
      
      - compilation fix
      
      - fix
      
      - fixes
      
      - fix
      
      - fix
      
      - fix again
      
      - fix
      
      - another fix
      
      - another compilation fix
      
      - fix
      
      - fix
      
      - fix
      
      - lint
      
      * - pool2d partially stripped from cache
      
      - pool2d partially stripped of caching
      
      * - compilation fix
      
      * - compilation fix
      
      * - Fix to UT of caching
      
      * - Enabling test_conv3d_mkldnn
      
      * - conv_transpose stripped of cache
      
      * - compilation fix
      
      * - fix
      
      * - fix
      
      * - compilation fix
      
      * - fix
      
      * Reverted disabling caching of conv2d
      
      * - compilation fix
      
      * - ut reverted
      db6c00c4
  2. 02 11月, 2021 2 次提交
  3. 27 10月, 2021 1 次提交
  4. 23 10月, 2021 1 次提交
  5. 21 10月, 2021 1 次提交
    • J
      Added matmul_v2+transpose+reshape fuse pass (#36481) · 856cb9c5
      jakpiase 提交于
      * added base changes for matmul_v2+trans+resh fuse pass
      
      * added full matmul_v2+transpose+reshape pass
      
      * removed a file added by mistake
      
      * added reviewers suggestions
      
      * Changed ops type in checking capatibility version
      
      * Deteled one statement
      856cb9c5
  6. 18 10月, 2021 1 次提交
    • J
      Added softplus FP32 FWD OneDNN kernel (#36382) · bdac9ff6
      jakpiase 提交于
      * added softplus
      
      * refactored softplus op
      
      * deleted unnecessary file
      
      * added missing file
      
      * added formatting
      
      * disabled tests if GPU is used
      
      * added reviewer suggestion
      
      * unified softplus kernel
      bdac9ff6
  7. 14 10月, 2021 1 次提交
  8. 13 10月, 2021 1 次提交
  9. 11 10月, 2021 1 次提交
  10. 08 10月, 2021 2 次提交
  11. 07 10月, 2021 1 次提交
  12. 05 10月, 2021 1 次提交
    • J
      Added concat BF16/FP32 BWD OneDNN kernel (#35889) · dc4d5719
      jakpiase 提交于
      * tmp
      
      * added concat BF16/FP32 BWD oneDNN kernel
      
      * minor change
      
      * minor change
      
      * fix for CI
      
      * added formatting
      
      * Reverted deleting static keyword
      
      * added reviewers suggestions
      
      * reverted deleting concat bf16 test file
      
      * fixed concat tests
      dc4d5719
  13. 27 9月, 2021 1 次提交
  14. 24 9月, 2021 1 次提交
  15. 21 9月, 2021 1 次提交
    • A
      Reuse OneDNN handler for SGD and SUM for SelectedRows input tensors. (#35510) · 799f3861
      Adam Osewski 提交于
      * Create stateful OneDNNAXPYHandler object.
      
      This makes it possible to call it multiple times without recreating the
      oneDNN primitives every time.
      
      * Prepare SGDOpKernel to reuse its implementation from OneDNN kernel.
      
      * OneDNN SGD kernel.
      
      * Update call to use new OneDNNAXPYHandler object api.
      
      * Setup seed in proper place.
      
      * Enable OneDNN kernel only for single case.
      
      * For dense param and sparse grad.
      
      * Small refactor.
      
      * Enable oneDNN by op attr or by cmd line flag.
      
      * Use int64_t type for number of elements.
      
      * Support dense param and grad from OneDNN kernel.
      
      * Enable SGD OneDNN kernel when use MP BF16 optimizer.
      
      * Force non-copyable/movable OneDNNAXPYHandler.
      
      * Reuse OneDNNAXPYHandler for spare tensors in SUM op.
      
      * Fix SFINAE rules.
      
      * Remove recording event inside AXPY.
      
      * Get rid of internal primitive caching.
      
      * Stop use PP cache mechanims to store mem and primitive obj.
      * Handler obj store and reuse needed desc & prim
      
      * Do not derive from MKLDNNHandlerT
      799f3861
  16. 18 9月, 2021 1 次提交
    • J
      [oneDNN] Disable caching of Reorder operation (#35664) · e4c2a854
      Jacek Czaja 提交于
      * - REorder disabling caching
      
      * - compilation fix
      
      * - another compilation fix
      
      * - another compilation fix
      
      * - compilation fix
      
      * - Fix
      
      * - yet another compilation fix
      
      * - suppresingly another compilation fix
      
      * - lint
      
      * - fix after review
      
      * - fix
      e4c2a854
  17. 15 9月, 2021 1 次提交
  18. 13 9月, 2021 1 次提交
  19. 10 9月, 2021 1 次提交
  20. 07 9月, 2021 2 次提交
    • J
      [oneDNN] Disable cache matmul v1 & refactoring (#35331) · e9ae8dd0
      Jacek Czaja 提交于
      * - refactoring progressing
      
      - Fix
      
      - compilation fix
      
      - another compilation fix
      
      - refactoring
      
      * - fix
      
      * - compilation fix
      
      * - compilation fix
      
      * - missing set_format
      
      * - compilation fix
      
      * - reverted setting memeory format
      
      * - Brought back format
      
      * - Fix
      
      * - fixes after review
      
      * CI rerun
      
      * CI rerun
      e9ae8dd0
    • J
      Fix for reshape2 oneDNN op (#35455) · 36cdb6e2
      jakpiase 提交于
      * fix for reshape2
      
      * added reviewers sugestions
      36cdb6e2
  21. 01 9月, 2021 1 次提交
    • J
      Added slice BF16/FP32 FWD/BWD kernels (#34332) · 070cab11
      jakpiase 提交于
      * aded slice FWD FP32
      
      * added tests for slice FWD FP32
      
      * added slice bwd
      
      * added bf16 tests
      
      * CI fix
      
      * CI fix
      
      * added reason to skip_if
      
      * minor change
      
      * temporary fix for failing test
      
      * temporary fix
      
      * changes after review
      
      * CI rerun
      070cab11
  22. 26 8月, 2021 1 次提交
    • J
      [oneDNN] disable caching oneDNN primitives in matmul v2, Reduce grad and... · 31f0221f
      Jacek Czaja 提交于
      [oneDNN] disable caching oneDNN primitives in  matmul v2, Reduce grad and elementwise_add grad, expand_v2 (#35132)
      
      * - grad caching disabled of matmul_v1
      
      - compilation fix
      
      - compilation fix
      
      * - reduction removed
      
      * - Matmul v2 disabled caching
      
      * Draft of further changes
      
      * - workaround for reducegrad
      
      * - fixes to UT
      
      * - fix to compilation
      
      * - another fix
      
      * - fix
      31f0221f
  23. 25 8月, 2021 1 次提交
  24. 24 8月, 2021 1 次提交
    • J
      [oneDNN] Concat refactoring and disabling caching (#35002) · d9c0f09b
      Jacek Czaja 提交于
      * - concat refactoring draft
      
      * - cmpilation fixes
      
      * - yet another compilation fix
      
      * - fix
      
      * - compilation fix
      
      * - fixes to compilation
      
      * - another compilation fix
      
      * - fix
      
      * - Added overloaded AcquirePrimitiveDesc for concat
      
      * - fix
      
      * - reserve introduced
      
      * - UT fixes
      
      * - test concat int8 improved
      
      * - fixes
      
      * - fix to crash
      
      * - lint fixes
      
      * - fixes after review
      
      * - some other fixes from review
      d9c0f09b
  25. 23 8月, 2021 1 次提交
  26. 17 8月, 2021 2 次提交
    • C
      Copy boost optional to Paddle (#34780) · 9be41447
      chentianyu03 提交于
      * copy boost optional.hpp to paddle
      
      * copy boost optional.hpp to paddle
      
      * move directions
      
      * del fluid/utils
      
      * modify .hpp to .h
      
      * move directions
      
      * modify to paddle::optional
      
      * add modification description
      
      * format code stype for the files in paddle/utils
      
      * format code stype
      9be41447
    • J
      [oneDNN ] disabling more ops caching (#34830) · f1c1d9e0
      Jacek Czaja 提交于
      * - disabled caching of layer norm
      
      - fix in compilation
      
      - compilation fix
      
      - transpose caching disabled
      
      - compilation fix
      
      - more compilation fixes
      
      - sum caching disabled
      
      - compilation fix
      
      * - LRN with disabled cache
      
      * lint fixes
      f1c1d9e0
  27. 16 8月, 2021 1 次提交
    • J
      [oneDNN] Fix to 34554 (same as previous PR but should build with GPU) (#34859) · 9cb65653
      Jacek Czaja 提交于
      * - Added softmax without caching
      
      * - Binary is no longer manually cached
      
      * - Activation onednn caching removed
      
      * - Removed manual caching of activation
      
      * - modified UT
      
      * - fix
      
      * - fix
      
      * - fixes to building
      
      * - fix
      
      * - fix
      
      * - fix to UT
      
      * - Faulty UT workaround
      
      * - approval workaround
      
      * - Fixes after review
      
      * - compilation fixes
      
      * - more lint fixes
      
      * - more fixes after review
      
      * - fixes after another round of review
      
      * - hopefully compilation fix
      
      - compilation fix
      9cb65653
  28. 12 8月, 2021 1 次提交
  29. 11 8月, 2021 1 次提交
    • J
      [oneDNN] Fix to issue #34554 (#34623) · 0a5c99e8
      Jacek Czaja 提交于
      * - Added softmax without caching
      
      * - Binary is no longer manually cached
      
      * - Activation onednn caching removed
      
      * - Removed manual caching of activation
      
      * - modified UT
      
      * - fix
      
      * - fix
      
      * - fixes to building
      
      * - fix
      
      * - fix
      
      * - fix to UT
      
      * - Faulty UT workaround
      
      * - approval workaround
      
      * - Fixes after review
      
      * - compilation fixes
      
      * - more lint fixes
      
      * - more fixes after review
      
      * - fixes after another round of review
      0a5c99e8
  30. 30 7月, 2021 3 次提交
    • J
      Added matmul_v2 BF16/FP32 BWD kernel (#34192) · 0be71571
      jakpiase 提交于
      * test version of matmul_v2
      
      * added matmul_v2 grad kernel
      
      * minor changes
      
      * minor changes
      
      * minor change for CI approval
      
      * CI fix
      
      * CI fix
      
      * trigger CI
      
      * changes after review, not working yet
      
      * moved ops to anonymous namespaces
      
      * changes after review
      0be71571
    • J
      Added reshape, reshape2, squeeze and squeeze2 BF16/FP32 FWD/BWD kernels (#34219) · 22c4c189
      jakpiase 提交于
      * test version of matmul_v2
      
      * added matmul_v2 grad kernel
      
      * minor changes
      
      * minor changes
      
      * minor change for CI approval
      
      * CI fix
      
      * CI fix
      
      * added squeeze and squeeze2 kernels
      
      * CI fix
      
      * CI fix
      
      * CI fix
      
      * disabled tests when compiled with cuda
      
      * added setting format_tag by strides
      
      * added sigmoid BF16 FWD/BWD and gelu BF16 BWD
      
      * changes after review
      
      * Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD"
      
      This reverts commit 6e3f76720b545abfcff9f6052b46b73a1e745cae.
      
      * Revert "Merge branch 'matmul_v2_grad' into squeeze2_op"
      
      This reverts commit 06fcf67843a4a7884eccdf67a02a03575e1d4cb8, reversing
      changes made to 6e3f76720b545abfcff9f6052b46b73a1e745cae.
      
      * minor change
      
      * added reshape1/2 kernels
      
      * moved some functions into private block
      
      * CI fix
      
      * CI fix
      
      * CI fix
      22c4c189
    • J
      Added expand_v2 BF16/FP32 FWD/BWD kernels (#34284) · 41c4f723
      jakpiase 提交于
      * added expand_v2 bf16/fp32 kernel
      
      * minor change
      
      * CI fix
      
      * added missing test file
      
      * added formatting
      
      * reduced binary size
      
      * CI fix
      41c4f723
  31. 22 7月, 2021 1 次提交
  32. 19 7月, 2021 1 次提交
  33. 07 7月, 2021 1 次提交
  34. 30 6月, 2021 1 次提交
    • J
      Added matmul_v2 BF16/FP32 FWD kernel (#33750) · 24783c84
      jakpiase 提交于
      * added matmul_v2 bf16/fp32 FWD kernel
      
      added matmul_v2 bf16/fp32 FWD kernel
      
      * added formatting
      
      * removed some tests due to timeout in CI
      
      * refactored tests
      
      * merged tests classes into one file
      
      * minor change
      
      * removed test guard for CUDA
      
      * remove skipIf
      
      * changes after review
      
      * formated one file
      
      * minor change
      
      * added skipping UT in CUDA place
      24783c84