1. 28 9月, 2022 1 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
  2. 05 7月, 2022 1 次提交
  3. 26 6月, 2022 1 次提交
  4. 23 5月, 2022 1 次提交
    • J
      [Internal reviewing] NHWC fix to am_vocoder model for oneDNN 2.6 (#42729) · d414af94
      Jacek Czaja 提交于
      * - prototype of reimplemented fixes
      
      * - compilation fixes
      
      * - compilation fix
      
      * - cosmetic info
      
      * - hopefully fix
      
      * - compilation fix
      
      * - supported for nested blocking of cache clearing
      
      * - fix
      
      * - Unit test to changes
      
      * - Compilation fix to windows (hopefully)
      
      * - Moved resetting layout to ResetBlob
      
      * - fixes after review
      d414af94
  5. 09 5月, 2022 1 次提交
  6. 14 4月, 2022 1 次提交
    • J
      Fix to #38693 (minimal UT) (#41026) · d0f3296b
      Jacek Czaja 提交于
      * Add UT
      
      - Added missed data_layout
      
      - Added missing conversions
      
      - NDHWC added
      
      - NDHWC support in data_transform
      
      - another fix
      
      - condddate change
      
      - fix
      
      u- fix
      
      - fix
      
      - fix
      
      - fix
      
      - fix
      
      - fix to hack
      
      - compilation fix
      
      - fix to automatic merge
      
      * - reduced UT
      
      * - fix
      
      * - lint
      
      * - fix to lint
      d0f3296b
  7. 31 3月, 2022 1 次提交
    • L
      [new-exec] fit mkldnn op (#41058) · 02cf6764
      Leo Chen 提交于
      * fix bug that some op has no op_role attr
      
      * add mkldnn support for new executor
      
      * fit for mkldnn data_transfer
      
      * fit for mkldnn data_transfer
      02cf6764
  8. 15 3月, 2022 1 次提交
    • J
      oneDNN NHWC fixes (#40049) · dde9cec0
      Jacek Czaja 提交于
      * - Prototype of third solution
      
      - fix
      
      - compilation fixes
      
      - fix
      
      - fixe
      
      - fix
      
      - fix
      
      - compilation fix
      
      - comment fix
      
      - lint
      
      update mkldnn conv_elementwise_add_fuse_pass ut
      
      - NHWC changes to prelu
      
      - alhpa dims
      
      - UT fix
      
      - fix to UT
      
      - lint
      
      - Some fixes
      
      - added to BWD of prelu NHWC support
      
      - reverted removal of resetting cu_layout in clearing of caching
      
      * - Small changes
      
      * - compilation fix
      
      * - fix
      
      * - fix
      
      * lint
      
      * - fixes after internal review
      
      * - compilation fix
      
      * - lint
      dde9cec0
  9. 20 2月, 2022 1 次提交
  10. 19 2月, 2022 2 次提交
    • A
      [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 提交于
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile sucessfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix tesst file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      2fe04264
    • C
      fix RecordEvent interface (#39675) · 019a552b
      chenjian 提交于
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update operator.cc
      019a552b
  11. 11 2月, 2022 1 次提交
    • J
      Added shape (U)INT8/BF16/FP32 oneDNN kernel (#36033) · 52bbaae9
      jakpiase 提交于
      * added shape oneDNN kernel
      
      * removed unnecessary import from test
      
      * added skipping tests for GPU
      
      * refactoring
      
      * refactored shape kernel
      
      * added tests in new framework
      
      * removed one line
      
      * minor change
      
      * added newline at EOF
      
      * added formatting
      
      * added attributes as extra
      52bbaae9
  12. 31 12月, 2021 1 次提交
  13. 24 11月, 2021 1 次提交
  14. 18 11月, 2021 1 次提交
  15. 17 11月, 2021 1 次提交
  16. 10 11月, 2021 1 次提交
  17. 07 10月, 2021 1 次提交
  18. 15 9月, 2021 1 次提交
  19. 30 8月, 2021 1 次提交
  20. 28 4月, 2021 1 次提交
  21. 09 4月, 2021 1 次提交
  22. 18 2月, 2021 1 次提交
    • J
      Add Conv Transpose BF16 (#30877) · caf9d398
      joanna.wozna.intel 提交于
      * Add conv transpose BF16
      
      * Share function GetWeightsTz
      
      * Adjust to review and fix op compatibility
      
      * Add bias to unique handler name
      
      * Remove errors related to paddle enforce
      
      * Add conv2d_transpose to bf16 list and kernel refator
      caf9d398
  23. 25 1月, 2021 1 次提交
  24. 17 12月, 2020 1 次提交
  25. 14 12月, 2020 1 次提交
  26. 08 12月, 2020 1 次提交
    • J
      added internal and external reorders to profiler (#29443) · 57a4f16d
      jakpiase 提交于
      * added external reorder to profiler
      
      * added external and internal reorders to profiler
      
      * added internal and external reorder to profiler
      
      * added formatting to int/ext reorder commit
      
      * removed unnecessary comment
      57a4f16d
  27. 27 11月, 2020 1 次提交
  28. 23 11月, 2020 1 次提交
  29. 28 10月, 2020 1 次提交
  30. 01 10月, 2020 1 次提交
  31. 14 9月, 2020 1 次提交
  32. 03 9月, 2020 1 次提交
  33. 21 8月, 2020 1 次提交
  34. 08 8月, 2020 1 次提交
  35. 06 8月, 2020 1 次提交
    • A
      Add oneDNN fusion_gru kernel (#25594) · 68c6160e
      Adam 提交于
      * Add oneDNN fusion_gru kernel and fix fc+gru pass
      test=develop
      
      * Formatting changes
      test=develop
      
      * Lint fixes
      test=develop
      
      * Add memory::format_tag::any to GRU weights
      test=develop
      
      * Fix build with CUDA
      
      * Fix build with CUDA v2
      68c6160e
  36. 28 7月, 2020 1 次提交
    • A
      Added DNNL cache management for DyGraph (#25624) · e52df3b1
      arlesniak 提交于
      * Added DNNL cache management for DyGraph
      
      * move FLAGS_use_mkldnn to more general CMakeLists, getu use of the flag in ClearGradients
      
      * missing file
      
      * Fixes after review
      
      * Bringing back original idea of place for 'use_mkldnn' flag to be accessible from platform nad imperative.
      
      * Removed duplicate and added docs
      
      * Fixes for CI
      e52df3b1
  37. 09 7月, 2020 1 次提交
  38. 22 4月, 2020 1 次提交
  39. 11 4月, 2020 1 次提交
    • M
      [DNNL][INT8][FP32] MatMul (#23395) · a63bcf9a
      Michał Gallus 提交于
      * Initial FP32 DNNL MatMul Implementation
      
      * Implement int8 DNNL MatMul
      
      * Unify in-kernel-naming, clean UTs
      
      * MatmuL: Introduce op caching
      
      * Final adjustments
      
      test=develop
      
      * Remove dy_graph disablement
      
      test=develop
      
      * Change dnnl header name to new one
      
      test=develop
      
      * Contrain multi head check to prevent fails
      
      test=develop
      
      * Resolve dnnl header problems on MAC CI
      
      * Variable namings to kernel and skip_grad_ci added
      
      test=develop
      
      * Prevent MAC CI from failing
      
      * Prevent windows build from failing
      
      test=develop
      
      * Modify UTs to conform to the rules
      
      * Modify MatMul aux functions namings
      
      test=develop
      a63bcf9a