1. 23 4月, 2020 1 次提交
  2. 11 4月, 2020 1 次提交
    • M
      [DNNL][INT8][FP32] MatMul (#23395) · a63bcf9a
      Michał Gallus 提交于
      * Initial FP32 DNNL MatMul Implementation
      
      * Implement int8 DNNL MatMul
      
      * Unify in-kernel-naming, clean UTs
      
      * MatmuL: Introduce op caching
      
      * Final adjustments
      
      test=develop
      
      * Remove dy_graph disablement
      
      test=develop
      
      * Change dnnl header name to new one
      
      test=develop
      
      * Contrain multi head check to prevent fails
      
      test=develop
      
      * Resolve dnnl header problems on MAC CI
      
      * Variable namings to kernel and skip_grad_ci added
      
      test=develop
      
      * Prevent MAC CI from failing
      
      * Prevent windows build from failing
      
      test=develop
      
      * Modify UTs to conform to the rules
      
      * Modify MatMul aux functions namings
      
      test=develop
      a63bcf9a
  3. 07 1月, 2020 1 次提交
  4. 06 1月, 2020 1 次提交
  5. 05 1月, 2020 1 次提交
  6. 10 12月, 2019 1 次提交
    • A
      MKL-DNN 1.0 Update (#20162) · e81f0228
      Adam 提交于
      * MKLDNN v1.0 rebase to Paddle 1.6
      test=develop
      
      * Add hacky paddle::string::to_string() implementation
      
      * vectorize<int64-t>() -> vectorize() cleanup
      test=develop
      
      * PADDLE_ENFORCE and void_cast fixes
      test=develop
      
      * Rebase changes
      test=develop
      
      * Cosmetics
      test=develop
      
      * Delete MKL from mkldnn.cmake
      test=develop
      
      * CMake debug commands
      test=develop
      
      * Delete MKLDNN_VERBOSE and rebase fixes
      test=develop
      
      * Rebase fixes
      test=develop
      
      * Temporarily disable int8 resnet101 vgg16 and vgg19 tests
      test=develop
      
      * Add libmkldnn.so.1 to python setup
      test=develop
      
      * Add libmkldnn.so.1 to inference_lib cmake after rebase
      test=develop
      
      * Post rebase fixes + FC int8 changes
      test=develop
      
      * Fix LRN NHWC
      test=develop
      
      * Fix NHWC conv3d
      test=develop
      
      * Windows build fix + next conv3d fix
      test=develop
      
      * Fix conv2d on AVX2 machines
      test=develop
      e81f0228
  7. 07 11月, 2019 1 次提交
  8. 22 10月, 2019 1 次提交
  9. 17 9月, 2019 1 次提交
  10. 14 9月, 2019 1 次提交
  11. 10 9月, 2019 1 次提交
  12. 03 9月, 2019 1 次提交
  13. 30 8月, 2019 1 次提交
    • J
      [MKL-DNN] Fix to face model on AVX512 platforms (#19282) · ecd9f330
      Jacek Czaja 提交于
      - Refactor step 1
      
      - Compilation fix
      
      - Yet another compilation fix
      
      - Even more compilation fix
      
      - Lint fixes
      
      test=develop
      
      - Removed deprectaed PADDLE_ENFORCE occurance
      
      test=develop
      
      - Candidate fix to BN forward
      
      - Lint fixes
      
      test=develop
      
      - Refactoring in data_layout_transform
      
      - compilation fix
      
      - Another comppilation fix
      
      - Step further into darkness
      
      - Yet another compilation fix
      
      - Yet another compilation fix
      
      - missing header
      
      - compilation fix
      
      - Added MKLDNN -> Paddle conversion in fetch op
      
      test=develop
      
      - Compilation fix
      
      test=develop
      
      - Lint
      
      test=develop
      
      - Mul fix
      
      - Fix to MKLDNN MUL op and Elementwise MUL UT
      
      test=develop
      
      - Workaround for diffrent weights with groups representation Paddle vs
        MKL-DNN.
      
      test=develop
      
      - Candidate fix for 5D convolution with groups
      
      - Refactor of fix for conv3d and conv2d in fetch op
      
      test=develop
      
      - Compilation fix
      
      - Still same compilation fix
      
      - Compilation fix
      
      - Compilation fix
      
      - Reverted refactoring of fixes
      
      - Adapted test_conv2d_int8_mkldnn so it exects data in NCHW format
        not NHWC
      
      test=develop
      
      - minor fix in UT
      
      test=develop
      
      - Lint fixes
      
      test=develop
      ecd9f330
  14. 25 7月, 2019 1 次提交
  15. 09 7月, 2019 1 次提交
  16. 03 12月, 2018 1 次提交
  17. 27 11月, 2018 2 次提交
    • M
      EltwiseMul: Extract StringToFormat to MKLDNN helper · 9455be0b
      Michal Gallus 提交于
      test=develop
      9455be0b
    • J
      - conv2d transpose MKL-DNN · fb24690a
      Jacek Czaja 提交于
      test=develop
      
      - Added new header for MKLDNN reuse functionality
      
      - Extended conv2d_transpose GetExpectedKernelType for MKL-DNN supporrt
      
      - Buildable conv transpose mkldnn and conv mkldnn using conv template
      
      - Conv2d transpose roughlt implemented and buildable
      
      - Added modifications conv2d transpose MKLDNN unit tests
      
      - Fix to UT of conv2d transpose mkldnn op
      
      - Wrong type of MKLDNN primitive was chosen for conv2d transpose
      
      - HAcks for conv2d transpose
      
      - UT enalbed
      
      - Replaced copying loop with memcpy
      
      - Draft of passing lambda into AcquireMemory
      
      - Made reorder (IOHW->OIHW) to be called only once
      fb24690a
  18. 01 11月, 2018 1 次提交
  19. 11 9月, 2018 2 次提交
  20. 10 9月, 2018 1 次提交
  21. 21 8月, 2018 1 次提交
    • M
      Fuse Convolution and Eltwise Add into MKLDNN's Conv+Bias (#12669) · cd32ddac
      Michał Gallus 提交于
      * Fuse Convolution and Eltwise Add into Conv+Bias
      
      * Reduce bias branching at conv_mkldnn_op
      
      * Add MKLDNN build checks for Conv Bias
      
      * Conv-bias: check if bias input exist befor assignment
      
      * Conv-bias: Remove Bias dim check from infershape
      
      It was causing conv3d test to crash upon\ncalling HasInput(Bias)
      cd32ddac
  22. 09 8月, 2018 1 次提交
  23. 11 7月, 2018 1 次提交
  24. 30 6月, 2018 1 次提交
  25. 28 6月, 2018 1 次提交
  26. 21 6月, 2018 3 次提交
    • J
      - MKLDNN Softmax Grad Op · 98f3ad3b
      Jacek Czaja 提交于
      - Added hash function inside of MKLDNN softmax op to be used as handle for primitives stroing in a
      context
      
      - Style fixes to softmax mkldnn op
      
      - Fixes after review
      
      - Coding style
      
      - Fix to style
      
      - style fixes
      
      - style fix
      
      - style fixes
      
      - Fix to cody style check
      
      - Rephrasing a comment
      
      fix t obroken merge
      
      Fixes to rebase
      
      Conflicts:
      	benchmark/fluid/models/machine_translation.py
      	cmake/external/mkldnn.cmake
      	paddle/fluid/operators/softmax_mkldnn_op.cc
      
      - Bumped revision of MKL-DNN up to have softmax backward primitive
      
      - Added choosing MKLDNN softmax grad operator
      
      - First reuse of softmax backward
      
      - Reinvented reusing for softmax
      
      - Fix to crash in reinvented reuse
      
      - Clang format fixes
      
      - Clang format fixes
      
      - Improved softmax mkldnn reuse mechanism
      
      - clang format fixes
      
      - Fix to broken merge
      
      - Fix
      98f3ad3b
    • T
      Revert "Merge pull request #11628 from PaddlePaddle/revert-11102-mozga-intel/Sum_mkldnn_layout" · d5fb8fa7
      tensor-tang 提交于
      This reverts commit 4d8e8ee2, reversing
      changes made to d6a9f005.
      d5fb8fa7
    • T
      Revert "MKLDNN layout: Support for sum operator" · 90780e22
      tensor-tang 提交于
      90780e22
  27. 19 6月, 2018 1 次提交
  28. 07 6月, 2018 1 次提交
    • M
      Mkldnn layout (#11040) · 3ff9ba0e
      mozga-intel 提交于
      * Add MKLDNN layout support in Paddle
      
      Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
      can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
      is hardcode to be used in all MKLDNN op kernels. As a result,
      non-optimized execution path is selected in MKLDNN primitive which
      bring worse performance.
      Besides framework change, three MKLDNN OP kernels were updated
      for using new MKLDNN layout. They are conv/pool2d/batch_norm.
      Other MKLDNN OP kernels need be also updated in similar way to
      achieve best performance.
      
      * Add MKLDNN layout support in activation OP
      
      * Don't populate layout from input to output when kMKLDNN in
      
      * Refine pool mkldnn op kernel
      
      * MKLDNN layout
      
      * Remove the inferitance from tensor file
      
      * MKLDNN layout: refactoring
      
      * Remove additional #define to register new operator
      
      * Prepare mkldnn tests to work with layout
      3ff9ba0e
  29. 21 5月, 2018 1 次提交
  30. 17 5月, 2018 1 次提交
    • J
      - Draft of reuse of pooling mkldnn operator · 5f133305
      Jacek Czaja 提交于
      - Finished draft of pooling reusing of operators
      
      - Using gethash in PoolGrad added
      
      - Removed diagnostic
      
      - Added pool mkldnn grad reusing of primitives
      
      - Added diagnostic
      
      - Removed diagnostic
      
      - added dependency to mkldnn data type for pooling mkldnn
      
      - Added mkldnn memory data type determining based on template type of op
      
      - Compilation warning fix
      
      - codying style fixes
      5f133305
  31. 17 4月, 2018 1 次提交
  32. 10 4月, 2018 1 次提交
  33. 23 3月, 2018 2 次提交
  34. 07 3月, 2018 1 次提交
  35. 12 2月, 2018 1 次提交