1. 11 Oct, 2022 1 commit
    • [cherry-pick] [PHI] relu6_grad kernel (#46501) (#46862) · 2bcbf8b0
      Sławomir Siwek committed
      * [PHI] Migrate gelu kernels (#45596)
      
      * gaussian random
      
      * mkldnn to onednn renaming
      
      * fix merge conflicts
      
      * remove fluid code
      
      * onednn renaming
      
      * gelu fwd
      
      * sort activations
      
      * gelu gradient
      
      * remove unused macros
      
      * merge conflicts
      
      * fix merge conflicts
      
      * remove extra constraint from gelu op
      
      * [PHI] relu6_grad kernel (#46501)
      
      * Relu6
      
      * remove fluid handler
      
      * add individual kernel signature
      
      * coding style
      
      * replace bounded_relu with clip
      
      * whitespace
      
      * code style
  2. 29 Aug, 2022 1 commit
    • [PHI] Migrate relu6 and abs kernels (#45397) · 632bc1f2
      Sławomir Siwek committed
      * abs relu6 fwd
      
      * abs bwd
      
      * gaussian_random_kernel and mkldnn-onednn renaming
      
      * scale kernel
      
      * whitespace
      
      * whitespace
      
      * revert scale migration
      
      * whitespaces
      
      * revert changes to gaussian kernel
      
      * whitespaces
  3. 05 Aug, 2022 1 commit
    • [MKLDNN]Move mkldnn activation kernel to phi (#44365) · 2dfa88d2
      YuanRisheng committed
      * move mkldnn activation kernel
      
      * fix compile bugs
      
      * fix compile bugs
      
      * deal with conflict
      
      * fix compile bugs
      
      * fix windows compile bugs
      
      * mkldnn unittest fix
      
      * change mutable to alloc
      
      * fix unittest bugs
      
      * modify code according to comment
  4. 14 Jul, 2022 1 commit
    • [Phi]Improve the mechanism for mkldnn kernel in PHI (#43941) · e9b4d0be
      YuanRisheng committed
      * adapt mkldnn kernel in PHI
      
      * fix ci compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * delete comment
      
      * fix compile bugs in windows-inference
      
      * delete code for coverage
      
      * modify code by review
      
      * modify code by review
      
      * add todo
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix unittest bugs
  5. 26 Jun, 2022 1 commit
  6. 05 Jun, 2022 1 commit
  7. 19 May, 2022 1 commit
  8. 21 Apr, 2022 1 commit
  9. 23 Mar, 2022 1 commit
  10. 22 Feb, 2022 1 commit
  11. 20 Feb, 2022 1 commit
  12. 18 Jan, 2022 2 commits
    • Mish FP32/BF16 kernel, conv and fc fuse passes (#38623) · 1d18bc2c
      Sławomir Siwek committed
      * Mish
      
      * Change exp() library
      
      * mish fuse pass
      
      * mish attrs
      
      * fixes
      
      * mishop maker
      
      * remove attrs
      
      * mish kernel for bf16
      
      * fc+mish fuse
      
      * fix code format error
      
      * Resolve merge conflicts
      
      * Update mish operator version
      
      * update mish variable to new naming convention
    • [Unify Tensors PR #8] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3
      Zhanlue Yang committed
      * Merged LoDTensor with Tensor,test=allcases
      
      * Patched python level LoDTensor
      
      * Patched python level LoDTensor
      
      * Merge Tensor into DenseTensor
      
      * Fixed namespace issues,test=allcases
      
      * Fixed merge issues
      
      * Fixed inference issues
      
      * Fixed NPU test issues
      
      * Fixed merge issues
  13. 06 Jan, 2022 1 commit
  14. 04 Jan, 2022 1 commit
  15. 24 Nov, 2021 1 commit
  16. 22 Nov, 2021 1 commit
  17. 16 Nov, 2021 1 commit
  18. 18 Oct, 2021 1 commit
    • Added softplus FP32 FWD OneDNN kernel (#36382) · bdac9ff6
      jakpiase committed
      * added softplus
      
      * refactored softplus op
      
      * deleted unnecessary file
      
      * added missing file
      
      * added formatting
      
      * disabled tests if GPU is used
      
      * added reviewer suggestion
      
      * unified softplus kernel
  19. 08 Oct, 2021 1 commit
  20. 16 Aug, 2021 1 commit
    • [oneDNN] Fix to 34554 (same as previous PR but should build with GPU) (#34859) · 9cb65653
      Jacek Czaja committed
      * - Added softmax without caching
      
      * - Binary is no longer manually cached
      
      * - Activation onednn caching removed
      
      * - Removed manual caching of activation
      
      * - modified UT
      
      * - fix
      
      * - fix
      
      * - fixes to building
      
      * - fix
      
      * - fix
      
      * - fix to UT
      
      * - Faulty UT workaround
      
      * - approval workaround
      
      * - Fixes after review
      
      * - compilation fixes
      
      * - more lint fixes
      
      * - more fixes after review
      
      * - fixes after another round of review
      
      * - hopefully compilation fix
      
      * - compilation fix
  21. 12 Aug, 2021 1 commit
  22. 11 Aug, 2021 1 commit
    • [oneDNN] Fix to issue #34554 (#34623) · 0a5c99e8
      Jacek Czaja committed
      * - Added softmax without caching
      
      * - Binary is no longer manually cached
      
      * - Activation onednn caching removed
      
      * - Removed manual caching of activation
      
      * - modified UT
      
      * - fix
      
      * - fix
      
      * - fixes to building
      
      * - fix
      
      * - fix
      
      * - fix to UT
      
      * - Faulty UT workaround
      
      * - approval workaround
      
      * - Fixes after review
      
      * - compilation fixes
      
      * - more lint fixes
      
      * - more fixes after review
      
      * - fixes after another round of review
  23. 22 Jul, 2021 1 commit
  24. 16 Jun, 2021 1 commit
  25. 25 Feb, 2021 1 commit
  26. 25 Jan, 2021 1 commit
  27. 20 Jan, 2021 1 commit
  28. 12 Jan, 2021 1 commit
  29. 06 Nov, 2020 1 commit
  30. 24 Sep, 2020 1 commit
    • use iwyu clean include (#27267) · df43905f
      wanghuancoder committed
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
  31. 02 Sep, 2020 1 commit
  32. 10 Aug, 2020 1 commit
  33. 28 Jul, 2020 1 commit
  34. 26 May, 2020 1 commit
    • Update PADDLE_ENFORCE in DNNL related ops (#24333) · c3c61d34
      lidanqing committed
      * Update PADDLE_ENFORCE in DNNL related ops
      test=develop
      
      * Abstract macro of OP_GET_PLACE_CHECK
      test=develop
      
      * update according to reviews
      
      * update GET_PLACE_CPU_CHECK
      
      * fix typo
      test=develop
      
      * revert macro
      test=develop
  35. 24 Apr, 2020 1 commit
  36. 07 Apr, 2020 1 commit
  37. 02 Apr, 2020 1 commit
  38. 19 Mar, 2020 1 commit
  39. 10 Dec, 2019 1 commit
    • MKL-DNN 1.0 Update (#20162) · e81f0228
      Adam committed
      * MKLDNN v1.0 rebase to Paddle 1.6
      test=develop
      
      * Add hacky paddle::string::to_string() implementation
      
      * vectorize<int64_t>() -> vectorize() cleanup
      test=develop
      
      * PADDLE_ENFORCE and void_cast fixes
      test=develop
      
      * Rebase changes
      test=develop
      
      * Cosmetics
      test=develop
      
      * Delete MKL from mkldnn.cmake
      test=develop
      
      * CMake debug commands
      test=develop
      
      * Delete MKLDNN_VERBOSE and rebase fixes
      test=develop
      
      * Rebase fixes
      test=develop
      
      * Temporarily disable int8 resnet101 vgg16 and vgg19 tests
      test=develop
      
      * Add libmkldnn.so.1 to python setup
      test=develop
      
      * Add libmkldnn.so.1 to inference_lib cmake after rebase
      test=develop
      
      * Post rebase fixes + FC int8 changes
      test=develop
      
      * Fix LRN NHWC
      test=develop
      
      * Fix NHWC conv3d
      test=develop
      
      * Windows build fix + next conv3d fix
      test=develop
      
      * Fix conv2d on AVX2 machines
      test=develop