1. 08 9月, 2023 1 次提交
  2. 29 8月, 2023 1 次提交
  3. 09 8月, 2023 1 次提交
  4. 04 8月, 2023 1 次提交
  5. 03 8月, 2023 1 次提交
  6. 31 7月, 2023 1 次提交
    • Z
      rename BatchNormGradFunctor (#55717) · eee4b8fb
      zhangyuqin1998 提交于
      * rename BatchNormGradFunctor
      
      * Update batch_norm_grad_kernel.cc
      
      * Update batch_norm_grad_kernel.cu
      
      * Update batch_norm_grad_kernel.cc
      
      * fix
      
      * Update batch_norm_grad_kernel.cc
      eee4b8fb
  7. 26 7月, 2023 1 次提交
  8. 19 7月, 2023 1 次提交
    • Z
      delete relu6_raw (#55383) · 56d46ccc
      zhangyuqin1998 提交于
      * delete relu6_raw
      
      * fix codestyle
      
      * Update test_mkldnn_matmul_activation_fuse_pass.py
      
      * fix
      
      * Update backward.yaml
      
      * Update ops.yaml
      
      * Update backward.yaml
      56d46ccc
  9. 12 7月, 2023 2 次提交
    • Y
      [ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7
      YangQun 提交于
      * squash pick the poc code
      * fix build after rebase
      * fix int8 conv and fc uts
      * Fix and clean-up Get_SRC_Scale_Memory
      * fix floating point fc uts
      * fix test_analyzer_int8_googlenet
      * test_analyzer_int8_mobilenetv1
      * fix int8 mobilenet v2 and v3
      * fix build error after rebase
      * [oneDNN] rename library version
      * fix conv bias datatype
      * try to fix import error
      * fix rebase error
      * [oneDNN] pack library into python wheel
      * add MKLDNN_SHARED_LIB_3 to env_dict
      * fix test_analyzer_bert
      * fix fill_constant op kernel
      * fix ernie and matmul op ut
      * fix softplus ut
      * fix conv+relu6 fusion ut
      * fix hardswish fusion
      * fix quant+transpose fusion ut
      * fixsgd ut
      * fix int8 matmul with flatten
      * fix fc+scale fusion
      * fix conv/matmul+gelu fusion uts
      * fix rebase error
      * Revert "fix conv/matmul+gelu fusion uts"
      This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
      * upgrade to onednn v3.1
      * remove older version onednn
      * use densetensor::data() for achieving mean and var in layernorm impl
      * comments for atol of integer tests
      * fix clang-format
      * Revert "remove older version onednn"
      This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
      * improve binary handle
      * fix expand kernel
      * Revert "use densetensor::data() for achieving mean and var in layernorm impl"
      * always use forward_inference for conv
      * remove activation scales
      * rollback changes to mkldnn.cmake
      * address comments
      * port changes to dequantize kernel
      * fix merge error
      * fix fused_elementwise_kernel
      * upgrade onednn version to v3.1.1
      * fix some approval error
      * fix error msg format
      * remove old onednn libs
      * try to fix symbolic link issue
      * fix cinn test case segfault
      * do not explicit link test with onednn
      * remove unnecessary changes
      * integrate CINN with onednn v3
      * link with mkldnn project
      * fix cinn build file
      
      ---------
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
      Co-authored-by: Ntianshuo78520a <707759223@qq.com>
      cfa513f7
    • W
      [clang-tidy] enable `readability-container-size-empty` check (#55279) · be3a6fa7
      Wang Xin 提交于
      * [clang-tidy] enable readability-container-size-empty check
      
      * fix test_custom_kernel Failed
      
      * add clang-tid-10 in dockerfile
      
      * add clang-tidy in dockerfile
      
      * fix bug
      be3a6fa7
  10. 27 6月, 2023 1 次提交
  11. 20 6月, 2023 1 次提交
  12. 09 6月, 2023 1 次提交
  13. 05 6月, 2023 2 次提交
  14. 02 6月, 2023 1 次提交
  15. 01 6月, 2023 1 次提交
  16. 24 5月, 2023 1 次提交
  17. 23 5月, 2023 2 次提交
  18. 19 5月, 2023 2 次提交
  19. 18 5月, 2023 1 次提交
    • H
      Fused elementwises kernels and ops (#51427) · fb4a6ecf
      Hulek 提交于
      * Fused elementwises kernels and ops
      
      * change fuse pass name
      
      * adjust .pbtxt files
      
      * adjust quantization attributes
      
      * add missing arguments and fix others, review fixed
      
      * simplify fused kernel registration
      
      * fix elementwise unit tests
      
      * reuse one fused elementwise op
      
      * adjust proto
      
      * Add supported datatypes
      
      * Change 'Scale' to 'scale' in tests, change some tests to onednn
      
      * Revert breaking changes
      
      * Fix unit tests
      
      * Delete obsolete test cases
      
      * Delete commented out code
      
      * Fix codestyle
      
      * delete temporary condition
      
      * fix conflicts and delete duplicate fusing
      
      * Fix code after merge
      
      * Move tests to new directory
      
      * fix tests volatility
      
      * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py
      
      * Update CMakeLists.txt add mkldnn op test
      
      ---------
      Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
      fb4a6ecf
  20. 15 5月, 2023 3 次提交
  21. 11 5月, 2023 1 次提交
  22. 26 4月, 2023 1 次提交
  23. 24 4月, 2023 2 次提交
  24. 17 4月, 2023 1 次提交
  25. 14 4月, 2023 2 次提交
  26. 13 4月, 2023 2 次提交
    • H
      [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26
      HongyuJia 提交于
      * [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h
      
      * Add logging.h for profiler.cc
      
      * Add logging.h for gloo_utils.h
      
      * Add logging.h for addmm_kernel_impl.h
      
      * Add logging.h for addmm_grad_kernel_impl.h
      
      * Add logging.h for p_send_kernel.cu
      
      * Add logging.h for determinant_grad_kernel_impl.h
      
      * Add logging.h for p_recv_kernel.cu
      
      * Add logging.h for elementwise_grad_base.h
      
      * Add logging.h for transfer_layout_kernel.cc
      
      * Add logging.h for eigvals_kernel.cc and index_select_impl.h
      
      * Add logging.h for all files in kernel directory
      
      * Add logging.h for xpu_info.cc
      
      * Add logging.h for xpu
      5664ea26
    • Z
      rename_bilinear_tensor_op (#52745) · eb93b5c9
      zhangyuqin1998 提交于
      eb93b5c9
  27. 06 4月, 2023 1 次提交
    • S
      Remove oneDNN-specific attributes from matmul (#49444) · 4d97b25d
      Sławomir Siwek 提交于
      * replace matmul with matmul_v2 in fuse passes
      
      * Remove fusion logic from matmul
      
      * removing fusion methods
      
      * add proper name
      
      * adjust namespaces
      
      * clean attrs in python tests
      
      * delete checkpoint and restore matmul version
      
      * remove unused code
      
      * matmul and reshape/transpose fuses migrated
      
      * split MatmulOneDNN headers
      
      * fuse activation and eltwise_add
      
      * add fuse_activation
      
      * matmul_transpose_reshape/reshape_transpose_matmul
      
      * matmul + elementwise_add (fused)
      
      * activation temporary modifciation
      
      * restore matmul(v1) version 0
      
      * merge newest develop
      
      * remove depedency from other PR
      
      * revert pbtxt
      
      * remove placeholders from matmul_v2
      
      * add description in OPMaker
      
      * remove matmul_v2_op.h and all depedencies
      
      * remove dims changing in base op
      
      * add possibility to fuse already fused_matmul
      
      * restart broken CI
      
      * Empty-Commit
      
      * revert matmul_utils.h
      
      * codestyle
      
      * adjust imports
      
      * add pbtxt file
      
      * 100% matmul unit tests coverage
      
      * trigger CI with minimal changes to develop
      
      * adjust changes to develop
      
      * add fused_matmul op
      
      * inherit base ops
      
      * add "v2"
      
      * move OPMaker
      
      * Gradually add fused_matmul files
      
      * second batch of fused_matmul changes
      
      * split infershapes of matmul_v2 and fused_matmul
      
      * merge code from other PR
      
      * 2023
      
      * inherit fused_matmul from matmul_v2
      
      * Update paddle/phi/backends/onednn/onednn_reuse.h
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      
      * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      
      * resolve conflicts
      
      * codestyle
      
      * simplify isgemmlinear
      
      * 2023
      
      * remove import
      
      * reuse methods
      
      * matmul_v2_mkldnn cleanup
      
      * simplify ExecuteMatMulV1Grad
      
      * matmul refactored
      
      * fc
      
      * SetOutMemDescWithLogicalLayoutFusesSupport
      
      * matmul_v2
      
      * alpha support
      
      * group repetetive funcs
      
      * matmul utils
      
      * execute matmul methods
      
      * restore registered kernel names
      
      * split header and impl files
      
      * remove double negatives
      
      * reduce numer of modified files
      
      * adjust ExecuteMatmul
      
      * add scales for ut
      
      * dates
      
      * limit number of modified files
      
      * fluid imports
      
      * remove alpha
      
      * codestyle
      
      ---------
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      4d97b25d
  28. 04 4月, 2023 1 次提交
    • R
      Improve new executor static build (#51149) · 5bac67d4
      Ruibiao Chen 提交于
      * Improve new executor static build
      
      * Skip GC for static build
      
      * Skip infershape for static build
      
      * Handle read_op
      
      * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix argsort typos
      
      * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix skip share lod errors
      
      * Fix errors for adam
      
      * Fix errors for eigvals, memcpy and fake_quantize
      
      * Add static_build.cc
      
      * Add black list
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix TensorArray
      
      * Fix TensorArray
      
      * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel
      
      * Fix copy
      
      * Fix errors
      
      * Fix momentum
      
      * Skip mkldnn
      
      * Fix CI errors
      
      * Fix c_sync_calc_stream_op
      
      * Fix CINN
      
      * Fix while op
      
      * All CI pass, disable FLAGS to merge code, enable it after more tests in future
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add mkldnn UT
      
      * Remove mkldnn test
      
      * Fix typos
      
      * Fix dist test
      
      * Fix typos
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add sparse tests
      
      * ToComplexType -> ToComplex
      
      * Add test_matmul_op_static_build to disable_win_inference_test
      5bac67d4
  29. 29 3月, 2023 1 次提交
    • Z
      [AMP OP&Test] pad3d add unittests of fp16 and bf16 (#51015) · f86d0be7
      zengshao0622 提交于
      * pad3d add unittests of fp16 and bf16
      
      * pad3d add unittests of fp16 and bf16
      
      * fix cuda place
      
      * fix random to uniform
      
      * fix class name
      
      * fix fp16 max relative error to 1.5e-3
      
      * add dytpe register for onednn
      
      * add pad uint16 check of common.py
      
      * remove check_eager
      
      * test_check_grad --> test_check_grad_normal
      f86d0be7
  30. 27 3月, 2023 2 次提交
    • X
      2c1d494e
    • S
      Fused elementwise_(mul/div) (#50428) · 968f7f24
      Sławomir Siwek 提交于
      * extract Op and OPMaker to .h
      
      * extend pattern for fused_op
      
      * set "with_residual" default to false
      
      * adjust fuse passes
      
      * remove fc+eltwise flag
      
      * fused_output_scale
      
      * activation attrs
      
      * remove extra attrs
      
      * fix int8/bf16 unit tests
      
      * simplify RecomputeOutputDims
      
      * remove unused method
      
      * Add description for attributes
      
      * add extra check
      
      * adjust op compats
      
      * update quantize test
      
      * fix protobuf parsing error
      
      * fix int8 performance
      
      * fused elementwises
      
      * merge develop
      
      * remove activation
      
      * restore activation for existing add/sub ops
      968f7f24