1. 12 7月, 2023 2 次提交
    • Y
      [ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7
      YangQun 提交于
      * squash pick the poc code
      * fix build after rebase
      * fix int8 conv and fc uts
      * Fix and clean-up Get_SRC_Scale_Memory
      * fix floating point fc uts
      * fix test_analyzer_int8_googlenet
      * test_analyzer_int8_mobilenetv1
      * fix int8 mobilenet v2 and v3
      * fix build error after rebase
      * [oneDNN] rename library version
      * fix conv bias datatype
      * try to fix import error
      * fix rebase error
      * [oneDNN] pack library into python wheel
      * add MKLDNN_SHARED_LIB_3 to env_dict
      * fix test_analyzer_bert
      * fix fill_constant op kernel
      * fix ernie and matmul op ut
      * fix softplus ut
      * fix conv+relu6 fusion ut
      * fix hardswish fusion
      * fix quant+transpose fusion ut
      * fixsgd ut
      * fix int8 matmul with flatten
      * fix fc+scale fusion
      * fix conv/matmul+gelu fusion uts
      * fix rebase error
      * Revert "fix conv/matmul+gelu fusion uts"
      This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
      * upgrade to onednn v3.1
      * remove older version onednn
      * use densetensor::data() for achieving mean and var in layernorm impl
      * comments for atol of integer tests
      * fix clang-format
      * Revert "remove older version onednn"
      This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
      * improve binary handle
      * fix expand kernel
      * Revert "use densetensor::data() for achieving mean and var in layernorm impl"
      * always use forward_inference for conv
      * remove activation scales
      * rollback changes to mkldnn.cmake
      * address comments
      * port changes to dequantize kernel
      * fix merge error
      * fix fused_elementwise_kernel
      * upgrade onednn version to v3.1.1
      * fix some approval error
      * fix error msg format
      * remove old onednn libs
      * try to fix symbolic link issue
      * fix cinn test case segfault
      * do not explicit link test with onednn
      * remove unnecessary changes
      * integrate CINN with onednn v3
      * link with mkldnn project
      * fix cinn build file
      
      ---------
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
      Co-authored-by: Ntianshuo78520a <707759223@qq.com>
      cfa513f7
    • W
      [clang-tidy] enable `readability-container-size-empty` check (#55279) · be3a6fa7
      Wang Xin 提交于
      * [clang-tidy] enable readability-container-size-empty check
      
      * fix test_custom_kernel Failed
      
      * add clang-tid-10 in dockerfile
      
      * add clang-tidy in dockerfile
      
      * fix bug
      be3a6fa7
  2. 07 7月, 2023 5 次提交
  3. 06 7月, 2023 2 次提交
  4. 05 7月, 2023 3 次提交
  5. 04 7月, 2023 2 次提交
  6. 03 7月, 2023 2 次提交
  7. 30 6月, 2023 3 次提交
  8. 29 6月, 2023 4 次提交
    • H
      Refactor build attribute (#54968) · eef38db1
      hong 提交于
      * update
      
      * refactor build context
      
      * fix bug
      
      * polish code
      
      * change func name
      eef38db1
    • [CodeStyle][CINN] format cpp code via clang-format (#54961) · af127342
      张经纬 提交于
      * fix clang-format
      
      * 'fix_clang-format'
      
      * fix remaining errors
      
      * format
      
      * empty commit, re-trigger all ci
      
      * empty commit, re-trigger all ci
      
      ---------
      Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
      af127342
    • H
      Refactor op info parser (#54859) · f18d538b
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * update
      
      * update
      
      * fix bug
      
      * split file
      
      * fix conflict
      
      * polish code and fix conflict
      
      * support place transformer
      
      * finish bug
      
      * add gpu flags
      
      * fix with cuda macro
      
      * add fetch kernel
      
      * support fetch var in new ir
      
      * fix bug
      
      * polish code
      
      * change array equal to np.testing
      
      * support feed in new ir
      
      * update
      
      * fix bug
      
      * try to hack combine op
      
      * add scope guard
      
      * revert atan2 op
      
      * add scope guard
      
      * update
      
      * polish code
      
      * update
      
      * refactor build kernel context
      
      * fix unitest bug
      
      * polish code
      
      * use original order
      
      * remove useless code
      
      * polish code
      
      * fix bug
      f18d538b
    • W
      [XPU]add layer_norm fuse pass (#54930) · b94b3ac0
      wz1qqx 提交于
      b94b3ac0
  9. 28 6月, 2023 2 次提交
  10. 27 6月, 2023 7 次提交
    • W
      [Paddle Inference]Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass,… (#54861) · e49c17d2
      Wangzheee 提交于
      * Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass, embedding_eltwise_layernorm_fuse_pass
      e49c17d2
    • R
      fix compiler error (#54883) · 8dc97857
      risemeup1 提交于
      8dc97857
    • X
      add xpu_optimize_cachekv_initialization_pass (#54809) · 610a47dd
      xinxinZi 提交于
      610a47dd
    • commit (#54894) · 70288456
      周周周 提交于
      70288456
    • 6
      Code merge | Merge CINN into Paddle (#54749) · 67c69dca
      6clc 提交于
      * feat(cmake): add cmake of cinn
      
      * feat(cmake): add cmake of cinn python test
      
      * feat(cmake): add jit
      
      * feat(cmake): test/CMakeList.txt
      
      * feat(cmake): rebase to develop
      
      * feat(cmake): remove some flags
      
      * fix(cmake): fix cinn's gflags depends
      
      * feat(cmake): add ci scripts of cinn
      
      * feat(cmake): copy code of cinn
      
      * fix(cmake): fix cinn third_party model path
      
      * gflags dynamic dependce
      
      * fix ci build_demo
      
      * tmp update to c++17 of cinn-only test
      
      * fix cinn only with c++17
      67c69dca
    • Z
      delete_assign_op_pass (#54887) · 813266a2
      zhupengyang 提交于
      813266a2
    • H
      New ir support data transfer (#54763) · b58869fa
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * update
      
      * update
      
      * fix bug
      
      * split file
      
      * fix conflict
      
      * polish code and fix conflict
      
      * support place transformer
      
      * finish bug
      
      * add gpu flags
      
      * fix with cuda macro
      
      * update
      
      * add scope guard
      
      * polish code
      b58869fa
  11. 26 6月, 2023 6 次提交
    • H
      Support feed op new ir (#54840) · 1e323137
      hong 提交于
      * add fetch kernel
      
      * support fetch var in new ir
      
      * fix bug
      
      * polish code
      
      * change array equal to np.testing
      
      * support feed in new ir
      
      * fix bug
      
      * try to hack combine op
      
      * add scope guard
      
      * revert atan2 op
      
      * polish code
      1e323137
    • X
      [XPU] support xpu runtime profiler: follow up (#54690) · 9c3f4b13
      XiaociZhang 提交于
      * [XPU] support xpu runtime profiler: follow up
      
      * fix compile issue
      9c3f4b13
    • W
      add squeeze2+matmul pass (#54779) · f1c8d3fa
      wz1qqx 提交于
      f1c8d3fa
    • S
      remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85
      Sonder 提交于
      * remove ops from OpsWithFluidKernelNeedMoveToPhi set
      
      * open static build flag
      
      * OpsWithFluidKernelNeedMoveToPhi
      
      * open new_executor_static_build
      
      * add infermate for cudnn_lstm
      
      * fix
      
      * update
      
      * fix
      
      * update
      
      * update
      
      * update
      
      * fix pow2 decay
      
      * fix pow2 decay
      
      * recover analysis_predictor.cc
      
      * fix pow2 decay
      
      * fix cudnn lstm
      
      * add output register info for svd
      
      * fix pow2_decay_with_linear_warmup_kernel
      
      * recover test lstm cudnn
      
      * recover svg register codes
      
      * fix register info
      
      * fix reduce sum register info
      
      * add output info for adadelta
      
      * add output info for adadelta
      
      * add output info for adamax
      
      * fix complex abs register info
      
      * add register info for cudnn_lstm_grad
      
      * recover
      
      * fix lstm cudnn
      
      * fix
      
      * fix xpu output registe info
      
      * remove std::cout
      
      * add backend
      
      * remove output info in pow2_decay_with_linear_warmup_kernel
      
      * add judgment in TensorShouldBeFakeInitialized
      
      * recover power_
      
      * close new_executor_static_build
      
      * fix set_value_xpu
      733eca85
    • R
      Share workqueue cross-interpretercores (#54780) · 59dd97af
      Ruibiao Chen 提交于
      * Share workqueue cross-interpretercores
      
      * Fix UT
      59dd97af
    • Z
  12. 25 6月, 2023 1 次提交
    • H
      Support fetch in new ir (#54826) · e66beb0b
      hong 提交于
      * add fetch kernel
      
      * support fetch var in new ir
      
      * fix bug
      
      * polish code
      
      * change array equal to np.testing
      e66beb0b
  13. 21 6月, 2023 1 次提交