1. 24 10月, 2022 3 次提交
  2. 21 10月, 2022 1 次提交
  3. 20 10月, 2022 3 次提交
  4. 19 10月, 2022 5 次提交
  5. 18 10月, 2022 3 次提交
    • W
      Merge layernorm trt fuse (#46320) · 5e9f491e
      Wang Bojun 提交于
      * first version, accuracy corrected
      
      * disable debug print
      
      * use blockReduceSum in phi
      
      * add UT
      
      * add opCompat
      
      * code style
      
      * code refine
      
      * bug fix
      
      * code refine
      
      * test fix
      
      * bugfix
      
      * codesytle fix
      
      * code style
      
      * code-style
      
      * code-style
      
      * code-style
      5e9f491e
    • S
      FC + activation fuse passes (#45183) · b7a23adb
      Sławomir Siwek 提交于
      * git
      
      * style
      
      * leave default relu in kernel
      
      * style
      
      * cleanup FCMKLDNN pattern
      
      * merge conflicts
      
      * update develop
      
      * update develop
      
      * add const
      
      * rename to oneDNN and adjust attributes
      
      * whitespace
      b7a23adb
    • Z
      [code-gen] Support code-gen for opmaker of sparse op (#46993) · bdd3dde3
      zyfncg 提交于
      * support generating code of opmaker for backward op invoke forward op
      
      * gsupport code-gen of opmaker for sparse op
      
      * refind logic of choose phi kernrel
      
      * fix complie budg
      
      * fix code_gen bug
      
      * fix bug
      
      * fix kernel signature code-gen
      
      * fix complie bug of VarType
      
      * fix complie bug of VarType
      
      * fix test_sparse_conv_op
      
      * fix test_sparse_norm_op
      bdd3dde3
  6. 17 10月, 2022 7 次提交
  7. 16 10月, 2022 1 次提交
  8. 14 10月, 2022 1 次提交
  9. 13 10月, 2022 5 次提交
    • Y
      Fix quantize model deploy bugs when using MKLDNN (#45920) · 561fd8c8
      yeliang2258 提交于
      * fix immutable op quantize bugs
      
      * fix
      
      * fix build bug
      
      * fix test
      
      * notest,test=inference
      
      * fix ppyoloe acc drop bugs
      
      * fix test
      
      * fix test
      
      * add test
      
      * fix
      
      * fix
      
      * fix test
      
      * fix refined name bug
      
      * fix test
      
      * bias fix
      
      * fix matmul weight dequant bug
      
      * re-ci
      
      * fix tester
      
      * fix test
      
      * fix tester
      
      * update weight dequantize func
      
      * update code
      
      * update test for converage
      
      * update test
      
      * update cmake
      
      * update cmakelist
      
      * update code
      
      * rerun ci
      
      * remove useless code
      561fd8c8
    • Y
      fix bugs (#46951) · 20335b7c
      YuanRisheng 提交于
      20335b7c
    • L
      [new-exec] remove variable scope, stage2 (#43936) · 1230a3f4
      Leo Chen 提交于
      * remove class ScopeBase
      
      * reopen test
      1230a3f4
    • H
      [Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (#46606) · ef1c8759
      HongyuJia 提交于
      * remove PADDLE_WITH_MKLDNN, test white_list=abs
      
      * fix unique_ptr
      
      * fix op.Type()
      
      * remove TODO in kernel_dispatch.h
      
      * remove IndicateVarDataType function, update white_list
      
      * remove mkldnn hard code
      
      * add comments
      
      * fix ==
      
      * update mkldnn_op_list
      
      * delete hard code of OPs
      
      * update mkldnn_op_list
      
      * update mkldnn_op_list, remove interp
      
      * add error check for ExecutionContext
      
      * update mkldnn_op_list, remove transpose2_grad
      
      * remove interpolate mkldnn
      
      * remove fill_constant mkldnn
      
      * opt HasAttr in DygraphExecutionContext
      
      * deprecated commit, test mkldnn_white_list
      
      * deprecated commit, test mkldnn_white_list
      
      * deprecated commit, test mkldnn_black_list
      
      * update mkldnn_op_list, add assert error op
      
      * solve cudnn related op
      
      * fix error
      
      * add mkldnn fallback in phi_utils.cc
      
      * remove mkldnn fallback in phi_utils.cc
      
      * opt code implementation
      
      * polish Copyright License
      ef1c8759
    • J
      Add unsigned int8 scale propagation (#46378) · c72b3bfa
      joanna.wozna.intel 提交于
      * Add unsigned int8 propagation
      
      * Add or modify unit tests
      
      * Correct concat scale checking
      
      * Apply review suggestions
      
      * Corrections
      c72b3bfa
  10. 12 10月, 2022 5 次提交
  11. 11 10月, 2022 4 次提交
  12. 10 10月, 2022 2 次提交
    • Y
      [PHI]Add RNN yaml (#46812) · ab60fd8b
      YuanRisheng 提交于
      * add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta
      
      * WIP: move rnn kernrl to phi
      
      * Change the code generation to avoid converting from intializer list to tuple of heterogeneous types.
      This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types.
      
      * fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results.
      
      * fix ci bugs
      
      * fix ci bugs
      
      * fix ci bugs
      
      * modify code according comment
      Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
      ab60fd8b
    • L
      reduce time cost on atomic in interpretercore (#46688) · dd3d45de
      Leo Chen 提交于
      * reduce time cost on atomic in interpretercore
      
      * clear code of PrepareAtomic in interpretercore
      
      * refine threadpool cache
      dd3d45de