1. 09 12月, 2022 4 次提交
  2. 08 12月, 2022 1 次提交
  3. 07 12月, 2022 3 次提交
  4. 06 12月, 2022 2 次提交
    • Z
      Clear extra input (Bias, ResidualData) in OpMaker of conv2d (#47579) · 0a2dfa38
      zyfncg 提交于
      * delete Bias and ResidualData in OpMaker of conv2d
      
      * delete extra input of conv3d
      
      * refactor pass of conv_bias_fusion
      
      * fix mkldnn dependency
      
      * fix mkldnn compile
      
      * fix test_conv_bias_mkldnn_fuse_pass
      
      * police some code
      
      * remove useless log
      
      * fix analyzer_vit_ocr_tester
      
      * fix conv_activation_mkldnn_fuse_pass
      
      * fix test_analyzer_ocr
      
      * add fused_conv_sig
      
      * fix performence regression
      
      * fix performance regression
      0a2dfa38
    • S
      [PHI] Migrate elementwise_(add/mul) kernels (#48625) · 7575d37c
      Sławomir Siwek 提交于
      * remove fluid code
      
      * init
      
      * typo
      
      * fix merge conflicts
      7575d37c
  5. 05 12月, 2022 8 次提交
  6. 02 12月, 2022 3 次提交
  7. 01 12月, 2022 2 次提交
  8. 30 11月, 2022 8 次提交
  9. 29 11月, 2022 9 次提交
    • L
      fix mma_tensorcore (#48386) · bf4d1792
      lzy 提交于
      * fix mma_tensorcore (__CUDA_ARCH__)
      
      * disable tensorcore by default.
      
      disable tensorcore by default, because the judgment of __CUDA_ARCH__ will cause undefined behavior in some environments, can manually enable it on a machine that supports tensorcore.
      bf4d1792
    • P
      [PHI] traspose2 kernel migration (#47748) · d86aa4ca
      Paulina Gacek 提交于
      * traspose2 kernel migrated
      
      * Got rid of mutable_data
      
      * x modification added
      
      * ops added in extra info file
      
      * Formatting fix
      
      * 2 fuse passes with tanpose2 commented
      
      * nr of outs changed in 2 passes, passes uncommented
      
      * Changes in passes reverted
      
      * transpose chnaged in operator.cc
      
      * MKLDNN check in operator.cc
      
      * Transpose fixes
      
      * Fix deleted from operato
      
      * template corrected
      Co-authored-by: NPaulina Gacek <paulinagacek@intel.com>
      d86aa4ca
    • Replace LoDTensor with phi::DenseTensor in fluid\operators (#48417) · 91dd8a2e
      张春乔 提交于
      * replace LoDTensor with phi::DenseTensor in fluid\operators
      
      * replace LoDTensor with phi::DenseTensor in fluid\operators
      
      * Update split_lod_tensor_op.cc
      
      * Update warpctc_op.cc
      
      * Update broadcast_tensors_op.cc
      
      * Update crf_decoding_op.cc
      
      * Update lstm_op.cc
      
      * Update lstm_op.cc
      
      * Update lod_reset_op.cc
      
      * Update gru_op.cc
      
      * Update linear_chain_crf_op.cc
      
      * resume 2 files for confilct
      
      * Update gru_op.cc
      
      * Update linear_chain_crf_op.cc
      
      * Update lstm_op.cc
      91dd8a2e
    • N
      [CodeStyle][isort] introduce isort (part4) (#48402) · f85def97
      Nyakku Shigure 提交于
      * isort all files
      
      * revert conflicting files
      
      * revert conflicting files
      
      * revert conflicting files
      f85def97
    • S
      eltwise_div + scale [PHI] (#48484) · fa10524d
      Sławomir Siwek 提交于
      fa10524d
    • S
      [PHI] Migrate matmul kernel (#48162) · f41ccbd5
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      
      * init
      
      * ExecuteMatMulV2
      
      * rm fluid kernel
      
      * matmul_grad
      
      * remove mutable_data
      
      * mul_grad
      
      * matmul fwd
      
      * add extra attr
      
      * temp disable passes
      
      * re-enable passes
      
      * workaround for matmul+act
      
      * fix for matmul+eltwise_add
      
      * fix typo
      
      * merge bugfix #48364
      
      * remove merge conflict
      f41ccbd5
    • K
      [Control Flow] replace executor in while op with InterpreterCore (#47573) · 6dbfbfa5
      kangguangli 提交于
      * fix:add no support for cuda_arch<700
      
      * replace Executor in while op with InterpreterCore
      
      * cache InterpreterCore as the member of WhileOp
      
      * fix bug: tensor place changed because of assign op in while loop
      
      * refine code
      
      * refine code
      
      * refine code
      
      * hot fix
      
      * fix compile
      
      * merge develop
      
      * follow comments
      
      * add log for test
      
      * remove LoDTensor
      
      * set flag control_flow_use_new_executor false
      Co-authored-by: Nfengshuai <fengshuai03@baidu.com>
      Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
      6dbfbfa5
    • J
      Bugfix for Collective default calc stream (#48308) · a66bb67a
      JZ-LIANG 提交于
      * get default calc stream from execution ctx instead of global dev ctx pool.
      a66bb67a
    • L
      [Fluid API]Remove multiple APIs in control_flow (#48279) · c0d31dac
      LiYuRio 提交于
      * remove lod_tensor_to_array, array_to_lod_tensor, DynamicRNN
      
      * remove less_equal, greater_than, greater_equal, equal, not_equal
      c0d31dac