1. 29 11月, 2022 12 次提交
    • L
      fix mma_tensorcore (#48386) · bf4d1792
      lzy 提交于
      * fix mma_tensorcore (__CUDA_ARCH__)
      
      * disable tensorcore by default.
      
      disable tensorcore by default, because the judgment of __CUDA_ARCH__ will cause undefined behavior in some environments, can manually enable it on a machine that supports tensorcore.
      bf4d1792
    • P
      [PHI] traspose2 kernel migration (#47748) · d86aa4ca
      Paulina Gacek 提交于
      * traspose2 kernel migrated
      
      * Got rid of mutable_data
      
      * x modification added
      
      * ops added in extra info file
      
      * Formatting fix
      
      * 2 fuse passes with tanpose2 commented
      
      * nr of outs changed in 2 passes, passes uncommented
      
      * Changes in passes reverted
      
      * transpose chnaged in operator.cc
      
      * MKLDNN check in operator.cc
      
      * Transpose fixes
      
      * Fix deleted from operato
      
      * template corrected
      Co-authored-by: NPaulina Gacek <paulinagacek@intel.com>
      d86aa4ca
    • Replace LoDTensor with phi::DenseTensor in fluid\operators (#48417) · 91dd8a2e
      张春乔 提交于
      * replace LoDTensor with phi::DenseTensor in fluid\operators
      
      * replace LoDTensor with phi::DenseTensor in fluid\operators
      
      * Update split_lod_tensor_op.cc
      
      * Update warpctc_op.cc
      
      * Update broadcast_tensors_op.cc
      
      * Update crf_decoding_op.cc
      
      * Update lstm_op.cc
      
      * Update lstm_op.cc
      
      * Update lod_reset_op.cc
      
      * Update gru_op.cc
      
      * Update linear_chain_crf_op.cc
      
      * resume 2 files for confilct
      
      * Update gru_op.cc
      
      * Update linear_chain_crf_op.cc
      
      * Update lstm_op.cc
      91dd8a2e
    • N
      [CodeStyle][isort] introduce isort (part4) (#48402) · f85def97
      Nyakku Shigure 提交于
      * isort all files
      
      * revert conflicting files
      
      * revert conflicting files
      
      * revert conflicting files
      f85def97
    • S
      eltwise_div + scale [PHI] (#48484) · fa10524d
      Sławomir Siwek 提交于
      fa10524d
    • S
      [PHI] Migrate matmul kernel (#48162) · f41ccbd5
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      
      * init
      
      * ExecuteMatMulV2
      
      * rm fluid kernel
      
      * matmul_grad
      
      * remove mutable_data
      
      * mul_grad
      
      * matmul fwd
      
      * add extra attr
      
      * temp disable passes
      
      * re-enable passes
      
      * workaround for matmul+act
      
      * fix for matmul+eltwise_add
      
      * fix typo
      
      * merge bugfix #48364
      
      * remove merge conflict
      f41ccbd5
    • K
      [Control Flow] replace executor in while op with InterpreterCore (#47573) · 6dbfbfa5
      kangguangli 提交于
      * fix:add no support for cuda_arch<700
      
      * replace Executor in while op with InterpreterCore
      
      * cache InterpreterCore as the member of WhileOp
      
      * fix bug: tensor place changed because of assign op in while loop
      
      * refine code
      
      * refine code
      
      * refine code
      
      * hot fix
      
      * fix compile
      
      * merge develop
      
      * follow comments
      
      * add log for test
      
      * remove LoDTensor
      
      * set flag control_flow_use_new_executor false
      Co-authored-by: Nfengshuai <fengshuai03@baidu.com>
      Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
      6dbfbfa5
    • J
      Bugfix for Collective default calc stream (#48308) · a66bb67a
      JZ-LIANG 提交于
      * get default calc stream from execution ctx instead of global dev ctx pool.
      a66bb67a
    • L
      [Fluid API]Remove multiple APIs in control_flow (#48279) · c0d31dac
      LiYuRio 提交于
      * remove lod_tensor_to_array, array_to_lod_tensor, DynamicRNN
      
      * remove less_equal, greater_than, greater_equal, equal, not_equal
      c0d31dac
    • S
      [PHI decoupling] Move MKLDNN code (#48352) · fa051eec
      Sławomir Siwek 提交于
      fa051eec
    • HappyHeavyRain's avatar
      Generate static graph code for lerp by yaml (#48322) · d5387de2
      HappyHeavyRain 提交于
      * generate static graph code for lerp by yaml, test=develop
      
      * modify the op_compat.yaml of lerp, test=develop
      
      * generate static graph code for lerp by yaml, test=develop
      
      * modify the op_compat.yaml of lerp, test=develop
      
      * remove the 'attrs' of lerp, test=develop
      Signed-off-by: HappyHeavyRain's avatarlizhiyu02 <1528794076@qq.com>
      Signed-off-by: HappyHeavyRain's avatarlizhiyu02 <1528794076@qq.com>
      d5387de2
    • Z
      [Sparse]BatchNorm use inplace (#48254) · d33d6db0
      zhangkaihuo 提交于
      d33d6db0
  2. 28 11月, 2022 9 次提交
  3. 24 11月, 2022 2 次提交
    • H
      [PHI decoupling] simplify "convert_utils.h" in fluid (#48168) · de4310e6
      huangjiyi 提交于
      * rm dependence to "convert_utils.h" in some files
      
      * fix bugs
      
      * replace DataType2String with DataTypeToString
      
      * replace framework::DataTypeSize with phi::SizeOf
      
      * mv convert_function from fluid to phi and rm old map
      
      * recommit with pre-commit
      
      * repalce ProtoVarType with ProtoDataType and update comment.
      
      * fix error about include "dnnl.hpp"
      
      * revert add dep mkldnn to convert_utils in phi
      
      * add mkldnn deps in convert_utils.h in phi
      
      * move deps to convert_utils.h in phi
      de4310e6
    • S
      [PHI] Migrate batch_norm_grad kernel (#48288) · 561b7278
      Sławomir Siwek 提交于
      561b7278
  4. 23 11月, 2022 4 次提交
  5. 22 11月, 2022 7 次提交
  6. 21 11月, 2022 5 次提交
    • S
      add fc-residual quantization (#46917) · fed0ed34
      Sylwester Fraczek 提交于
      * add fc-residual quantization
      
      * revert removal of check for use_mkldnn
      
      * fix bug
      
      * add disable_logs
      
      * review fix
      
      call twice AreScalesPresntForNodes instead of if-else
      
      * rewrite residual input to output
      
      * revert fc mkldnn taking residual data
      
      * format fix
      
      * fix LoDTensor->DenseTensor
      
      * LoDTensor->DenseTensor
      
      * output->input
      
      * revert changes to unsupported script
      
      revert changes to unsupported script
      
      * remove fc residualdata from output blocklist in cpu_bfloat16_pass.cc
      fed0ed34
    • S
      [PHI] Migrate mul_grad kernel (#48061) · 55f6fb3d
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      
      * init
      
      * ExecuteMatMulV2
      
      * rm fluid kernel
      
      * matmul_grad
      
      * remove mutable_data
      
      * mul_grad
      55f6fb3d
    • L
      mma qk tensor_core (#48087) · d79eda71
      lzy 提交于
      * use mma for QK dot computing in fused_multi_transformer.
      * Update fused_multi_transformer_op.cu.h
      d79eda71
    • H
      [PHI decoupling] move cross_entropy from fluid to phi (#48160) · 3501ff7d
      huangjiyi 提交于
      * move cross_entropy from fluid to phi
      
      * replace mutable_data with Alloc
      
      * use .template
      3501ff7d
    • W
      Unify `ProcessGroupNCCL` APIs underlying implementation (#48163) · 88410225
      Wen Sun 提交于
      * refactor: replace Collective & PointToPoint with NCCLEnv
      
      * refactor: rename to RunFnInNCCLEnv
      
      * refactor: pass std::function by value
      88410225
  7. 18 11月, 2022 1 次提交