1. 05 12月, 2022 4 次提交
  2. 02 12月, 2022 3 次提交
  3. 01 12月, 2022 2 次提交
  4. 30 11月, 2022 8 次提交
  5. 29 11月, 2022 12 次提交
    • L
      fix mma_tensorcore (#48386) · bf4d1792
      lzy 提交于
      * fix mma_tensorcore (__CUDA_ARCH__)
      
      * disable tensorcore by default.
      
      disable tensorcore by default, because the judgment of __CUDA_ARCH__ will cause undefined behavior in some environments, can manually enable it on a machine that supports tensorcore.
      bf4d1792
    • P
      [PHI] traspose2 kernel migration (#47748) · d86aa4ca
      Paulina Gacek 提交于
      * traspose2 kernel migrated
      
      * Got rid of mutable_data
      
      * x modification added
      
      * ops added in extra info file
      
      * Formatting fix
      
      * 2 fuse passes with tanpose2 commented
      
      * nr of outs changed in 2 passes, passes uncommented
      
      * Changes in passes reverted
      
      * transpose chnaged in operator.cc
      
      * MKLDNN check in operator.cc
      
      * Transpose fixes
      
      * Fix deleted from operato
      
      * template corrected
      Co-authored-by: NPaulina Gacek <paulinagacek@intel.com>
      d86aa4ca
    • Replace LoDTensor with phi::DenseTensor in fluid\operators (#48417) · 91dd8a2e
      张春乔 提交于
      * replace LoDTensor with phi::DenseTensor in fluid\operators
      
      * replace LoDTensor with phi::DenseTensor in fluid\operators
      
      * Update split_lod_tensor_op.cc
      
      * Update warpctc_op.cc
      
      * Update broadcast_tensors_op.cc
      
      * Update crf_decoding_op.cc
      
      * Update lstm_op.cc
      
      * Update lstm_op.cc
      
      * Update lod_reset_op.cc
      
      * Update gru_op.cc
      
      * Update linear_chain_crf_op.cc
      
      * resume 2 files for confilct
      
      * Update gru_op.cc
      
      * Update linear_chain_crf_op.cc
      
      * Update lstm_op.cc
      91dd8a2e
    • N
      [CodeStyle][isort] introduce isort (part4) (#48402) · f85def97
      Nyakku Shigure 提交于
      * isort all files
      
      * revert conflicting files
      
      * revert conflicting files
      
      * revert conflicting files
      f85def97
    • S
      eltwise_div + scale [PHI] (#48484) · fa10524d
      Sławomir Siwek 提交于
      fa10524d
    • S
      [PHI] Migrate matmul kernel (#48162) · f41ccbd5
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      
      * init
      
      * ExecuteMatMulV2
      
      * rm fluid kernel
      
      * matmul_grad
      
      * remove mutable_data
      
      * mul_grad
      
      * matmul fwd
      
      * add extra attr
      
      * temp disable passes
      
      * re-enable passes
      
      * workaround for matmul+act
      
      * fix for matmul+eltwise_add
      
      * fix typo
      
      * merge bugfix #48364
      
      * remove merge conflict
      f41ccbd5
    • K
      [Control Flow] replace executor in while op with InterpreterCore (#47573) · 6dbfbfa5
      kangguangli 提交于
      * fix:add no support for cuda_arch<700
      
      * replace Executor in while op with InterpreterCore
      
      * cache InterpreterCore as the member of WhileOp
      
      * fix bug: tensor place changed because of assign op in while loop
      
      * refine code
      
      * refine code
      
      * refine code
      
      * hot fix
      
      * fix compile
      
      * merge develop
      
      * follow comments
      
      * add log for test
      
      * remove LoDTensor
      
      * set flag control_flow_use_new_executor false
      Co-authored-by: Nfengshuai <fengshuai03@baidu.com>
      Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
      6dbfbfa5
    • J
      Bugfix for Collective default calc stream (#48308) · a66bb67a
      JZ-LIANG 提交于
      * get default calc stream from execution ctx instead of global dev ctx pool.
      a66bb67a
    • L
      [Fluid API]Remove multiple APIs in control_flow (#48279) · c0d31dac
      LiYuRio 提交于
      * remove lod_tensor_to_array, array_to_lod_tensor, DynamicRNN
      
      * remove less_equal, greater_than, greater_equal, equal, not_equal
      c0d31dac
    • S
      [PHI decoupling] Move MKLDNN code (#48352) · fa051eec
      Sławomir Siwek 提交于
      fa051eec
    • HappyHeavyRain's avatar
      Generate static graph code for lerp by yaml (#48322) · d5387de2
      HappyHeavyRain 提交于
      * generate static graph code for lerp by yaml, test=develop
      
      * modify the op_compat.yaml of lerp, test=develop
      
      * generate static graph code for lerp by yaml, test=develop
      
      * modify the op_compat.yaml of lerp, test=develop
      
      * remove the 'attrs' of lerp, test=develop
      Signed-off-by: HappyHeavyRain's avatarlizhiyu02 <1528794076@qq.com>
      Signed-off-by: HappyHeavyRain's avatarlizhiyu02 <1528794076@qq.com>
      d5387de2
    • Z
      [Sparse]BatchNorm use inplace (#48254) · d33d6db0
      zhangkaihuo 提交于
      d33d6db0
  6. 28 11月, 2022 9 次提交
  7. 24 11月, 2022 2 次提交
    • H
      [PHI decoupling] simplify "convert_utils.h" in fluid (#48168) · de4310e6
      huangjiyi 提交于
      * rm dependence to "convert_utils.h" in some files
      
      * fix bugs
      
      * replace DataType2String with DataTypeToString
      
      * replace framework::DataTypeSize with phi::SizeOf
      
      * mv convert_function from fluid to phi and rm old map
      
      * recommit with pre-commit
      
      * repalce ProtoVarType with ProtoDataType and update comment.
      
      * fix error about include "dnnl.hpp"
      
      * revert add dep mkldnn to convert_utils in phi
      
      * add mkldnn deps in convert_utils.h in phi
      
      * move deps to convert_utils.h in phi
      de4310e6
    • S
      [PHI] Migrate batch_norm_grad kernel (#48288) · 561b7278
      Sławomir Siwek 提交于
      561b7278