1. 29 11月, 2022 7 次提交
  2. 28 11月, 2022 9 次提交
  3. 24 11月, 2022 2 次提交
    • H
      [PHI decoupling] simplify "convert_utils.h" in fluid (#48168) · de4310e6
      huangjiyi 提交于
      * rm dependence to "convert_utils.h" in some files
      
      * fix bugs
      
      * replace DataType2String with DataTypeToString
      
      * replace framework::DataTypeSize with phi::SizeOf
      
      * mv convert_function from fluid to phi and rm old map
      
      * recommit with pre-commit
      
      * repalce ProtoVarType with ProtoDataType and update comment.
      
      * fix error about include "dnnl.hpp"
      
      * revert add dep mkldnn to convert_utils in phi
      
      * add mkldnn deps in convert_utils.h in phi
      
      * move deps to convert_utils.h in phi
      de4310e6
    • S
      [PHI] Migrate batch_norm_grad kernel (#48288) · 561b7278
      Sławomir Siwek 提交于
      561b7278
  4. 23 11月, 2022 4 次提交
  5. 22 11月, 2022 7 次提交
  6. 21 11月, 2022 5 次提交
    • S
      add fc-residual quantization (#46917) · fed0ed34
      Sylwester Fraczek 提交于
      * add fc-residual quantization
      
      * revert removal of check for use_mkldnn
      
      * fix bug
      
      * add disable_logs
      
      * review fix
      
      call twice AreScalesPresntForNodes instead of if-else
      
      * rewrite residual input to output
      
      * revert fc mkldnn taking residual data
      
      * format fix
      
      * fix LoDTensor->DenseTensor
      
      * LoDTensor->DenseTensor
      
      * output->input
      
      * revert changes to unsupported script
      
      revert changes to unsupported script
      
      * remove fc residualdata from output blocklist in cpu_bfloat16_pass.cc
      fed0ed34
    • S
      [PHI] Migrate mul_grad kernel (#48061) · 55f6fb3d
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      
      * init
      
      * ExecuteMatMulV2
      
      * rm fluid kernel
      
      * matmul_grad
      
      * remove mutable_data
      
      * mul_grad
      55f6fb3d
    • L
      mma qk tensor_core (#48087) · d79eda71
      lzy 提交于
      * use mma for QK dot computing in fused_multi_transformer.
      * Update fused_multi_transformer_op.cu.h
      d79eda71
    • H
      [PHI decoupling] move cross_entropy from fluid to phi (#48160) · 3501ff7d
      huangjiyi 提交于
      * move cross_entropy from fluid to phi
      
      * replace mutable_data with Alloc
      
      * use .template
      3501ff7d
    • W
      Unify `ProcessGroupNCCL` APIs underlying implementation (#48163) · 88410225
      Wen Sun 提交于
      * refactor: replace Collective & PointToPoint with NCCLEnv
      
      * refactor: rename to RunFnInNCCLEnv
      
      * refactor: pass std::function by value
      88410225
  7. 18 11月, 2022 6 次提交
    • MarDino's avatar
      Fused QKVBiasAdd and Transpose with Split Q, KV (#47680) · d595928e
      MarDino 提交于
      * fused qkvBiasAdd and transpose with split qkv
      
      * fix typo
      
      * fix format
      
      * fix name
      
      * add annotation
      
      * fix comment
      d595928e
    • S
      [PHI] Migrate matmul_grad kernel (#48023) · 4ab18ada
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      
      * init
      
      * ExecuteMatMulV2
      
      * rm fluid kernel
      
      * matmul_grad
      
      * remove mutable_data
      4ab18ada
    • Z
      [PHI] Migrate conv_transpose kernel (#48119) · 9aacb31b
      Zuza Gawrysiak 提交于
      * Migrate conv_transpose to phi
      
      * Move handler to kernel
      
      * kernel m
      
      * Fix formatting
      
      * handler
      
      * remove fluid
      
      * revert tcp_store
      
      * tcp_store
      
      * remove unused
      
      * Fix declaration
      
      * add dnn input
      
      * Fix typo
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      9aacb31b
    • MarDino's avatar
      Optimize FusedBiasAddGelu Kernel (#47679) · b0e28540
      MarDino 提交于
      * Add quick gelu and fused bias add kernel
      
      * fix annotation
      
      * remove useless code
      
      * add fast gelu option and set it in multi transformer op
      
      * add flag to restrict if use fast gelu approximate
      
      * fix flags conflict
      
      * fix use tanh function instead
      
      * add cudart version limit
      
      * use phi fast tanh func
      
      * fix comment
      b0e28540
    • W
      [PHI decoupling] remove "gpu_primitives.h" in fluid (#48063) · 9918bf9c
      Wang Xin 提交于
      * remove "gpu_primitives.h" in fluid namespace
      
      * fix PR-CI-GpuPS fail
      
      * fix PR-CI-GpuPS fail
      9918bf9c
    • F
      fix: supoort huge length of attention (#48053) · 42f35841
      feng_shuai 提交于
      42f35841