1. 24 11月, 2022 10 次提交
  2. 23 11月, 2022 13 次提交
  3. 22 11月, 2022 10 次提交
  4. 21 11月, 2022 7 次提交
    • L
      fix doc of NPUPlace (#48148) · 809516f6
      Leo Chen 提交于
      * fix doc of NPUPlace
      
      * fix doc of NPUPlace, test=document_fix
      809516f6
    • R
      Fix Ctx Dev pointer for KUNLUN (#48184) · 2d0fb059
      Roc 提交于
      2d0fb059
    • S
      add fc-residual quantization (#46917) · fed0ed34
      Sylwester Fraczek 提交于
      * add fc-residual quantization
      
      * revert removal of check for use_mkldnn
      
      * fix bug
      
      * add disable_logs
      
      * review fix
      
      call twice AreScalesPresntForNodes instead of if-else
      
      * rewrite residual input to output
      
      * revert fc mkldnn taking residual data
      
      * format fix
      
      * fix LoDTensor->DenseTensor
      
      * LoDTensor->DenseTensor
      
      * output->input
      
      * revert changes to unsupported script
      
      revert changes to unsupported script
      
      * remove fc residualdata from output blocklist in cpu_bfloat16_pass.cc
      fed0ed34
    • R
      delete unnecessary shape and slice op (#48112) · 41483383
      RichardWooSJTU 提交于
      41483383
    • S
      [PHI] Migrate mul_grad kernel (#48061) · 55f6fb3d
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      
      * init
      
      * ExecuteMatMulV2
      
      * rm fluid kernel
      
      * matmul_grad
      
      * remove mutable_data
      
      * mul_grad
      55f6fb3d
    • L
      mma qk tensor_core (#48087) · d79eda71
      lzy 提交于
      * use mma for QK dot computing in fused_multi_transformer.
      * Update fused_multi_transformer_op.cu.h
      d79eda71
    • W
      refine reduce_all (#48133) · 56f15c43
      wanghuancoder 提交于
      * refine reduce_all
      56f15c43