1. 23 11月, 2022 6 次提交
  2. 22 11月, 2022 10 次提交
  3. 21 11月, 2022 16 次提交
  4. 19 11月, 2022 2 次提交
  5. 18 11月, 2022 6 次提交
    • W
      refine save hook (#48124) · 04709310
      wanghuancoder 提交于
      04709310
    • MarDino's avatar
      Fused QKVBiasAdd and Transpose with Split Q, KV (#47680) · d595928e
      MarDino 提交于
      * fused qkvBiasAdd and transpose with split qkv
      
      * fix typo
      
      * fix format
      
      * fix name
      
      * add annotation
      
      * fix comment
      d595928e
    • S
      [PHI] Migrate matmul_grad kernel (#48023) · 4ab18ada
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      
      * init
      
      * ExecuteMatMulV2
      
      * rm fluid kernel
      
      * matmul_grad
      
      * remove mutable_data
      4ab18ada
    • Z
      [PHI] Migrate conv_transpose kernel (#48119) · 9aacb31b
      Zuza Gawrysiak 提交于
      * Migrate conv_transpose to phi
      
      * Move handler to kernel
      
      * kernel m
      
      * Fix formatting
      
      * handler
      
      * remove fluid
      
      * revert tcp_store
      
      * tcp_store
      
      * remove unused
      
      * Fix declaration
      
      * add dnn input
      
      * Fix typo
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      9aacb31b
    • Z
      Fix bug of zero_allocator in HostAlloc (#48108) · 7f92e27e
      zyfncg 提交于
      * fix bug of zero_allocator in host
      
      * fix test compile bug
      
      * add unittest
      
      * update test
      7f92e27e
    • MarDino's avatar
      Optimize FusedBiasAddGelu Kernel (#47679) · b0e28540
      MarDino 提交于
      * Add quick gelu and fused bias add kernel
      
      * fix annotation
      
      * remove useless code
      
      * add fast gelu option and set it in multi transformer op
      
      * add flag to restrict if use fast gelu approximate
      
      * fix flags conflict
      
      * fix use tanh function instead
      
      * add cudart version limit
      
      * use phi fast tanh func
      
      * fix comment
      b0e28540