1. 02 8月, 2022 5 次提交
    • W
      Multihead matmul fp16 (#44792) · 0fd8ee63
      Wilber 提交于
      * multihead matmul add fp16
      
      * fix windows error
      
      * fix rocm error
      
      * fix rocm error
      0fd8ee63
    • Y
      [Phi] Move QR to Phi (#44742) · 2cf2e786
      Yulong Ao 提交于
      * [Phi] Move Qr to the Phi
      
      * [Phi] Regiter the cpu grad kernel for qr
      
      * [Phi] Share the cuda kernels to lstsq
      
      * [Phi] Remove some improper inlcude files
      
      * [Phi] Modify codes based on the reviews
      
      * [Phi] Remove unecessary files and add the cuda_only comment
      
      * [Phi] Remove the unecessary include file
      
      * [Phi] Remove qr_op.cu and lstsq_op.cu
      2cf2e786
    • H
      [XPU] fp16 for layer_norm op (#44778) · 4c3e13de
      houj04 提交于
      * [XPU] fp16 for layer_norm op. test=kunlun
      4c3e13de
    • C
      [phi] add yolov3_loss yaml and unittest (#44476) · c7cf12fc
      ccrrong 提交于
      * add yaml and unittest
      
      * update yaml
      
      * update backward yaml and unittest
      
      * update yaml
      
      * add Yolov3LossGradInferMeta
      
      * update yolov3_loss_op.cc
      
      * fix bug
      
      * code format
      c7cf12fc
    • M
      support beam_search operator on xpu. test=kunlun (#44720) · 9bf80772
      mengqingchun02 提交于
      * support beam_search operator on xpu. test=kunlun
      
      * support beam_search operator on xpu. test=kunlun
      
      * support beam_search operator on xpu. test=kunlun
      
      * support beam_search operator on xpu. test=kunlun
      
      * support beam_search operator on xpu. test=kunlun
      9bf80772
  2. 01 8月, 2022 7 次提交
  3. 30 7月, 2022 1 次提交
  4. 29 7月, 2022 9 次提交
  5. 28 7月, 2022 10 次提交
  6. 27 7月, 2022 5 次提交
  7. 26 7月, 2022 3 次提交