1. 08 Aug, 2022 6 commits
  2. 05 Aug, 2022 7 commits
  3. 04 Aug, 2022 7 commits
  4. 03 Aug, 2022 14 commits
  5. 02 Aug, 2022 6 commits
    • L
      fix namespace of GPUContext (#44822) · 65f38869
      Leo Chen committed
    • S
      move renorm op (#44676) · 669353c1
      seemingwang committed
      * move renorm op
      
      * change python api
      
      * change op class func
      
      * alloc data
      
      * remove comments
      
      * fix grad arguments
      
      * fix python argument
      
      * fix python argument
      
      * change unit-test
      
      * remove shape func registration
      
      * recover extra-arguments
      
      * recover shape functor
    • W
      Multihead matmul fp16 (#44792) · 0fd8ee63
      Wilber committed
      * multihead matmul add fp16
      
      * fix windows error
      
      * fix rocm error
      
      * fix rocm error
    • Y
      [Phi] Move QR to Phi (#44742) · 2cf2e786
      Yulong Ao committed
      * [Phi] Move Qr to the Phi
      
      * [Phi] Register the cpu grad kernel for qr
      
      * [Phi] Share the cuda kernels to lstsq
      
      * [Phi] Remove some improper include files
      
      * [Phi] Modify codes based on the reviews
      
      * [Phi] Remove unnecessary files and add the cuda_only comment
      
      * [Phi] Remove the unnecessary include file
      
      * [Phi] Remove qr_op.cu and lstsq_op.cu
    • H
      [XPU] fp16 for layer_norm op (#44778) · 4c3e13de
      houj04 committed
      * [XPU] fp16 for layer_norm op. test=kunlun
    • C
      [phi] add yolov3_loss yaml and unittest (#44476) · c7cf12fc
      ccrrong committed
      * add yaml and unittest
      
      * update yaml
      
      * update backward yaml and unittest
      
      * update yaml
      
      * add Yolov3LossGradInferMeta
      
      * update yolov3_loss_op.cc
      
      * fix bug
      
      * code format