1. 04 Aug, 2022 8 commits
  2. 03 Aug, 2022 25 commits
  3. 02 Aug, 2022 7 commits
    • fix namespace of GPUContext (#44822) · 65f38869
      Committed by Leo Chen
    • move renorm op (#44676) · 669353c1
      Committed by seemingwang
      * move renorm op
      
      * change python api
      
      * change op class func
      
      * alloc data
      
      * remove comments
      
      * fix grad arguments
      
      * fix python argument
      
      * fix python argument
      
      * change unit-test
      
      * remove shape func registration
      
      * recover extra-arguments
      
      * recover shape functor
    • [PFCC Operator Performance Optimization] SeluKernel Optimization (#44490) · 859c4077
      Committed by carryyu
      * [PFCC] SeluKernel Optimization
      
      * selu kernel optimization
      
      * add private
      
      Co-authored-by: carryyu <>
    • Multihead matmul fp16 (#44792) · 0fd8ee63
      Committed by Wilber
      * multihead matmul add fp16
      
      * fix windows error
      
      * fix rocm error
      
      * fix rocm error
    • D
      3491d183
    • [Eager] use eager final state instead intermediate state (#44722) · f1873b90
      Committed by Weilong Wu
      * [Eager] call final_state_slice under eager mode
      
      * rm useless comments
      
      * use eager final state instead of intermediate state
      
      * update fill_constant yaml
      
      * update fill_constant yaml
      
      * modify wrapped_infermeta_gen logic to fix special case
      
      * fix slice in manipulation
      
      * use fill_constant_
      
      * modify slice infermeta
      
      * rm final_state_conv2d
      
      * use final_state_slice
      
      * use final_state_slice only
      
      * polish slice, use final state
      
      * add paddle_throw for SplitInferMeta
      
      * rm fill_constant_ temporarily
      
      * recover array_equal, not allclose
      
      * recover original code
    • [Phi] Move QR to Phi (#44742) · 2cf2e786
      Committed by Yulong Ao
      * [Phi] Move Qr to the Phi
      
      * [Phi] Register the cpu grad kernel for qr
      
      * [Phi] Share the cuda kernels to lstsq
      
      * [Phi] Remove some improper include files
      
      * [Phi] Modify codes based on the reviews
      
      * [Phi] Remove unnecessary files and add the cuda_only comment
      
      * [Phi] Remove the unnecessary include file
      
      * [Phi] Remove qr_op.cu and lstsq_op.cu