1. 08 5月, 2023 7 次提交
  2. 06 5月, 2023 6 次提交
  3. 05 5月, 2023 7 次提交
  4. 04 5月, 2023 1 次提交
  5. 30 4月, 2023 1 次提交
  6. 28 4月, 2023 10 次提交
  7. 27 4月, 2023 8 次提交
    • B
      Support different dtypes of inputs for broadcast for dropout optimization (#52093) · 3474e09c
      Bo Zhang 提交于
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * PR comment
      3474e09c
    • G
      [phi] Move sequence_pool to phi - Step 3 :sequence_pool_grad_op (#52680) · fe053396
      gouzil 提交于
      * [phi] move sequence_pool kernel to phi
      
      * mv kernels impl
      
      * fix parameter error
      
      * clean include
      
      * fix compat filename
      
      * [phi] move fluid sequence_pool_grad to phi
      
      * [phi][compat] sig rm GradVarName
      
      * [phi] fix sequence_pool out type
      
      * [phi] rm impl, add const string
      
      * [phi] fix const str
      
      * fix sequence_pooling cmake
      
      * [phi] mv sequence_pooling_test
      
      * [phi] fix grad sig
      
      * [phi] fix sequence_pool is_test error
      
      * [phi] fix sequence_pooling gpu include
      
      * [phi] mv to impl
      
      * [phi] fix SequencePoolFunctor cu include
      
      * [phi] modify out max_index int32_t
      
      * [phi] add pooltype mapping determine
      
      * [phi] fix sequence_pool_sig
      
      * [phi] fix sequence_pool_sig sum
      
      * [phi] try ci
      
      * [phi] fix max_index optional
      fe053396
    • H
      [XPU] c_sync_calc_stream support more types (#53389) · 9c1eb98a
      houj04 提交于
      9c1eb98a
    • G
      [static op generation] triangular_solve (#53328) · 18968e7e
      gouzil 提交于
      * [static op generation] triangular_solve
      
      * [phi] mv triangular_solve_grad to static_backward
      
      * [phi] fix import
      
      * [phi] mv to ops.yaml、 backward.yaml
      
      * fix forward attr
      
      * [phi] fix triangular_solve_grad args
      18968e7e
    • W
      cf6cbc34
    • N
      【PaddlePaddle Hackathon 4】:为maxout算子支持 float16 数据类型 (#50976) · 8bfd978f
      NetPunk 提交于
      * support fp16 for maxout op
      
      * format code
      
      * change api
      
      * add test for static float16
      
      * format code
      
      * formatting code
      
      * atol alignment
      
      * experiment—1
      
      * experiment-2
      
      * experiment-3
      
      * format code
      8bfd978f
    • S
      Move fused feedforward (#53166) · 25b4ba7f
      Sonder 提交于
      * trans fused_feedward Compute function to phi
      
      * add register info
      
      * remove maxfunctor
      
      * move fused feedward to phi
      
      * remove sig file
      
      * remove fliud include
      
      * add include
      
      * add include
      
      * add sig file
      
      * add output register info
      
      * fix sig file
      
      * Update fused_feedforward_sig.cc
      
      * fix grad kernel
      
      * update output register info
      
      * fix
      
      * open fused_feedforward static build
      
      * add optional and fix code style
      
      * fix output info for fused attention
      
      * add optional param
      
      * merge
      25b4ba7f
    • X
      【prim】Concat bug (#53350) · 6768c6ec
      xiaoguoguo626807 提交于
      * modify concat_grad add sum comp rule
      
      * modify opcompat
      6768c6ec