1. 24 5月, 2023 3 次提交
  2. 23 5月, 2023 2 次提交
  3. 22 5月, 2023 2 次提交
  4. 19 5月, 2023 2 次提交
    • W
      [XPU] fix fallback (#53801) · 4b85e5db
      wz1qqx 提交于
      4b85e5db
    • L
      Add flash attention to speedup fused_gate_attention. (#52731) · d29c1f8e
      limingshu 提交于
      * Reorganize the forward codes of flash-attention.
      
      * Fix forward.
      
      * Remove some noused codes.
      
      * Simplify codes and fix backward.
      
      * Change all LOG(INFO) to VLOG and fix the backward.
      
      * add scale for AF2 flash_attn, much thanks to xreki and shaojie for debug these codes
      
      * decrease the effect of debug print on performance
      
      * Unify the initialize of flashattn arguments.
      
      * Rewirte the reshape of temp_mask and temp_bias.
      
      * API support use_flash_attn.
      
      * Fix compiling error on CI.
      
      * Try to crop the flash-attention lib.
      
      * Correct the condition of whether can use flash-attn.
      
      * Remove the softmax_out argument.
      
      * Remove is_causal.
      
      * Polish codes.
      
      * Fix qkv_transpose_out's shape and scaling of Q * K.
      
      * Update commit of flash-attention.
      
      ---------
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      d29c1f8e
  5. 18 5月, 2023 2 次提交
  6. 15 5月, 2023 2 次提交
  7. 12 5月, 2023 2 次提交
  8. 11 5月, 2023 4 次提交
  9. 10 5月, 2023 1 次提交
  10. 09 5月, 2023 3 次提交
  11. 08 5月, 2023 2 次提交
  12. 06 5月, 2023 1 次提交
  13. 28 4月, 2023 1 次提交
  14. 27 4月, 2023 1 次提交
  15. 26 4月, 2023 1 次提交
  16. 25 4月, 2023 1 次提交
    • Y
      [PHI]Add flags macro for PHI (#52991) · 22e96bde
      YuanRisheng 提交于
      * add flags for phi
      
      * fix compile bugs
      
      * fix ci bugs
      
      * fix inference bugs
      
      * fix cinn' bugs
      
      * fix cinn bugs
      
      * perfect code according comment
      
      * fix ci bugs
      
      * fix ci bugs
      22e96bde
  17. 24 4月, 2023 1 次提交
    • G
      remove some [-Wunused-parameter] (#53185) · 834eb2ba
      Galaxy1458 提交于
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test ,test=develop
      834eb2ba
  18. 20 4月, 2023 2 次提交
  19. 19 4月, 2023 1 次提交
  20. 17 4月, 2023 1 次提交
  21. 14 4月, 2023 2 次提交
  22. 13 4月, 2023 3 次提交