1. 26 Jul, 2023 1 commit
  2. 20 Jul, 2023 1 commit
  3. 19 Jul, 2023 2 commits
  4. 13 Jul, 2023 1 commit
  5. 11 Jul, 2023 1 commit
    • Integrate rmsnorm kernel (#54998) · 97d3d6ee
      MarDino committed
      * add rmsnorm kernel
      * add static graph test
      * fix round type
      * use alignas to avoid msvc compile error
      * remove redundant headerfile to avoid rocm compile error
      * fix rocm compile not found cub
      * Add document
  6. 06 Jul, 2023 2 commits
  7. 03 Jul, 2023 1 commit
  8. 29 Jun, 2023 2 commits
  9. 27 Jun, 2023 1 commit
  10. 25 Jun, 2023 1 commit
  11. 16 Jun, 2023 1 commit
  12. 14 Jun, 2023 3 commits
  13. 12 Jun, 2023 1 commit
  14. 09 Jun, 2023 1 commit
  15. 01 Jun, 2023 1 commit
  16. 31 May, 2023 2 commits
  17. 30 May, 2023 1 commit
  18. 23 May, 2023 1 commit
  19. 22 May, 2023 1 commit
    • [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() (#53856) · 3794d171
      Meteor Liu committed
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * fixed cyclic reference that caused partial import
      
      * fixed bad change
      
      * fix bad import
      
      * fix bad import
      
      * fix bad import
      
      * fix ut failed caused by change in_dynamic_mode
      
      * fix ut failed caused by change in_dynamic_mode
      
      * fixed usage of in_dynamic_mode() or in_dygraph_mode()
      
      * revert python3 to python in .pre-commit-config.yaml
      
      * fix merge conflicts
  20. 19 May, 2023 1 commit
    • Add flash attention to speedup fused_gate_attention. (#52731) · d29c1f8e
      limingshu committed
      * Reorganize the forward codes of flash-attention.
      
      * Fix forward.
      
      * Remove some unused code.
      
      * Simplify codes and fix backward.
      
      * Change all LOG(INFO) to VLOG and fix the backward.
      
      * add scale for AF2 flash_attn, much thanks to xreki and shaojie for debug these codes
      
      * decrease the effect of debug print on performance
      
      * Unify the initialize of flashattn arguments.
      
      * Rewrite the reshape of temp_mask and temp_bias.
      
      * API support use_flash_attn.
      
      * Fix compiling error on CI.
      
      * Try to crop the flash-attention lib.
      
      * Correct the condition of whether can use flash-attn.
      
      * Remove the softmax_out argument.
      
      * Remove is_causal.
      
      * Polish codes.
      
      * Fix qkv_transpose_out's shape and scaling of Q * K.
      
      * Update commit of flash-attention.
      
      ---------
      Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
  21. 18 May, 2023 1 commit
    • Fused elementwises kernels and ops (#51427) · fb4a6ecf
      Hulek committed
      * Fused elementwises kernels and ops
      
      * change fuse pass name
      
      * adjust .pbtxt files
      
      * adjust quantization attributes
      
      * add missing arguments and fix others, review fixed
      
      * simplify fused kernel registration
      
      * fix elementwise unit tests
      
      * reuse one fused elementwise op
      
      * adjust proto
      
      * Add supported datatypes
      
      * Change 'Scale' to 'scale' in tests, change some tests to onednn
      
      * Revert breaking changes
      
      * Fix unit tests
      
      * Delete obsolete test cases
      
      * Delete commented out code
      
      * Fix codestyle
      
      * delete temporary condition
      
      * fix conflicts and delete duplicate fusing
      
      * Fix code after merge
      
      * Move tests to new directory
      
      * fix tests volatility
      
      * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py
      
      * Update CMakeLists.txt add mkldnn op test
      
      ---------
      Co-authored-by: Silv3S <slawomir.siwek@intel.com>
  22. 15 May, 2023 1 commit
  23. 11 May, 2023 1 commit
  24. 06 May, 2023 1 commit
  25. 30 Apr, 2023 1 commit
  26. 27 Apr, 2023 1 commit
  27. 26 Apr, 2023 2 commits
  28. 23 Apr, 2023 1 commit
    • relocate paddle/fluid/contrib/layers (#52820) · 802be98b
      LoneRanger committed
      * relocate metri_op.py
      
      * relocate nn.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix variable->tensor and fix __all__
      
      * fix ctr_metric_bundle and sparse_embedding
      
      * fix bug of function init
      
      * fix bug of importing sparse_embedding and ctr_metric_bundle
      
      * fix bug
      
      * Update __init__.py
  29. 22 Apr, 2023 1 commit
  30. 21 Apr, 2023 2 commits
  31. 20 Apr, 2023 1 commit
  32. 19 Apr, 2023 1 commit