1. 29 6月, 2023 2 次提交
  2. 27 6月, 2023 1 次提交
  3. 25 6月, 2023 1 次提交
  4. 16 6月, 2023 1 次提交
  5. 14 6月, 2023 3 次提交
  6. 12 6月, 2023 1 次提交
  7. 09 6月, 2023 1 次提交
  8. 01 6月, 2023 1 次提交
  9. 31 5月, 2023 2 次提交
  10. 30 5月, 2023 1 次提交
  11. 23 5月, 2023 1 次提交
  12. 22 5月, 2023 1 次提交
    • M
      [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() (#53856) · 3794d171
      Meteor Liu 提交于
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * fixed cyclic reference that caused patial import
      
      * fixed bad change
      
      * fix bad import
      
      * fix bad import
      
      * fix bad import
      
      * fix ut failed caused by change in_dynamic_mode
      
      * fix ut failed caused by change in_dynamic_mode
      
      * fixed usage of in_dynamic_mode() or in_dygraph_mode()
      
      * revert python3 to python in .pre-commit-config.yaml
      
      * fix merge conflicts
      3794d171
  13. 19 5月, 2023 1 次提交
    • L
      Add flash attention to speedup fused_gate_attention. (#52731) · d29c1f8e
      limingshu 提交于
      * Reorganize the forward codes of flash-attention.
      
      * Fix forward.
      
      * Remove some noused codes.
      
      * Simplify codes and fix backward.
      
      * Change all LOG(INFO) to VLOG and fix the backward.
      
      * add scale for AF2 flash_attn, much thanks to xreki and shaojie for debug these codes
      
      * decrease the effect of debug print on performance
      
      * Unify the initialize of flashattn arguments.
      
      * Rewirte the reshape of temp_mask and temp_bias.
      
      * API support use_flash_attn.
      
      * Fix compiling error on CI.
      
      * Try to crop the flash-attention lib.
      
      * Correct the condition of whether can use flash-attn.
      
      * Remove the softmax_out argument.
      
      * Remove is_causal.
      
      * Polish codes.
      
      * Fix qkv_transpose_out's shape and scaling of Q * K.
      
      * Update commit of flash-attention.
      
      ---------
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      d29c1f8e
  14. 18 5月, 2023 1 次提交
    • H
      Fused elementwises kernels and ops (#51427) · fb4a6ecf
      Hulek 提交于
      * Fused elementwises kernels and ops
      
      * change fuse pass name
      
      * adjust .pbtxt files
      
      * adjust quantization attributes
      
      * add missing arguments and fix others, review fixed
      
      * simplify fused kernel registration
      
      * fix elementwise unit tests
      
      * reuse one fused elementwise op
      
      * adjust proto
      
      * Add supported datatypes
      
      * Change 'Scale' to 'scale' in tests, change some tests to onednn
      
      * Revert breaking changes
      
      * Fix unit tests
      
      * Delete obsolete test cases
      
      * Delete commented out code
      
      * Fix codestyle
      
      * delete temporary condition
      
      * fix conflicts and delete duplicate fusing
      
      * Fix code after merge
      
      * Move tests to new directory
      
      * fix tests volatility
      
      * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py
      
      * Update CMakeLists.txt add mkldnn op test
      
      ---------
      Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
      fb4a6ecf
  15. 15 5月, 2023 1 次提交
  16. 11 5月, 2023 1 次提交
  17. 06 5月, 2023 1 次提交
  18. 30 4月, 2023 1 次提交
  19. 27 4月, 2023 1 次提交
  20. 26 4月, 2023 2 次提交
  21. 23 4月, 2023 1 次提交
    • L
      relocate paddle/fluid/contrib/layers (#52820) · 802be98b
      LoneRanger 提交于
      * relocate metri_op.py
      
      * reloacte nn.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix variable->tensor and fix __all__
      
      * fix ctr_metric_bundle and sparse_embedding
      
      * fix bug of function init
      
      * fix bug of importing sparse_embedding and ctr_metric_bundle
      
      * fix bug
      
      * Update __init__.py
      802be98b
  22. 22 4月, 2023 1 次提交
  23. 21 4月, 2023 2 次提交
  24. 20 4月, 2023 1 次提交
  25. 19 4月, 2023 3 次提交
  26. 17 4月, 2023 1 次提交
  27. 14 4月, 2023 2 次提交
    • F
      1. modify set_value op, use Scalars to represent attr `values`, instead of a... · dd2a749a
      Feiyu Chan 提交于
      1. modify set_value op, use Scalars to represent attr `values`, instead of a bunch of attributs of various types; (#52408)
      
      2. add program converter and set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats(the differences are mainly some operators with incompatible updates in the definition);
      3. program version and operator version map now are always saved when serializing `paddle::framework::ProgramDesc` to identify the version;
      3. provide an option `legacy_format=false` in  serialization of `paddle::framework::ProgramDesc`, it decided whether to convert ProgramDesc back to a legacy format, which is compatible for paddle 2.4.2 or earlier versions to load and execute;
      4. deserialization of `paddle::framework::ProgramDesc` is now automatically detecting whether the bytes it receives is in legacy format(contains any of the operators that has been incompatibly updated and have any attribute of type `Scalar`) and convert it to new format. But if you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. Though it is not recommended, it can be used for the purpose of testing.
      dd2a749a
    • J
      【Prim】Add more infer var type (#52818) · 630d14f5
      Jiabin Yang 提交于
      * add more infer var type
      
      * fix split error
      
      * fix ut
      
      * fix top_k infer vartype
      
      * fix top_k infer vartype
      630d14f5
  28. 12 4月, 2023 2 次提交
    • H
      Modify LayerNorm Composite Rule (#52712) · a2060568
      Huihuang Zheng 提交于
      * [Do NOT merge] Expr PR on Composite
      
      * Expr PR on Composite
      
      * Revert some compsite experiment
      
      * Remove unnecessary composite code
      
      * Add rsqrt as sub primitives
      a2060568
    • C
      [Prim] Add instance_norm composite rule (#52203) · b0f17d05
      chenjian 提交于
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * isamp
      
      * gpu
      
      * cpu
      
      * noamp
      
      * fix instance_norm
      
      * fix
      
      * fix unit test
      
      * fix unit test
      
      * add unit test
      
      * fix
      
      * add big data tests
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * add test case
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * remove amp test
      
      ---------
      Co-authored-by: Nheyanru01 <429520051@qq.com>
      b0f17d05
  29. 10 4月, 2023 1 次提交
  30. 04 4月, 2023 1 次提交