1. 07 Mar, 2023 1 commit
  2. 06 Mar, 2023 3 commits
  3. 02 Mar, 2023 1 commit
  4. 01 Mar, 2023 1 commit
    • Integration flash attention (#49869) · 61611786
      Chitsing KUI committed
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unit test; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
  5. 28 Feb, 2023 2 commits
  6. 27 Feb, 2023 6 commits
  7. 24 Feb, 2023 2 commits
  8. 20 Feb, 2023 1 commit
  9. 17 Feb, 2023 3 commits
  10. 16 Feb, 2023 2 commits
  11. 14 Feb, 2023 1 commit
  12. 10 Feb, 2023 1 commit
  13. 07 Feb, 2023 2 commits
  14. 06 Feb, 2023 3 commits
  15. 03 Feb, 2023 2 commits
  16. 02 Feb, 2023 4 commits
  17. 01 Feb, 2023 2 commits
    • remove fluid.initializer.UniformInitializer, ConstantInitializer,... · 6edc7bba
      zqw_1997 committed
      remove fluid.initializer.UniformInitializer, ConstantInitializer, NormalInitializer, TruncatedNormalInitializer, XavierInitializer, BilinearInitializer, MSRAInitializer, NumpyArrayInitializer and calculate_gain (#49498)
      
      * move UniformInitializer and ConstantInitializer
      
      * more modify
      
      * circular import resolved
      
      * another circular import resolved?
      
      * more circular import 2
      
      * circular import 3
      
      * change import paddle in metric.py
      
      * BuildStrategy import from fluid
      
      * modify the framework import path in common.py
      
      * change rnn.py import, from static to original framework
      
      * change import static in the nn folder
      
      * default_main_program should import from common_ops_import
      
      * add import paddle in param_attr.py
      
      * use core not paddle module for using VarDesc
      
      * another old uniform
      
      * fix mistaken use of Uniform instead of UniformInitializer
      
      * modify UniformInitializer doc
      
      * move fluid.NormalInitializer to nn.initializer.NormalInitializer
      
      * remove import of Normal in fluid.layers.nn.py
      
      * remove more import of old Normal
      
      * remove more import of old Normal
      
      * sample code modify and tests modify import
      
      * is_listen_failed passing arg should be log file
      
      * problem solved
      
      * a mistake solved
      
      * comments resolved and remove paddle.fluid.initializer.TruncatedNormalInitializer
      
      * remove paddle.fluid.initializer.XavierInitializer and paddle.fluid.initializer.MSRAInitializer
      
      * remove paddle.fluid.initializer.BilinearInitializer NumpyArrayInitializer and set_global_initializer
      
      * change fluid to static
      
      * change static to fluid to avoid circular import in distributed_strategy.py
      
      * fix example code and test_initializer
      
      * ValueType
      
      * sample code fix
      
      * change set_global_initializer back to fluid
      
      * put paddle.static.BuildStrategy.ReduceStrategy into the function to avoid circular import
      
      * remove calculate_gain, delete BilinearInitializer and revert set_global_initializer
      
      * use UniformInitializer, ConstantInitializer, NormalInitializer, TruncatedNormalInitializer, XavierInitializer, MSRAInitializer, NumpyArrayInitializer as little as possible
      
      * fix argument incompatible
      
      * fix more arg incompatible
      
      * fix test_prelu_op_xpu.py Constant
      
      * fix inaccurate doc
      
      * more doc fix: default value
    • add clip_grad_norm_ API (#49935) · 0855d982
      zxcd committed
      * add clip_grad_norm_ api.
      
      * fix docs and some details according to the comments.
      
      * fix code style.
      
      * fix no_grad problem, and fix doc.
      
      * fix code style.
      
      * fix doc and remove type information
  18. 31 Jan, 2023 3 commits