1. 14 Jul, 2023 1 commit
    • [AutoTuner] Distribute best cfg (#54834) · 7f6d222f
      Committed by caozhou
      * distribute best cfg
      
      * adapt to multi args transmission
      
      * update metric extracting
      
      * fix bugs of prune and reading log
      
      * fix time default value
      
      * remove time record
      
      * adjust the order of searching dim
      
      * fix prune bugs
      
      * fix adding cfg bug
      
      * fix multi nodes bug
      
      * reset status
      
      * remove alarm and set logdir
      
      * deepcopy ctx
      
      * change alarm
      
      * fix restart bug
      
      * add exit
      
      * best no need alarm
      
      * add warmup time
  2. 13 Jul, 2023 7 commits
  3. 11 Jul, 2023 7 commits
    • support sharding parallel (#54634) · b7a05057
      Committed by pangengzheng
      * support sharding parallel
      
      * fix name
      
      * fix
      
      * update
      
      * test amp for sharding
      
      ---------
      
      Co-authored-by: pangengzheng <pangengzheng.baidu.com>
    • DOCS: Adding information about datatype in math.py (#55297) · ab73b8c6
      Committed by Muhammad Ishaque Nizamani
      * DOCS: Adding information about datatype in math.py
      
      * replaced uint16 with bfloat16.
    • Pipeline pass base (#55174) · 5434560a
      Committed by Wennie396
      * format correction
      
      * variable names adjustment
      
      * variable names adjustment, name-->type, value-->sub_program
    • replace the AdagradOptimizer... · 94365855
      Committed by LoneRanger
      replace the AdagradOptimizer, AdamaxOptimizer, AdadeltaOptimizer, RMSPropOptimizer, LambOptimizer and Momentum (#54152)
      
      * replace the AdadeltaOptimizer with Adadelta
      
      * replace the RMSPropOptimizer with RMSProp
      
      * replace the LambOptimizer with Lamb
      
      * replace the momentum in contrib/optimizer.py with Momentum in python/paddle/optimizer/momentum.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug of Lamb
      
      * fix bug of Lamb
      
      * fix bug of import
      
      * replace the AdamaxOptimizer with Adamax and change the optimizer base for AdagradOptimizer
      
      * fix bug
      
      * fix bug
      
      * Update optimizer.py
      
      * fix bug
      
      * fix bug
    • Integrate rmsnorm kernel (#54998) · 97d3d6ee
      Committed by MarDino
      * add rmsnorm kernel
      * add static graph test
      * fix round type
      * use alignas to avoid msvc compile error
      * remove redundant headerfile to avoid rocm compile error
      * fix rocm compile not found cub
      * Add document
    • Linear compress (#55128) · f4290a92
      Committed by FormlessUnit
      * rename weight_only/llm.int8
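    • Contest task 7: develop the grad_fn and next_functions APIs and expose them to Python, v1 (#54838) · ab46b14c
      Committed by qiuwenbo
      * [Experiment] add an attribute to tensor whose value is the constant 1
      
      * expose gradnode and add a new gradnode method (for testing) that is exposed to Python and accessible from the Python side
      
      * develop the grad_fn and next_functions APIs and expose them to Python; apply some standardization
      
      * add a unit test
      
      * improve code style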
  4. 10 Jul, 2023 3 commits
  5. 07 Jul, 2023 2 commits
  6. 06 Jul, 2023 7 commits
  7. 05 Jul, 2023 5 commits
  8. 03 Jul, 2023 6 commits
  9. 30 Jun, 2023 2 commits