1. 11 Jul, 2023 3 commits
    • Integrate rmsnorm kernel (#54998) · 97d3d6ee
      Committed by MarDino
      * add rmsnorm kernel
      * add static graph test
      * fix round type
      * use alignas to avoid msvc compile error
      * remove redundant header file to avoid ROCm compile error
      * fix ROCm compile error (cub not found)
      * add documentation
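For orientation, the operation named in this commit can be sketched in plain Python. This is the textbook RMSNorm definition only, not the kernel added in #54998; the function name `rms_norm`, its parameters, and the default `eps` are assumptions made for illustration.

```python
import math


def rms_norm(x, weight, eps=1e-6):
    """Reference RMSNorm over one row: x / sqrt(mean(x^2) + eps) * weight.

    Hypothetical helper for illustration; not the API of the kernel in the commit.
    """
    if len(x) != len(weight):
        raise ValueError("x and weight must have the same length")
    # Mean of squares over the normalized (last) dimension.
    mean_sq = sum(v * v for v in x) / len(x)
    # Reciprocal root-mean-square; eps guards against division by zero.
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    # Scale each element by the shared factor and its learned weight.
    return [v * inv_rms * w for v, w in zip(x, weight)]
```

With a weight of all ones and `eps=0.0`, the output row has a mean square of 1, which is a quick sanity check for any faster implementation.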
    • Linear compress (#55128) · f4290a92
      Committed by FormlessUnit
      * rename weight_only/llm.int8
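    • Contest task 7: develop the grad_fn and next_functions APIs and expose them to Python, v1 (#54838) · ab46b14c
      Committed by qiuwenbo
      * [Experiment] add an attribute to tensor whose value is the constant 1
      
      * expose gradnode, and add a new gradnode method (for testing) that is accessible from the Python side
      
      * develop the grad_fn and next_functions APIs and expose them to Python; apply some normalization
      
      * add a unit test
      
      * improve code style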
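The commit above names two attributes, `grad_fn` and `next_functions`. The toy graph below sketches what such attributes conventionally expose: a tensor points at the backward node that produced it, and each node points at the backward nodes of its inputs. Every class and function here (`GradNode`, `Tensor`, `add`, `mul`, `walk`) is hypothetical and written for illustration; it is not the implementation from #54838, and the real API may differ in return types and in how leaf tensors are represented.

```python
class GradNode:
    """Toy backward node; `next_functions` lists the backward nodes of its inputs."""

    def __init__(self, name, inputs=()):
        self.name = name
        self._inputs = tuple(inputs)

    @property
    def next_functions(self):
        # In this sketch a leaf input is represented by None.
        return self._inputs


class Tensor:
    """Toy tensor; `grad_fn` is None for leaves created directly by the user."""

    def __init__(self, value, grad_fn=None):
        self.value = value
        self.grad_fn = grad_fn


def add(a, b):
    return Tensor(a.value + b.value, GradNode("AddGrad", [a.grad_fn, b.grad_fn]))


def mul(a, b):
    return Tensor(a.value * b.value, GradNode("MulGrad", [a.grad_fn, b.grad_fn]))


def walk(node):
    """Return the names of all backward nodes reachable from `node`, depth first."""
    if node is None:
        return []
    names = [node.name]
    for nxt in node.next_functions:
        names.extend(walk(nxt))
    return names


x = Tensor(2.0)
y = Tensor(3.0)
z = add(mul(x, y), x)  # z = x * y + x
```

Here `z.grad_fn` is the `AddGrad` node, and following `next_functions` from it reaches the `MulGrad` node that produced the first operand.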
  2. 10 Jul, 2023 3 commits
  3. 07 Jul, 2023 2 commits
  4. 06 Jul, 2023 7 commits
  5. 05 Jul, 2023 5 commits
  6. 03 Jul, 2023 6 commits
  7. 30 Jun, 2023 4 commits
  8. 29 Jun, 2023 5 commits
  9. 28 Jun, 2023 5 commits
    • remove nets.py in fluid (#51717) · 87f72107
      Committed by LoneRanger
      * remove function sequence_conv_pool
      
      * fix test_glu.py
      
      * remove function scaled_dot_product_attention
      
      * remove function img_conv_group
      
      * remove function simple_img_conv_pool
      
      * delete the nets.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * Update test_image_classification.py
      
      * Update test_recognize_digits.py
      
      * Update test_recommender_system.py
      
      * Update test_image_classification_fp16.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * remove the nets.py in fluid
      
      * fix bug
      
      * fix the codestyle
      
      * fix conflict
      
      * fix bug
      
      * Fix TODO
    • [Inplace] Add copy for inplace (#54683) · 98debaa8
      Committed by GGBond8488
      * add clone for inplace
      
      * fix name
      
      * add inplace pow
      
      * fix typo
      
      * add note
      
      * fix typo
      
      * fix typo
      
      * fix bug
      
      * fix test error
      
      * add type error test
      
      * adjust indentation
    • add gc for multi jobs (#54897) · fcffd84d
      Committed by zhaoyingli
      * add gc for multi jobs
      
      * fix job.h
      
      * update OpInfo to OpInOutInfo
      
      * update get_skip_gc_vars algo order
    • Add set_lr_scheduler api (#54752) · 99c593bc
      Committed by zqw_1997
      * demo1
      
      * add test cases
      
      * modify the usage of StepDecay
      
      * refine
    • replace PiecewiseDecay, StepDecay, MultiStepDecay, LambdaDecay with 2.0 version (#53992) · 63f242b6
      Committed by LoneRanger
      * replace PiecewiseDecay(LearningRateDecay) with PiecewiseDecay(LRScheduler)
      
      * fix bug
      
      * fix bug
      
      * replace the StepDecay,MultiStepDecay,LambdaDecay with 2.0 version
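As a reminder of what two of the schedulers named in this commit compute, here is a framework-free sketch of the usual step-decay and multi-step-decay rules. The function names and signatures are assumptions for illustration and are not Paddle's classes; the real `LRScheduler` subclasses carry state and are advanced with `step()` rather than being called with an epoch.

```python
import bisect


def step_decay(base_lr, step_size, gamma, epoch):
    """Multiply the rate by `gamma` once every `step_size` epochs."""
    return base_lr * gamma ** (epoch // step_size)


def multi_step_decay(base_lr, milestones, gamma, epoch):
    """Multiply the rate by `gamma` once for each milestone already reached.

    `milestones` must be sorted in increasing order.
    """
    # bisect_right counts milestones <= epoch, so the decay applies
    # starting at the milestone epoch itself.
    reached = bisect.bisect_right(milestones, epoch)
    return base_lr * gamma ** reached
```

For example, with `base_lr=0.5`, `step_size=5`, and `gamma=0.1`, the rate stays at 0.5 for epochs 0 through 4 and is multiplied by 0.1 from epoch 5 onward.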