1. 20 3月, 2023 21 次提交
  2. 19 3月, 2023 3 次提交
  3. 18 3月, 2023 1 次提交
  4. 17 3月, 2023 9 次提交
  5. 16 3月, 2023 6 次提交
    • H
      [Custom Operator] Custom op support inplace mechanism (#51620) · f824bc0d
      HongyuJia 提交于
      * init unit test commit, contains register thinking
      
      * support inplace
      
      * get inplaced x.grad
      
      * Try support inplace and hook at the same time
      
      * Support inplace, need debug
      
      * Support inplace successfully
      
      * Inplace use Tensor&, consistent with Tensor*
      
      * fix MapPlainOutputs bug
      
      * fix double grad inplace error
      f824bc0d
    • C
      rename flash_attn_raw to flash_attn_unpadded (#51704) · 0b778bdc
      Chitsing KUI 提交于
      * rename flash_attn_raw to flash_attn_unpadded
      
      * fix static api
      
      * fix static return
      0b778bdc
    • X
      Add Deformable Conv Dynamic Shape Support (#50698) · 86bf8274
      xjmxyt 提交于
      * add dynamic support
      
      * add more test
      
      * fix bug
      
      * change test
      
      * change test
      86bf8274
    • shaojie_wang's avatar
      add fp32 grad plus fp16 param in adamw (#51141) · 290aa368
      shaojie_wang 提交于
      * add fp32 grad plus fp16 param in adamw
      
      * add python UT
      
      * fix test case
      
      * in test_adamw_op py file, force the moment2 value LE 0
      
      * add a compare option
      
      * remove bf16 fused adam kernel case
      290aa368
    • H
      Update from_blob API (#51646) · c07c7712
      Huang Jiyi 提交于
      * remove contexts in tensor_utils
      
      * update from_blob
      
      * update from_blob
      
      * update from_blob
      
      * fix bug
      
      * fix bug
      c07c7712
    • J
      [Auto Parallel Performance] Support BF16 Training (#51285) · 9ded5707
      JZ-LIANG 提交于
      * update env setting
      
      * update pass logic
      
      * dist op support bf16
      
      * backward cast update
      
      * update setting
      
      * update backward
      
      * revert amp pass
      
      * update fp16 backward logic
      
      * register c_embedding bf16
      
      * revert engine
      
      * add unitest
      
      * add unitest
      
      * update unitest
      
      * update cmake
      
      * update math
      
      * update math.py
      
      * update unitest
      
      * update unitest
      
      * revise unitest
      
      * revise unitest
      
      * update unitest
      
      * update unitest
      
      * update unitest
      9ded5707