• shaojie_wang's avatar
    add fp32 grad plus fp16 param in adamw (#51141) · 290aa368
    shaojie_wang 提交于
    * add fp32 grad plus fp16 param in adamw
    
    * add python UT
    
    * fix test case
    
    * in test_adamw_op py file, force the moment2 value LE 0
    
    * add a compare option
    
    * remove bf16 fused adam kernel case
    290aa368
fused_adam_kernel.cu 19.7 KB