    [Auto Parallel Performance] Support BF16 Training (#51285) · 9ded5707
    Committed by JZ-LIANG
    * update env setting
    
    * update pass logic
    
    * dist op support bf16
    
    * backward cast update
    
    * update setting
    
    * update backward
    
    * revert amp pass
    
    * update fp16 backward logic
    
    * register c_embedding bf16
    
    * revert engine
    
    * add unittest
    
    * add unittest
    
    * update unittest
    
    * update cmake
    
    * update math
    
    * update math.py
    
    * update unittest
    
    * update unittest
    
    * revise unittest
    
    * revise unittest
    
    * update unittest
    
    * update unittest
    
    * update unittest
parallelizer_v2.py 13.8 KB