1. 23 Feb, 2022 (8 commits)
  2. 22 Feb, 2022 (18 commits)
  3. 21 Feb, 2022 (9 commits)
  4. 20 Feb, 2022 (1 commit)
  5. 19 Feb, 2022 (2 commits)
    • Enabled test_matmul_v2_op for final state Eager Dygraph (#39504) · 77625d7d
      Committed by Zhanlue Yang
      * Enabled test_matmul_v2_op for final state Eager Dygraph
      
      * Fixed minor issue
      
      * Fixed format issue
    • Add the DistributedFusedLamb optimizer (#39148) · 5df3cd61
      Committed by sneaxiy
      * add DistributedFusedLamb op
      
      * polish code
      
      * fix compile error
      
      * make compatible with pten changes
      
      * fix ROCm compile error
      
      * improve coverage
      
      * update upstream/develop
      
      * fix cast_with_ptr.h
      
      * add FLAGS_distributed_lamb_divide_nranks_when_allreduce=1
      
      * fix clip before allreduce
      
      * add use_master_param_norm
      
      * code polish
      
      * fix bug
      
      * fix ROCm CI
  6. 18 Feb, 2022 (2 commits)
    • bug fix (#39630) · bbf31a4e
      Committed by zhaoyingli
    • [AMP] support GPU BF16 amp for dygraph (#39029) · 7d6d3848
      Committed by zhangbo9674
      * support dtype param for auto_cast
      
      * add amp_dtype for tracer
      
      * add unsupported bf16 list
      
      * support bf16 amp for O2
      
      * refine python interface for bfloat16
      
      * refine code
      
      * refine code
      
      * refine unittest
      
      * refine code
      
      * refine code
      
      * add bf16 O1
      
      * refine code per review comments
      
      * add gradient accumulator
      
      * add recompute