[3D-Parallel:Sharding] Optimizations for supporting ERNIE 3.0 training (#31884)
Showing
paddle/fluid/framework/distributed_strategy.proto
100644 → 100755
python/paddle/fluid/backward.py
100644 → 100755
python/paddle/fluid/tests/unittests/test_dist_base.py
100644 → 100755
想要评论请 注册 或 登录