提交 24b2d14e 编写于 作者: X Xin Pan

fix

上级 bf59d622
# Parallelism, Asynchronous, Synchronous, Codistillation
[TOC]
For valuable models, it’s worth using more hardware resources to reduce the training time and improve the final model quality. This doc discuss various solutions, their empirical results and some latest researches.
# Model Parallelism
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册