[IPU] Decoupling ipu sharding and modeling (#43164)
* Decoupling ipu sharding and modeling (#665)
* feat(shard): decoupling shard setting with modeling.
* fix(shard): split test cases to avoid failure.
* fix(shard): add function docs and fix typo.
* test(shard): add tests.
* test(shard): more test case.
* fix(): change ipu_index/stage default value to -1.
* fix format
Co-authored-by: Nczr-gc <96037699+czr-gc@users.noreply.github.com>
Showing
想要评论请 注册 或 登录