New Features
- 在有tensorcore的pass前添加nchw->nchw4的pass
- NCHW→NCHW4转换pass增加对Pooling/WarpPerspective/Resize等op的支持
- 增加加载pretrained的int8模型,再dump的支持
Bug Fix
- 修复当一个进程中存在多个 dataloader 实例时,MGE_PLASMA_STORE_MANAGER销毁行为不正确的问题
- 让FakeQuantize和Observer能够针对weight和activation使用不同的qmin,避免极端情况下出现数值溢出
- 修复mgb.opr.deformable_psroi_pooling 实现错误
- 修复cuda int8 nchw4支持channel小于4的问题
- 修复网络搭建文档中的拼写错误 @ztjryg4
Thanks to our Contributors
- 本次release非常感谢@ztjryg4 提交PR,期待更多的开发者一起共建MegEngine!
New Features
- Insert nchw->nchw4 pass before tensorcore pass.
- NCHW→NCHW4 pass supports ops such as Pooling/WarpPerspective/Resize.
- Pretrained int8 models can now firstly be loaded and then be dumped.
Bug Fix
- Fix MGE_PLASMA_STORE_MANAGER was destroyed incorrectly when there were multiple dataloader instances in a process.
- Allow FakeQuantize and Observer to use different qmin in terms of different weights and activation to avoid numerical overflow in extreme cases
- fix implementation mistakes in mgb.opr.deformable_psroi_pooling
- Fix cuda int8 nchw4 support channel less than 4
- Fix typo in network_build @ztjryg4
Thanks to our Contributors
- A kind acknowledgement to PR lodged by @ztjryg4 , and we are genuinely welcoming more developers to co-build MegEngine!