New Features

  • Insert an NCHW→NCHW4 conversion pass before the TensorCore pass.
  • The NCHW→NCHW4 conversion pass now supports ops such as Pooling, WarpPerspective, and Resize.
  • Pretrained int8 models can now be loaded and then dumped again.
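
For readers unfamiliar with the layout, the NumPy sketch below illustrates what an NCHW→NCHW4 conversion does to a tensor: channels are packed into groups of 4, and the group of 4 becomes the innermost dimension. The helper name `nchw_to_nchw4` is hypothetical and not a MegEngine API, and zero-padding the channel dimension up to a multiple of 4 is an assumption made for illustration. The actual pass rewrites the computing graph rather than calling a function like this.

```python
import numpy as np

def nchw_to_nchw4(x):
    """Hypothetical helper: repack an (N, C, H, W) array as (N, ceil(C/4), H, W, 4)."""
    n, c, h, w = x.shape
    pad = (-c) % 4  # channels missing to reach a multiple of 4
    if pad:
        # Assumption: missing channels are filled with zeros.
        x = np.pad(x, ((0, 0), (0, pad), (0, 0), (0, 0)))
    groups = (c + pad) // 4
    # Split C into (groups, 4), then move the 4-wide axis to the end.
    return x.reshape(n, groups, 4, h, w).transpose(0, 1, 3, 4, 2)

x = np.arange(2 * 3 * 2 * 2, dtype=np.int8).reshape(2, 3, 2, 2)
y = nchw_to_nchw4(x)
print(y.shape)
```

Here the input has only 3 channels, so one zero-filled channel is appended before packing.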

Bug Fix

  • Fix incorrect destruction of MGE_PLASMA_STORE_MANAGER when a process contains multiple dataloader instances.
  • Allow FakeQuantize and Observer to use different qmin values for weights and activations, avoiding numerical overflow in extreme cases.
  • Fix the incorrect implementation of mgb.opr.deformable_psroi_pooling.
  • Fix CUDA int8 NCHW4 support for channel counts smaller than 4.
  • Fix typos in the network_build documentation @ztjryg4
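
The effect of giving weights and activations different qmin values can be sketched with a generic fake-quantization routine. This is a minimal NumPy illustration, not MegEngine's FakeQuantize implementation: the function `fake_quantize` is hypothetical, and the ranges used here (-127 for weights, -128 for activations) are assumptions chosen only to show that the lower clipping bound changes the result.

```python
import numpy as np

def fake_quantize(x, scale, qmin, qmax):
    """Hypothetical helper: quantize to integers in [qmin, qmax], then dequantize."""
    q = np.clip(np.round(x / scale), qmin, qmax)
    return q * scale

scale = 1.0 / 127
x = np.array([-2.0, -1.0, 1.0, 2.0])

# Assumed ranges for illustration: a symmetric range for weights,
# the full int8 range for activations.
as_weight = fake_quantize(x, scale, qmin=-127, qmax=127)
as_activation = fake_quantize(x, scale, qmin=-128, qmax=127)

print(as_weight)
print(as_activation)
```

The two results differ only for inputs that fall below the weight range, where the activation range allows one extra quantization level.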

Thanks to our Contributors

  • Many thanks to @ztjryg4 for the PR submitted in this release. We warmly welcome more developers to help build MegEngine!
