v1.1.0 · 标签 · MegEngine 天元 / MegEngine

v1.1.0

问题修复

Fixed static shape inference in trace to allow training larger models
Link io-opr in trace to avoid deadlock
Fixed cd4 conversion error when group=1 in convolution and some cases in elemwise
Fixed the problem of shape matching when the bias shape is fixed in fuse conv bias optpass
Fixed LOG mode of elemwise in MKL calculation abnormal
Fix the error processing when load_and_run --input does not specify the correct input name for a single input

Support representation of scalar-type tensor
Enable users to control error check during asynchronous execution by parameter async_level
Add operators including group_norm, instance_norm and layer_norm, conv1d and remap
Use weakref for GradManger.attach
Support distributed quantize aware training
After weight preprocessing, release the original weight memory during inference
Support Elemwise and DimShuffle operators in JIT of mlir backend
Support DCT operator in cv

Reduce host overhead for operators including batch normalization, elementwise, and broadcast
Improve performance of the step function in optimizers
Improve performance of quantization training
Optimize arm64 int8X8X16_mk4_k8x8x8 matmul operator