[1]JIANG Wentao,YOU Zhuocheng,ZHANG Shengchong.Dynamic mask convolution for image classification networks[J].CAAI Transactions on Intelligent Systems,2026,21(2):423-434.[doi:10.11992/tis.202503019]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
21
Number of periods:
2026 2
Page number:
423-434
Column:
学术论文—机器感知与模式识别
Public date:
2026-05-16
- Title:
-
Dynamic mask convolution for image classification networks
- Author(s):
-
JIANG Wentao1; YOU Zhuocheng1; ZHANG Shengchong2
-
1. College of Software, Liaoning Technology University, Huludao 125105, China;
2. Science and Technology on Electro-Optical Information Security Control Laboratory, Tianjin 300308, China
-
- Keywords:
-
image classification; masking mechanism; residual networks; dynamic mask convolution; dilated convolution; attention mechanism; feature fusion; feature extraction
- CLC:
-
TP391
- DOI:
-
10.11992/tis.202503019
- Abstract:
-
Aiming at the problems of traditional image classification methods in complex scenes, such as weak feature adaptability, limited ability to capture multi-scale information, and insufficient ability to express detailed features, an image classification network based on dynamic mask convolution is proposed. Firstly, the multi-branch mask convolution fusion module is designed, which combines the multi-branch structure with the dynamic mask mechanism to realize the fusion of different scale information, and dynamically selects and strengthens the key features according to the context information of the input image, so as to improve the feature extraction ability of the network. Secondly, the adaptive enhancement module is introduced in the residual learning, and the feature weights are adaptively adjusted by integrating the pixel-level and channel level attention mechanisms to accurately capture the important details in the image. Through experiments on CIFAR-10, CIFAR-100, SVHN, Imagenette, and Imagewoof datasets, the classification accuracy of 96.85%, 82.39%, 97.88%, 93.35% and 85.93% respectively, which is significantly better than the traditional image classification methods. The network can show excellent and stable classification performance in the face of diverse image features and complex scenes, and provides a new idea for the application of deep learning in the field of image classification.