[1]WANG Fengsui,CHEN Jingang,WANG Qisheng,et al.Multi-scale target detection algorithm based on adaptive context features[J].CAAI Transactions on Intelligent Systems,2022,17(2):276-285.[doi:10.11992/tis.202101029]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
17
Number of periods:
2022 2
Page number:
276-285
Column:
学术论文—机器学习
Public date:
2022-03-05
- Title:
-
Multi-scale target detection algorithm based on adaptive context features
- Author(s):
-
WANG Fengsui1; 2; 3; CHEN Jingang1; 2; 3; WANG Qisheng1; 2; 3; LIU Furong1; 2; 3
-
1. School of Electrical Engineering, Anhui Polytechnic University, Wuhu 241000, China;
2. Anhui Key Laboratory of Detection Technology and Energy Saving Devices, Wuhu 241000, China;
3. Key Laboratory of Advanced Perception and Intelligent Control of High-end Equipment, Ministry of Education, Wuhu 241000, China
-
- Keywords:
-
machine vision; target detection; convolution neural network; channel attention; parallel empty convolution; multi-scale feature fusion; contextual feature; deep learning
- CLC:
-
TP391.4
- DOI:
-
10.11992/tis.202101029
- Abstract:
-
Multi-scale target recognition is a challenge in any detection task. Aiming at the multi-scale problem in detection, a multi-scale target detection algorithm with adaptive context features is proposed. A multi-receptive field feature extraction network was constructed to solve the problem wherein targets of different scales require different receptive field features to be recognized. Using multi-branch parallel void convolution, contextual information in tags was extracted from high-level semantic features. Based on an improved channel attention mechanism, an adaptive feature fusion network was proposed to solve the problem wherein the semantic features of different scale targets appear in feature maps of different resolutions. The local location features were fused into global semantic features by learning the correlation between feature maps of different resolutions. The feature maps of different scales were used to identify objects of different scales. The proposed algorithm was verified on a Pascal VOC data set; the detection accuracy of the proposed method reached 85.74%, which was approximately 8.7% higher than the Faster R-CNN and about 2.06% higher than the baseline detection algorithm YOLOV3 +.