[1]CHENG Deqiang,MA Shang,KOU Qiqi,et al.Target detection algorithm for improving feature fusion and global perception based on YOLOv4[J].CAAI Transactions on Intelligent Systems,2024,19(2):325-334.[doi:10.11992/tis.202207018]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 2
Page number:
325-334
Column:
学术论文—机器感知与模式识别
Public date:
2024-03-05
- Title:
-
Target detection algorithm for improving feature fusion and global perception based on YOLOv4
- Author(s):
-
CHENG Deqiang1; MA Shang1; KOU Qiqi2; ZHANG Haoxiang1; QIAN Jiansheng1
-
1. School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China;
2. School of Computer Science & Technology, China University of Mining and Technology, Xuzhou 221116, China
-
- Keywords:
-
YOLOv4; target detection; feature fusion; cross-scale; multiscale variation; global attention; average pooling; contextual information
- CLC:
-
TP391
- DOI:
-
10.11992/tis.202207018
- Abstract:
-
The YOLOv4 algorithm has a good balance in detection speed and accuracy, but there are still drawbacks of inaccurate positioning frame and low detection rate, especially for small detection targets and great changes in scale. Dealing with these problems, a new YOLOv4-based target detection algorithm is developed. The algorithm utilizes an enhanced feature fusion module—PANet combined with the bidirectional feature pyramid network instead of PANet to increase cross-scale connections, introduce weights at the output to improve the expressiveness of important features and solve accuracy degradation as a result of multiscale changes. Afterward, a new global association network is adopted to improve the output of the Sigmoid function while reducing the average pooling and computation, strengthen the model’s learning of the contextual relationship of the target, and reduce noise interference and global information loss. The RSOD and NWPU VHR-10 datasets are employed here, with average detection accuracies being enhanced by about 5% and 3%, respectively; the generalization experiment uses the VOC2007 + 2012 public dataset, with the average detection accuracy being enhanced by about 0.6%. The experimental results reveal that the improved algorithm can effectively enhance the detection ability of the model.