<-上一篇/Previous Article 下一篇/Next Article->

[1]储文娟,李震,黄炜嘉,等.面向失效增强和改进YOLOv8的目标检测[J].智能系统学报,2026,21(2):353-364.[doi:10.11992/tis.202503010]
　CHU Wenjuan,LI Zhen,HUANG Weijia,et al.A failure enhancement and improvement of YOLOv8 for target detection[J].CAAI Transactions on Intelligent Systems,2026,21(2):353-364.[doi:10.11992/tis.202503010]

点击复制

面向失效增强和改进YOLOv8的目标检测

PDF下载 HTML

《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷: 21 期数: 2026年第2期页码: 353-364 栏目: 学术论文—机器学习出版日期: 2026-03-05

Title:: A failure enhancement and improvement of YOLOv8 for target detection

作者:: 储文娟, 李震, 黄炜嘉, 王宇轩; 江苏科技大学海洋学院, 江苏镇江 212003

Author(s):: CHU Wenjuan, LI Zhen, HUANG Weijia, WANG Yuxuan; Ocean College, Jiangsu University of Science and Technology, Zhenjiang 212003, China

关键词:: 计算机视觉; 复杂环境; 目标检测; YOLO; 图像增强; 注意力机制; 特征融合; 损失函数

Keywords:: computer vision; complex environment; object detection; YOLO; image enhancement; attention mechanism; feature fusion; loss function

分类号:: TP391.41

DOI:: 10.11992/tis.202503010

摘要:: 针对当前在光照、天气、遮挡等复杂背景条件下进行目标检测技术的检测性能较低、泛化能力弱等问题，文章提出一种基于失效增强和改进YOLOv8的目标检测算法（asymptotic structure of YOLO, AS_YOLO）。1）基于复杂场景构建了多种目标单元数据集，并设计面向应用环境的图像失效增强技术；2）引入通道–空间并行注意力机制同时关注复杂环境下目标的特征信息与位置信息；3）采用AFPN结构强化非相邻层级的特征融合效果；4）采用了Inner_IoU(inner intersection over union)损失函数改善现有IoU(intersection over union)损失函数，在不同检测任务中的泛化能力不足的问题，并在WSODD多目标数据集下进行迁移实验。实验结果表明，改进后的算法与基线模型YOLOv8n相比，mAP_0.5达到了94.0%，提升12.5百分点，mAP_0.95达到了72.5%，提升15.7百分点，具有更好的检测性能。

Abstract:: To address the issues of low detection performance and weak generalization ability in target detection under complex background conditions such as illumination, weather, and occlusion, this paper proposes an improved object detection algorithm based on failure augmentation and enhanced YOLOv8 (AS_YOLO). First, a variety of target unit datasets were constructed based on complex military scenarios, and an image failure augmentation technique tailored to the application environment was developed. Second, a channel-spatial parallel attention mechanism was introduced to simultaneously focus on feature and position information of targets in complex environments. Then, the AFPN structure was used to enhance feature fusion of non-adjacent hierarchical layers. Finally, the Inner_IoU loss function was adopted to address the generalization limitations of existing IoU loss functions in different detection tasks. Transfer experiments were conducted on the WSODD multi-target dataset. The experimental results show that the improved algorithm achieves an mAP_0.5 of 94.0%, a 12.5 percentage point improvement over the baseline YOLOv8n model, and an mAP_0.95 of 72.5%, a 15.7 percentage point improvement, indicating superior detection performance.

参考文献/References:: [1] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[J]. Communications of the ACM, 2017, 60(6): 84-90
[2] REN Shaoqing, HE Kaiming, GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 39(6): 1137-1149
[3] REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 779-788.
[4] HUANG Xiaochen, WANG Xiaofeng, TENG Qizhi, et al. Degradation type-aware image restoration for effective object detection in adverse weather[J]. Sensors, 2024, 24(19): 6330
[5] WANG Zhipan, LIU Di, WANG Zhongwu, et al. A new remote sensing change detection data augmentation method based on mosaic simulation and haze image simulation[J]. IEEE journal of selected topics in applied earth observations and remote sensing, 2023, 16: 4579-4590
[6] WU Junjun, RAO Yunbo, ZENG Shaoning, et al. Pre-trained SAM as data augmentation for image segmentation[J]. CAAI transactions on intelligence technology, 2025, 10(1): 268-282
[7] 肖晶晶, 樊博彦, 杨雨婷. 雾环境下的船舶目标检测研究[J]. 重庆理工大学学报(自然科学), 2024, 38(3): 212-219 XIAO Jingjing, FAN Boyan, YANG Yuting. Research on ship object detection in foggy environments[J]. Journal of Chongqing University of Technology (natural science), 2024, 38(3): 212-219
[8] 马淦, 谷雨, 彭冬亮. 结合改进YOLOv5s和动态数据增强的海面舰船检测[J]. 计算机工程, 2025, 51(9): 294-305 MA Gan, GU Yu, PENG Dongliang. Combining improved YOLOv5s and dynamic data augmentation for sea surface ship detection[J]. Computer engineering, 2025, 51(9): 294-305
[9] FAN Pan, ZHENG Chusan, SUN Jin, et al. Enhanced real-time target detection for picking robots using lightweight CenterNet in complex orchard environments[J]. Agriculture, 2024, 14(7): 1059
[10] 邢汇源, 崔亚奇, 王子玲, 等. 复杂海况下的海上船舶目标检测算法[J]. 现代防御技术, 2024, 52(6): 88-96 XING Huiyuan, CUI Yaqi, WANG Ziling, et al. Target detection algorithm for ships at sea under complex sea conditions[J]. Modern defence technology, 2024, 52(6): 88-96
[11] 张国印, 王传博, 高伟. 抗遮挡的行人多目标跟踪算法[J]. 智能系统学报, 2024, 19(5): 1248-1256 ZHANG Guoyin, WANG Chuanbo, GAO Wei. Pedestrian multiobject tracking algorithm with anti-occlusion[J]. CAAI transactions on intelligent systems, 2024, 19(5): 1248-1256
[12] LYU Yunkai, YANG Xiaobing, GUAN Ai, et al. Construction personnel dress code detection based on YOLO framework[J]. CAAI transactions on intelligence technology, 2024, 9(3): 709-721
[13] 吴攀超, 郑卓纹, 王婷婷, 等. 基于CF-YOLO的雾霾交通标志识别[J]. 计算机工程与设计, 2024, 45(7): 2203-2211 WU Panchao, ZHENG Zhuowen, WANG Tingting, et al. Foggy traffic sign recognition based on CF-YOLO[J]. Computer engineering and design, 2024, 45(7): 2203-2211
[14] 赵文清, 康怿瑾, 赵振兵, 等. 改进YOLOv5s的遥感图像目标检测[J]. 智能系统学报, 2023, 18(1): 86-95 ZHAO Wenqing, KANG Yijin, ZHAO Zhenbing, et al. A remote sensing image object detection algorithm with improved YOLOv5s[J]. CAAI transactions on intelligent systems, 2023, 18(1): 86-95
[15] 许迪, 张淑卿, 葛超. 面向复杂环境的YOLOv8安全装备检测[J]. 电子测量技术, 2024, 47(7): 121-129 XU Di, ZHANG Shuqing, GE Chao. YOLOv8 security equipment inspection for complex environments[J]. Electronic measurement technology, 2024, 47(7): 121-129
[16] WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[C]//Computer Vision–ECCV 2018. Cham: Springer, 2018: 3-19.
[17] YANG Guoyu, LEI Jie, ZHU Zhikuan, et al. AFPN: asymptotic feature pyramid network for object detection[C]//2023 IEEE International Conference on Systems, Man, and Cybernetics. Honolulu: IEEE, 2024: 2184-2189.
[18] ZHANG Hao, XU Cong, ZHANG Shuaijie. Inner-IoU: more effective intersection over union loss with auxiliary bounding box[EB/OL]. (2023-11-06)[2025-03-06]. https://arxiv.org/abs/2311.02877.
[19] WANG Weijun, HOWARD A. Mosaic: mobile segmentation via decoding aggregated information and encoded context[EB/OL]. (2021-12-22)[2025-03-06]. https://arxiv.org/abs/2112.11623.
[20] LIU Shu, QI Lu, QIN Haifang, et al. Path aggregation network for instance segmentation[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 8759-8768.
[21] JADERBERG M, SIMONYAN K, ZISSERMAN A. Spatial Transformer networks[J]. Advances in neural information processing systems, 2015, 28: 1-9
[22] HU Jie, SHEN Li, SUN Gang. Squeeze-and-excitation networks[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 7132-7141.
[23] LIN T Y, DOLL?R P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 936-944.
[24] TAN Mingxing, PANG Ruoming, LE Q V. EfficientDet: scalable and efficient object detection[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2020: 10778-10787.
[25] LIU Songtao, HUANG Di, WANG Yunhong. Learning spatial fusion for single-shot object detection[EB/OL]. (2019-11-21)[2025-03-06]. https://arxiv.org/abs/1911.09516.
[26] LI Chuyi, LI Lulu, JIANG Hongliang, et al. YOLOv6: a single-stage object detection framework for industrial applications[EB/OL]. (2022-09-07)[2025-03-06]. https://arxiv.org/abs/2209.02976.
[27] WANG C Y, BOCHKOVSKIY A, LIAO H M. YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Vancouver: IEEE, 2023: 7464-7475.
[28] WANG Jinwang, YANG Wen, GUO Haowen, et al. Tiny object detection in aerial images[C]//2020 25th International Conference on Pattern Recognition. Milan: IEEE, 2021: 3791-3798.
[29] SELVARAJU R R, COGSWELL M, DAS A, et al. Grad-CAM: visual explanations from deep networks via gradient-based localization[J]. International journal of computer vision, 2020, 128(2): 336-359
[30] ZHOU Zhiguo, SUN Jiaen, YU Jiabao, et al. An image-based benchmark dataset and a novel object detector for water surface object detection[J]. Frontiers in neurorobotics, 2021, 15: 723336

相似文献/References:: [1]夏凡,王宏.基于局部异常行为检测的欺骗识别研究[J].智能系统学报,2007,2(5):12.
　XIA Fan,WANG Hong.Methodologies for deception detection based on abnormal b ehavior[J].CAAI Transactions on Intelligent Systems,2007,2():12.
[2]杨戈,刘宏.视觉跟踪算法综述[J].智能系统学报,2010,5(2):95.
　YANG Ge,LIU Hong.Survey of visual tracking algorithms[J].CAAI Transactions on Intelligent Systems,2010,5():95.
[3]刘宏,李哲媛,许超.视错觉现象的分类和研究进展[J].智能系统学报,2011,6(1):1.
　LIU Hong,LI Zheyuan,XU Chao.The categories and research advances of visual illusions[J].CAAI Transactions on Intelligent Systems,2011,6():1.
[4]叶果,程洪,赵洋.电影中吸烟活动识别[J].智能系统学报,2011,6(5):440.
　YE Guo,CHENG Hong,ZHAO Yang.moking recognition in movies[J].CAAI Transactions on Intelligent Systems,2011,6():440.
[5]史晓鹏,何为,韩力群.采用Hough变换的道路边界检测算法[J].智能系统学报,2012,7(1):81.
　SHI Xiaopeng,HE Wei,HAN Liqun.A road edge detection algorithm based on the Hough transform[J].CAAI Transactions on Intelligent Systems,2012,7():81.
[6]顾照鹏,刘宏.单目视觉同步定位与地图创建方法综述[J].智能系统学报,2015,10(4):499.[doi:10.3969/j.issn.1673-4785.201503003]
　GU Zhaopeng,LIU Hong.A survey of monocular simultaneous localization and mapping[J].CAAI Transactions on Intelligent Systems,2015,10():499.[doi:10.3969/j.issn.1673-4785.201503003]
[7]赵军,於俊,汪增福.基于改进逆向运动学的人体运动跟踪[J].智能系统学报,2015,10(4):548.[doi:10.3969/j.issn.1673-4785.201403032]
　ZHAO Jun,YU Jun,WANG Zengfu.Human motion tracking based on an improved inverse kinematics[J].CAAI Transactions on Intelligent Systems,2015,10():548.[doi:10.3969/j.issn.1673-4785.201403032]
[8]姬晓飞,王昌汇,王扬扬.分层结构的双人交互行为识别方法[J].智能系统学报,2015,10(6):893.[doi:10.11992/tis.201505006]
　JI Xiaofei,WANG Changhui,WANG Yangyang.Human interaction behavior-recognition method based on hierarchical structure[J].CAAI Transactions on Intelligent Systems,2015,10():893.[doi:10.11992/tis.201505006]
[9]方鹏,李贤,汪增福.运用核聚类和偏最小二乘回归的歌唱声音转换[J].智能系统学报,2016,11(1):55.[doi:10.11992/tis.201506022]
　FANG Peng,LI Xian,WANG Zengfu.Conversion of singing voice based on kernel clustering and partial least squares regression[J].CAAI Transactions on Intelligent Systems,2016,11():55.[doi:10.11992/tis.201506022]
[10]李雪,蒋树强.智能交互的物体识别增量学习技术综述[J].智能系统学报,2017,12(2):140.[doi:10.11992/tis.201701006]
　LI Xue,JIANG Shuqiang.Incremental learning and object recognition system based on intelligent HCI: a survey[J].CAAI Transactions on Intelligent Systems,2017,12():140.[doi:10.11992/tis.201701006]

备注/Memo

收稿日期:2025-3-6。
基金项目:国家自然科学基金项目（62276285）；教育部学位与研究生教育发展中心主题案例库项目（ZT-231028914）；江苏省研究生科研与实践创新计划项目（KYCX24-4178）；中国科学院软件研究所合作项目（2205072325）.
作者简介:储文娟，硕士研究生，主要研究方向为视觉目标识别与图像处理。E-mail：2294806304@qq.com。;李震，教授，博士，主要研究方向为可靠性与系统工程。主持国家级项目2项、省部级项目2项、横向项目20余项。E-mail：justlz@just.edu.cn。;黄炜嘉，副教授，博士，主要研究方向为图像处理。主持横向项目1项，参研国家自然科学基金面上项目2项。E-mail：huangweijia@just.edu.cn。
通讯作者:李震. E-mail：justlz@just.edu.cn

更新日期/Last Update: 1900-01-01

面向失效增强和改进YOLOv8的目标检测 PDF下载HTML

备注/Memo

面向失效增强和改进YOLOv8的目标检测

PDF下载 HTML