[1]储文娟,李震,黄炜嘉,等.面向失效增强和改进YOLOv8的目标检测[J].智能系统学报,2026,21(2):353-364.[doi:10.11992/tis.202503010]
 CHU Wenjuan,LI Zhen,HUANG Weijia,et al.A failure enhancement and improvement of YOLOv8 for target detection[J].CAAI Transactions on Intelligent Systems,2026,21(2):353-364.[doi:10.11992/tis.202503010]
点击复制

面向失效增强和改进YOLOv8的目标检测

参考文献/References:
[1] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[J]. Communications of the ACM, 2017, 60(6): 84-90
[2] REN Shaoqing, HE Kaiming, GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 39(6): 1137-1149
[3] REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 779-788.
[4] HUANG Xiaochen, WANG Xiaofeng, TENG Qizhi, et al. Degradation type-aware image restoration for effective object detection in adverse weather[J]. Sensors, 2024, 24(19): 6330
[5] WANG Zhipan, LIU Di, WANG Zhongwu, et al. A new remote sensing change detection data augmentation method based on mosaic simulation and haze image simulation[J]. IEEE journal of selected topics in applied earth observations and remote sensing, 2023, 16: 4579-4590
[6] WU Junjun, RAO Yunbo, ZENG Shaoning, et al. Pre-trained SAM as data augmentation for image segmentation[J]. CAAI transactions on intelligence technology, 2025, 10(1): 268-282
[7] 肖晶晶, 樊博彦, 杨雨婷. 雾环境下的船舶目标检测研究[J]. 重庆理工大学学报(自然科学), 2024, 38(3): 212-219 XIAO Jingjing, FAN Boyan, YANG Yuting. Research on ship object detection in foggy environments[J]. Journal of Chongqing University of Technology (natural science), 2024, 38(3): 212-219
[8] 马淦, 谷雨, 彭冬亮. 结合改进YOLOv5s和动态数据增强的海面舰船检测[J]. 计算机工程, 2025, 51(9): 294-305 MA Gan, GU Yu, PENG Dongliang. Combining improved YOLOv5s and dynamic data augmentation for sea surface ship detection[J]. Computer engineering, 2025, 51(9): 294-305
[9] FAN Pan, ZHENG Chusan, SUN Jin, et al. Enhanced real-time target detection for picking robots using lightweight CenterNet in complex orchard environments[J]. Agriculture, 2024, 14(7): 1059
[10] 邢汇源, 崔亚奇, 王子玲, 等. 复杂海况下的海上船舶目标检测算法[J]. 现代防御技术, 2024, 52(6): 88-96 XING Huiyuan, CUI Yaqi, WANG Ziling, et al. Target detection algorithm for ships at sea under complex sea conditions[J]. Modern defence technology, 2024, 52(6): 88-96
[11] 张国印, 王传博, 高伟. 抗遮挡的行人多目标跟踪算法[J]. 智能系统学报, 2024, 19(5): 1248-1256 ZHANG Guoyin, WANG Chuanbo, GAO Wei. Pedestrian multiobject tracking algorithm with anti-occlusion[J]. CAAI transactions on intelligent systems, 2024, 19(5): 1248-1256
[12] LYU Yunkai, YANG Xiaobing, GUAN Ai, et al. Construction personnel dress code detection based on YOLO framework[J]. CAAI transactions on intelligence technology, 2024, 9(3): 709-721
[13] 吴攀超, 郑卓纹, 王婷婷, 等. 基于CF-YOLO的雾霾交通标志识别[J]. 计算机工程与设计, 2024, 45(7): 2203-2211 WU Panchao, ZHENG Zhuowen, WANG Tingting, et al. Foggy traffic sign recognition based on CF-YOLO[J]. Computer engineering and design, 2024, 45(7): 2203-2211
[14] 赵文清, 康怿瑾, 赵振兵, 等. 改进YOLOv5s的遥感图像目标检测[J]. 智能系统学报, 2023, 18(1): 86-95 ZHAO Wenqing, KANG Yijin, ZHAO Zhenbing, et al. A remote sensing image object detection algorithm with improved YOLOv5s[J]. CAAI transactions on intelligent systems, 2023, 18(1): 86-95
[15] 许迪, 张淑卿, 葛超. 面向复杂环境的YOLOv8安全装备检测[J]. 电子测量技术, 2024, 47(7): 121-129 XU Di, ZHANG Shuqing, GE Chao. YOLOv8 security equipment inspection for complex environments[J]. Electronic measurement technology, 2024, 47(7): 121-129
[16] WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[C]//Computer Vision–ECCV 2018. Cham: Springer, 2018: 3-19.
[17] YANG Guoyu, LEI Jie, ZHU Zhikuan, et al. AFPN: asymptotic feature pyramid network for object detection[C]//2023 IEEE International Conference on Systems, Man, and Cybernetics. Honolulu: IEEE, 2024: 2184-2189.
[18] ZHANG Hao, XU Cong, ZHANG Shuaijie. Inner-IoU: more effective intersection over union loss with auxiliary bounding box[EB/OL]. (2023-11-06)[2025-03-06]. https://arxiv.org/abs/2311.02877.
[19] WANG Weijun, HOWARD A. Mosaic: mobile segmentation via decoding aggregated information and encoded context[EB/OL]. (2021-12-22)[2025-03-06]. https://arxiv.org/abs/2112.11623.
[20] LIU Shu, QI Lu, QIN Haifang, et al. Path aggregation network for instance segmentation[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 8759-8768.
[21] JADERBERG M, SIMONYAN K, ZISSERMAN A. Spatial Transformer networks[J]. Advances in neural information processing systems, 2015, 28: 1-9
[22] HU Jie, SHEN Li, SUN Gang. Squeeze-and-excitation networks[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 7132-7141.
[23] LIN T Y, DOLL?R P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 936-944.
[24] TAN Mingxing, PANG Ruoming, LE Q V. EfficientDet: scalable and efficient object detection[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2020: 10778-10787.
[25] LIU Songtao, HUANG Di, WANG Yunhong. Learning spatial fusion for single-shot object detection[EB/OL]. (2019-11-21)[2025-03-06]. https://arxiv.org/abs/1911.09516.
[26] LI Chuyi, LI Lulu, JIANG Hongliang, et al. YOLOv6: a single-stage object detection framework for industrial applications[EB/OL]. (2022-09-07)[2025-03-06]. https://arxiv.org/abs/2209.02976.
[27] WANG C Y, BOCHKOVSKIY A, LIAO H M. YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Vancouver: IEEE, 2023: 7464-7475.
[28] WANG Jinwang, YANG Wen, GUO Haowen, et al. Tiny object detection in aerial images[C]//2020 25th International Conference on Pattern Recognition. Milan: IEEE, 2021: 3791-3798.
[29] SELVARAJU R R, COGSWELL M, DAS A, et al. Grad-CAM: visual explanations from deep networks via gradient-based localization[J]. International journal of computer vision, 2020, 128(2): 336-359
[30] ZHOU Zhiguo, SUN Jiaen, YU Jiabao, et al. An image-based benchmark dataset and a novel object detector for water surface object detection[J]. Frontiers in neurorobotics, 2021, 15: 723336
相似文献/References:
[1]夏 凡,王 宏.基于局部异常行为检测的欺骗识别研究[J].智能系统学报,2007,2(5):12.
 XIA Fan,WANG Hong.Methodologies for deception detection based on abnormal b ehavior[J].CAAI Transactions on Intelligent Systems,2007,2():12.
[2]杨 戈,刘 宏.视觉跟踪算法综述[J].智能系统学报,2010,5(2):95.
 YANG Ge,LIU Hong.Survey of visual tracking algorithms[J].CAAI Transactions on Intelligent Systems,2010,5():95.
[3]刘宏,李哲媛,许超.视错觉现象的分类和研究进展[J].智能系统学报,2011,6(1):1.
 LIU Hong,LI Zheyuan,XU Chao.The categories and research advances of visual illusions[J].CAAI Transactions on Intelligent Systems,2011,6():1.
[4]叶果,程洪,赵洋.电影中吸烟活动识别[J].智能系统学报,2011,6(5):440.
 YE Guo,CHENG Hong,ZHAO Yang.moking recognition in movies[J].CAAI Transactions on Intelligent Systems,2011,6():440.
[5]史晓鹏,何为,韩力群.采用Hough变换的道路边界检测算法[J].智能系统学报,2012,7(1):81.
 SHI Xiaopeng,HE Wei,HAN Liqun.A road edge detection algorithm based on the Hough transform[J].CAAI Transactions on Intelligent Systems,2012,7():81.
[6]顾照鹏,刘宏.单目视觉同步定位与地图创建方法综述[J].智能系统学报,2015,10(4):499.[doi:10.3969/j.issn.1673-4785.201503003]
 GU Zhaopeng,LIU Hong.A survey of monocular simultaneous localization and mapping[J].CAAI Transactions on Intelligent Systems,2015,10():499.[doi:10.3969/j.issn.1673-4785.201503003]
[7]赵军,於俊,汪增福.基于改进逆向运动学的人体运动跟踪[J].智能系统学报,2015,10(4):548.[doi:10.3969/j.issn.1673-4785.201403032]
 ZHAO Jun,YU Jun,WANG Zengfu.Human motion tracking based on an improved inverse kinematics[J].CAAI Transactions on Intelligent Systems,2015,10():548.[doi:10.3969/j.issn.1673-4785.201403032]
[8]姬晓飞,王昌汇,王扬扬.分层结构的双人交互行为识别方法[J].智能系统学报,2015,10(6):893.[doi:10.11992/tis.201505006]
 JI Xiaofei,WANG Changhui,WANG Yangyang.Human interaction behavior-recognition method based on hierarchical structure[J].CAAI Transactions on Intelligent Systems,2015,10():893.[doi:10.11992/tis.201505006]
[9]方鹏,李贤,汪增福.运用核聚类和偏最小二乘回归的歌唱声音转换[J].智能系统学报,2016,11(1):55.[doi:10.11992/tis.201506022]
 FANG Peng,LI Xian,WANG Zengfu.Conversion of singing voice based on kernel clustering and partial least squares regression[J].CAAI Transactions on Intelligent Systems,2016,11():55.[doi:10.11992/tis.201506022]
[10]李雪,蒋树强.智能交互的物体识别增量学习技术综述[J].智能系统学报,2017,12(2):140.[doi:10.11992/tis.201701006]
 LI Xue,JIANG Shuqiang.Incremental learning and object recognition system based on intelligent HCI: a survey[J].CAAI Transactions on Intelligent Systems,2017,12():140.[doi:10.11992/tis.201701006]

备注/Memo

收稿日期:2025-3-6。
基金项目:国家自然科学基金项目(62276285);教育部学位与研究生教育发展中心主题案例库项目(ZT-231028914);江苏省研究生科研与实践创新计划项目(KYCX24-4178);中国科学院软件研究所合作项目(2205072325).
作者简介:储文娟,硕士研究生,主要研究方向为视觉目标识别与图像处理。E-mail:2294806304@qq.com。;李震,教授,博士,主要研究方向为可靠性与系统工程。主持国家级项目2项、省部级项目2项、横向项目20余项。E-mail:justlz@just.edu.cn。;黄炜嘉,副教授,博士,主要研究方向为图像处理。主持横向项目1项,参研国家自然科学基金面上项目2项。E-mail:huangweijia@just.edu.cn。
通讯作者:李震. E-mail:justlz@just.edu.cn

更新日期/Last Update: 1900-01-01
Copyright © 《 智能系统学报》 编辑部
地址:(150001)黑龙江省哈尔滨市南岗区南通大街145-1号楼 电话:0451- 82534001、82518134 邮箱:tis@vip.sina.com