[1]李小菲,苟光磊,韩岩奇,等.多尺度知识引导局部增强的小样本细粒度图像分类方法[J].智能系统学报,2024,19(5):1157-1167.[doi:10.11992/tis.202309003]
LI Xiaofei,GOU Guanglei,HAN Yanqi,et al.Multiscale knowledge-guided local enhancement method for few-shot fine-grained image classification[J].CAAI Transactions on Intelligent Systems,2024,19(5):1157-1167.[doi:10.11992/tis.202309003]
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
19
期数:
2024年第5期
页码:
1157-1167
栏目:
学术论文—机器感知与模式识别
出版日期:
2024-09-05
- Title:
-
Multiscale knowledge-guided local enhancement method for few-shot fine-grained image classification
- 作者:
-
李小菲, 苟光磊, 韩岩奇, 朱东华
-
重庆理工大学 计算机科学与工程学院, 重庆 400054
- Author(s):
-
LI Xiaofei, GOU Guanglei, HAN Yanqi, ZHU Donghua
-
College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054, China
-
- 关键词:
-
小样本学习; 细粒度图像; 图像分类; 多尺度特征; 关系网络; 深度学习; 数据增强; 语义相关性
- Keywords:
-
few-shot learning; fine-grained image; image classification; multi-scale feature; relation network; deep learning; data augmentation; semantical correlations
- 分类号:
-
TP391
- DOI:
-
10.11992/tis.202309003
- 文献标志码:
-
2024-08-29
- 摘要:
-
小样本细粒度图像分类任务中,由于支持样本与查询样本之间缺少局部关联性,导致图像关键可区分区域不易精确定位。针对这一问题,提出了多尺度知识引导局部增强的小样本细粒度图像分类方法,采用图像金字塔向下降低采样率获得多个不同分辨率的子图作为输入图像,融合多尺度特征,丰富了样本信息,利用知识引导模块捕获支持样本和查询样本之间的语义相关性,增强了支持样本重要区域的特征表示,对支持样本和查询样本的嵌入特征图进行克罗内克积操作,生成的空间相关图,更精确地定位样本特征之间的位置对应关系,突出了辨别性区域。实验结果表明该方法在小样本细粒度图像分类任务中表现出较好的分类性能。
- Abstract:
-
In few-shot fine-grained image classification tasks, the absence of local correlation between support and query samples causes difficulty in precisely identifying discriminative regions in images. To solve this problem, this study proposes a multiscale knowledge-guided local enhancement method for few-shot fine-grained image classification. Multiple sub-images with different resolutions are obtained as input images by reducing the sampling rate downward using the image pyramid and by integrating multi-scale features to enrich the sample information. A knowledge-guided module is employed to capture semantic correlations between support and query samples, which enhances the feature representation of important regions in support samples. Kronecker product operations are performed on the embedded feature maps of support and query samples to generate spatial correlation maps. This way enables more accurate localization of the positional correspondence between sample features and highlights the discriminative area. Experimental results confirm the robust classification performance of the method in the classification task of few-shot fine-grained images.
备注/Memo
收稿日期:2023-9-1。
基金项目:国家自然科学基金项目(62141201);重庆市教委科学技术研究项目(KJZD-M202201102).
作者简介:李小菲,硕士研究生,主要研究方向为计算机视觉、人工智能。E-mail:sofina612@163.com;苟光磊,副教授,主要研究方向为智能信息处理、人工智能。已主持重庆市科委基础科学与前沿技术研究项目、重庆市教委科学技术研究项目等科研项目5项,授权发明专利3项,发表学术论文20余篇,出版专著1部、教材2部。E-mail:ggl@cqut.edu.cn;韩岩奇,硕士研究生,中国计算机学会会员,主要研究方向为计算机视觉、小样本学习。E-mail:hanyanqi0408@163.com。
通讯作者:苟光磊. E-mail:ggl@cqut.edu.cn
更新日期/Last Update:
2024-09-05