[1]LI Xiaofei,GOU Guanglei,HAN Yanqi,et al.Multiscale knowledge-guided local enhancement method for few-shot fine-grained image classification[J].CAAI Transactions on Intelligent Systems,2024,19(5):1157-1167.[doi:10.11992/tis.202309003]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 5
Page number:
1157-1167
Column:
学术论文—机器感知与模式识别
Public date:
2024-09-05
- Title:
-
Multiscale knowledge-guided local enhancement method for few-shot fine-grained image classification
- Author(s):
-
LI Xiaofei; GOU Guanglei; HAN Yanqi; ZHU Donghua
-
College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054, China
-
- Keywords:
-
few-shot learning; fine-grained image; image classification; multi-scale feature; relation network; deep learning; data augmentation; semantical correlations
- CLC:
-
TP391
- DOI:
-
10.11992/tis.202309003
- Abstract:
-
In few-shot fine-grained image classification tasks, the absence of local correlation between support and query samples causes difficulty in precisely identifying discriminative regions in images. To solve this problem, this study proposes a multiscale knowledge-guided local enhancement method for few-shot fine-grained image classification. Multiple sub-images with different resolutions are obtained as input images by reducing the sampling rate downward using the image pyramid and by integrating multi-scale features to enrich the sample information. A knowledge-guided module is employed to capture semantic correlations between support and query samples, which enhances the feature representation of important regions in support samples. Kronecker product operations are performed on the embedded feature maps of support and query samples to generate spatial correlation maps. This way enables more accurate localization of the positional correspondence between sample features and highlights the discriminative area. Experimental results confirm the robust classification performance of the method in the classification task of few-shot fine-grained images.