[1]KANG Jie,LIU Wei.A cross-modal retrieval algorithm of decoration cases on feature fusion[J].CAAI Transactions on Intelligent Systems,2024,19(2):429-437.[doi:10.11992/tis.202207030]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 2
Page number:
429-437
Column:
学术论文—人工智能基础
Public date:
2024-03-05
- Title:
-
A cross-modal retrieval algorithm of decoration cases on feature fusion
- Author(s):
-
KANG Jie; LIU Wei
-
School of Electrical and Control Engineering, Shaanxi University of Science & Technology, Xi’an 710021, China
-
- Keywords:
-
home decoration customer service system; decoration case retrieval; cross-modal retrieval; style aggregation; multimodal; feature fusion; channel attention mechanism; semantic information
- CLC:
-
TP391
- DOI:
-
10.11992/tis.202207030
- Abstract:
-
Currently, home decoration customer service systems chiefly depend on manual decoration case retrieval, which leads to the system not meeting user demand for fast and timely consulting service and high labor costs. Thus, we propose a feature fusion-based cross-modal retrieval algorithm for decoration cases. Aiming at the problems of insufficient semantic information mining of multimodal data and low accuracy of model retrieval, the existing style aggregation module is improved. The channel attention mechanism is introduced into the original module to add suitable weights to the feature vectors of different pictures in each group of decoration cases, thereby improving the important features that include more helpful information and weakening other unimportant features. Conversely, a multimodal feature fusion module is developed for retrieval scenarios to make full use of multimodal information. The module can adaptively control a series of fusion operations of feature vectors from two different modalities to achieve knowledge flow and sharing between cross-modal data, thereby producing feature vectors with richer semantics and stronger expressive ability to improve the retrieval performance of the model further. Our developed algorithm is compared with other methods on a self-built multimodal dataset of decoration cases, and results show that the algorithm performs better in decoration case retrieval.