[1]SUN Qingmei,JIN Cong.Image annotation method based on visual attention mechanism and conditional random field[J].CAAI Transactions on Intelligent Systems,2016,11(4):442-448.[doi:10.11992/tis.201606004]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
11
Number of periods:
2016 4
Page number:
442-448
Column:
学术论文—机器学习
Public date:
2016-07-25
- Title:
-
Image annotation method based on visual attention mechanism and conditional random field
- Author(s):
-
SUN Qingmei; JIN Cong
-
School of Computer, Central China Normal University, Wuhan 430079, China
-
- Keywords:
-
automatic image annotation; visual attention mechanism; inter-word correlation; conditional random fields
- CLC:
-
TP391
- DOI:
-
10.11992/tis.201606004
- Abstract:
-
Traditional image annotation methods interpret all image regions equally, neglecting any understanding of the image. Therefore, an image annotation method based on the visual attention mechanism and conditional random field, called VAMCRF, is proposed. Firstly, people pay more attention to image salient regions during the process of image recognition; this can be achieved through the visual attention mechanism and the support vector machine is then used to assign semantic labels. It then labels the non-salient regions using a k-NN clustering algorithm. Finally, as the annotations of salient and non-salient regions are logically related, the ultimate label vector of the image can be corrected and determined by a conditional random field (CRF) model and inter-word correlation. From the values of average precision, average recall, and F1, the experimental results on Corel5k, IAPR TC-12, and ESP Game confirm that the proposed method is efficient compared with traditional annotation methods.