[1]ZHANG Ji,CAO Yi,WANG Yaru,et al.Zero-shot image classification method combining VAE and StackGAN[J].CAAI Transactions on Intelligent Systems,2022,17(3):593-601.[doi:10.11992/tis.202107012]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
17
Number of periods:
2022 3
Page number:
593-601
Column:
学术论文—人工智能基础
Public date:
2022-05-05
- Title:
-
Zero-shot image classification method combining VAE and StackGAN
- Author(s):
-
ZHANG Ji1; CAO Yi1; WANG Yaru2; ZHAO Wenqing1; ZHAI Yongjie2
-
1. Department of Computer, North China Electric Power University, Baoding 071003, China;
2. Department of Automation, North China Electric Power University, Baoding 071003, China
-
- Keywords:
-
Deep learning; Zero-shot learning; Image classification; Variational autoencoder; Generative adversarial network; Staged network; Sentence vector; Auxiliary information
- CLC:
-
TP18
- DOI:
-
10.11992/tis.202107012
- Abstract:
-
The zero-shot classification algorithm is designed to solve the classification problem in case of a few samples or even missing categories. With the development of deep learning, the application of the generation model in zero-shot classification has made a breakthrough. By generating images of missing categories, the zero-shot image classification is transformed into a traditional image classification problem based on supervised learning. However, the generated samples are unstable in quality, including missing details and color distortion, thus affecting the accuracy of image classification. To this end, the zero-shot image classification method combining variational auto-encoding (VAE) and stack generative adversarial networks (StackGAN) is proposed. Based on the VAE/GAN model, StackGAN is introduced to generate the data of missing categories. Meanwhile, the deep learning method is used to train and obtain the sentence vectors of each category as auxiliary information and build a new generation model stc–CLS–VAEStackGAN to improve the quality of generated images and subsequently improve the classification accuracy of the zero-shot images. A comparative experiment was conducted on the public dataset, and the experimental results verified the effectiveness and superiority of the method proposed herein.