模式识别与人工智能
Friday, Apr. 4, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence  2022, Vol. 35 Issue (12): 1078-1088    DOI: 10.16451/j.cnki.issn1003-6059.202212003
Deep Learning Based Image Understanding and Its Applications Current Issue| Next Issue| Archive| Adv Search |
Generalized Zero-Shot Image Classification Based on Reconstruction Contrast
XU Rui1, SHAO Shuai2, CAO Weijia3, LIU Baodi1, TAO Dapeng4, LIU Weifeng1
1. College of Control Science and Engineering, China University of Petroleum(East China), Qingdao 266580;
2. Research Institute of Basic Theories, Zhejiang Laboratory, Hangzhou 311121;
3. National Engineering Research Center of Remote Sensing Satellite Applications, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094;
4. School of Information Science and Engineering, Yunnan University, Yunnan 650500

Download: PDF (1449 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  In generalized zero-shot image classification, generative models are often exploited to reconstruct visual or semantic information for further learning. However, the representation performance of the methods based on variational autoencoders is poor due to the underutilization of the reconstructed samples. Therefore, a generalized zero-shot image classification model based on reconstruction and contrastive learning is proposed. Firstly, two variational self-encoders are utilized to encode visual information and semantic information into low dimensional latent vectors of the same dimension, and then the latent vectors are decoded into two modes respectively. Next, the project modules are utilized to project both the original visual information and the visual information reconstructed from semantic modal latent vectors. Then, reconstruction contrastive learning is performed to learn the features after projection. The reconstruction performance of the encoder is maintained, the discriminative performance of the encoder is enhanced, and the application ability of pre-training features on the generalized zero-shot task is improved by the proposed method. The effectiveness of the proposed model is verified on four benchmark datasets.
Key wordsGeneralized Zero-Shot Image Classification      Variational Autoencoders      Contrastive Lear-ning      Semantic Information      Visual Information     
Received: 20 May 2022     
ZTFLH: TP391  
  TP18  
Fund:National Natural Science Foundation of China(No.61671480), Major Scientific and Technological Projects of CNPC(No.ZD2019-183-008), Open Project of the National Laboratory of Pattern Recognition (NLPR)(No.202000009), Graduate Innovation Project of China University of Petroleum(East China)(No.YCX2021123)
Corresponding Authors: LIU Weifeng, Ph.D., professor. His research interests include pa-ttern recognition and machine learning.   
About author:: XU Rui, Ph.D. candidate. Her research interests include few-shot learning and zero-shot learning.SHAO Shuai, Ph.D. His research inte-rests include dictionary learning and few-shot learning.CAO Weijia, Ph.D. assistant professor. Her research interests include image encryption, image compression and image classification.LIU Baodi, Ph.D., associate professor. His research interests include computer vision and machine learning.TAO Dapeng, Ph.D., professor. His research interests include machine learning, computer vision and cloud computing.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
XU Rui
SHAO Shuai
CAO Weijia
LIU Baodi
TAO Dapeng
LIU Weifeng
Cite this article:   
XU Rui,SHAO Shuai,CAO Weijia等. Generalized Zero-Shot Image Classification Based on Reconstruction Contrast[J]. Pattern Recognition and Artificial Intelligence, 2022, 35(12): 1078-1088.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202212003      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2022/V35/I12/1078
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn