模式识别与人工智能
Saturday, March 15, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence  2024, Vol. 37 Issue (7): 638-651    DOI: 10.16451/j.cnki.issn1003-6059.202407006
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
Image Generation Method for Cognizing Image Attribute Features from the Perspective of Disentangled Representation Learning
CAI Jianghai1,2, HUANG Chengquan1,2,3, WANG Shunxia2, LUO Senyan2, YANG Guiyan2, ZHOU Lihua2
1. Key Laboratory of Pattern Recognition and Intelligent Systems of Guizhou Province, Guizhou Minzu University, Guiyang 550025;
2. School of Data Sciences and Information Engineering, Guizhou Minzu University, Guiyang 550025;
3. Engineering Training Center, Guizhou Minzu University, Gui-yang 550025

Download: PDF (4136 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  In the field of generative artificial intelligence, the research of disentangled representation learning further promotes the development of image generation methods. However, existing disentanglement methods pay more attention to low-dimensional representation of image generation, ignoring inherent interpretable factors of the target variation image. This oversight results in generated image being susceptible to the influence of other irrelevant attribute features. To address this issue, an image generation method for cognizing image attribute features from the perspective of disentangled representation learning is proposed. Firstly, candidate traversal directions for the target variation image are obtained by training, starting from the latent space of the generative model. Secondly, an unsupervised semantic decomposition strategy is constructed, and the interpretable directions embedded in the latent space are jointly discovered based on the direction of candidate traversals. Finally, a contrast simulator and a variation space are constructed using disentangled encoders and contrastive learning. Consequently, the disentangled representations of the target variation image are extracted from the interpretable directions and the image is generated. Extensive experiments on five popular disentanglement datasets demonstrate the superior performance of the proposed method.
Key wordsDisentangled Representation Learning      Latent Space      Interpretable Direction      Image Ge-neration      Variation Space     
Received: 26 April 2024     
ZTFLH: TP391  
Fund:Supported by National Natural Science Foundation of China(No.62062024), Science and Technology Program of Guizhou Pro-vince(No.QKH Basic ZK [2021] Common 342), Key Project of Teaching Reform of Graduate Education of Guizhou Province(No.QJH YJSJGKT[2021]018), Natural Science Research Project of Education Department of Guizhou Province(No.QJJ [2022]015), 2022 Open Subjects of Key Laboratory of Pattern Recognition and Intelligent Systems of Guizhou Province(No.GZMUKL[2022]KF03)
Corresponding Authors: HUANG Chengquan, Ph.D., professor. His research interests include deep learning and image processing.   
About author:: CAI Jianghai, Master student. His research interests include deep learning, image processing and disentangled representation learning. WANG Shunxia, Master student. Her research interests include machine learning and pattern recognition. LUO Senyan, Master student. Her research interests include machine learning and pattern recognition. YANG Guiyan, Master student. Her research interests include machine learning and pattern recognition. ZHOU Lihua, Master, associate professor. Her research interests include deep learning and image processing.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
CAI Jianghai
HUANG Chengquan
WANG Shunxia
LUO Senyan
YANG Guiyan
ZHOU Lihua
Cite this article:   
CAI Jianghai,HUANG Chengquan,WANG Shunxia等. Image Generation Method for Cognizing Image Attribute Features from the Perspective of Disentangled Representation Learning[J]. Pattern Recognition and Artificial Intelligence, 2024, 37(7): 638-651.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202407006      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2024/V37/I7/638
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn