Pattern Recognition and Artificial Intelligence
Pattern Recognition and Artificial Intelligence  2023, Vol. 36 Issue (4): 300-312    DOI: 10.16451/j.cnki.issn1003-6059.202304002
Weight Adaptive Generative Adversarial Imitation Learning Based on Noise Contrastive Estimation
GUAN Weifan1,2, ZHANG Xi1
1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190;
2. School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049

Abstract  Traditional imitation learning requires expert demonstrations of extremely high quality. This requirement makes data collection harder and limits the scenarios in which the algorithms can be applied. To address this problem, weight adaptive generative adversarial imitation learning based on noise contrastive estimation (GLANCE) is proposed to maintain high performance when the quality of expert demonstrations is inconsistent. First, a feature extractor is trained by noise contrastive estimation to improve the feature distribution of suboptimal expert demonstrations. Then, a weight coefficient is assigned to each expert demonstration, and generative adversarial imitation learning is performed on the demonstrations after they are redistributed according to these weight coefficients. Finally, a ranking loss is calculated from evaluation data with known relative rankings, and the weight coefficients are optimized by gradient descent to improve the data distribution. Experiments on multiple continuous control tasks show that GLANCE needs only 5% of the expert demonstration dataset as evaluation data to achieve superior performance when the quality of the expert demonstrations is inconsistent.
Key words: Reinforcement Learning; Imitation Learning; Noise Contrastive Estimation; Adaptive Weight
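The abstract does not give the form of the ranking loss or say how the weight coefficients are parameterized. As a rough illustration of its final step only, the sketch below fits per-demonstration weights by gradient descent on a pairwise logistic ranking loss. The loss form, the softmax normalization, and all names (`pairwise_ranking_loss_and_grad`, `fit_weights`, `pairs`) are assumptions made for this example, not the authors' implementation. The noise contrastive feature extractor and the adversarial imitation step are omitted.

```python
import numpy as np


def pairwise_ranking_loss_and_grad(logits, pairs):
    """Mean logistic ranking loss over (better, worse) index pairs.

    logits: array of shape (N,), one unnormalized weight per demonstration.
    pairs:  list of (better, worse) indices taken from the small subset of
            demonstrations whose relative ranking is known (assumed format).
    """
    grad = np.zeros_like(logits)
    loss = 0.0
    for better, worse in pairs:
        diff = logits[better] - logits[worse]
        # log(1 + exp(-diff)), computed in a numerically stable way
        loss += np.logaddexp(0.0, -diff)
        # derivative of the loss term with respect to diff
        d_diff = -1.0 / (1.0 + np.exp(diff))
        grad[better] += d_diff
        grad[worse] -= d_diff
    n = len(pairs)
    return loss / n, grad / n


def softmax(x):
    """Normalize logits into weights that are positive and sum to 1."""
    e = np.exp(x - x.max())
    return e / e.sum()


def fit_weights(num_demos, pairs, lr=0.5, steps=200):
    """Plain gradient descent on the ranking loss, starting from equal weights."""
    logits = np.zeros(num_demos)
    for _ in range(steps):
        _, grad = pairwise_ranking_loss_and_grad(logits, pairs)
        logits -= lr * grad
    return softmax(logits), logits


# Hypothetical evaluation data: demonstration 0 is ranked above 1, and 1 above 2.
# Demonstration 3 has no ranking information and keeps its initial logit.
pairs = [(0, 1), (1, 2)]
weights, logits = fit_weights(4, pairs)
final_loss, _ = pairwise_ranking_loss_and_grad(logits, pairs)
print(weights, final_loss)
```

In a full pipeline, weights like these would rescale how much each demonstration contributes to the discriminator's expert term. That coupling is not shown here.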
Received: 25 November 2022     
ZTFLH: TP 18  
Fund: National Key Research and Development Program of China (No. 2020AAA0103400)
Corresponding Author: ZHANG Xi, Ph.D., associate professor. Her research interests include machine learning and reinforcement learning.
About author: GUAN Weifan, master student. His research interests include reinforcement learning and imitation learning.
Cite this article:   
GUAN Weifan,ZHANG Xi. Weight Adaptive Generative Adversarial Imitation Learning Based on Noise Contrastive Estimation[J]. Pattern Recognition and Artificial Intelligence, 2023, 36(4): 300-312.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202304002      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2023/V36/I4/300