模式识别与人工智能
Friday, Apr. 11, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2018, Vol. 31 Issue (7): 662-667    DOI: 10.16451/j.cnki.issn1003-6059.201807009
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
Speech Recognition Based on Semi-supervised Data Selection via Decoding Multiple Candidate Results
WANG Xilou1, GUO Wu1, XIE Chuandong1
1.National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027

Download: PDF (0 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  For speech recognition of low resources, a selection strategy for semi-supervised learning with a large number of unlabeled data is proposed, and this strategy is applied to both acoustic modeling and language modeling. After a small amount of data is used to train the seed model, the unlabeled data is decoded using the seed model. Firstly, high-confidence sentences are selected by using a combination of confidence measure and perplexity in the decoded best candidate results. Then, the high-confidence sentences are used to train acoustic model and language model. Furthermore, the decoded lattice is transformed to obtain multiple candidate texts for language model training. In the Japanese recognition task, the proposed method obtains a better recognition rate than the method of selecting data based on confidence measure.
Key wordsConfidence Measure      Semi-supervised Learning      N-BEST      Low Resource     
Received: 08 November 2017     
ZTFLH: TN 912.3  
Corresponding Authors: WANG Xilou(Corresponding author), master student. His research interests include speech recognition.   
About author:: GUO Wu, Ph.D., associate professor. His research interests include speaker recognition and verification, speech recognition.XIE Chuandong, master student. His research interests include speech recognition and keyword search.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
WANG Xilou
GUO Wu
XIE Chuandong
Cite this article:   
WANG Xilou,GUO Wu,XIE Chuandong. Speech Recognition Based on Semi-supervised Data Selection via Decoding Multiple Candidate Results[J]. , 2018, 31(7): 662-667.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201807009      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2018/V31/I7/662
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn