Speech Recognition Based on Semi-supervised Data Selection via Decoding Multiple Candidate Results
WANG Xilou1, GUO Wu1, XIE Chuandong1
1. National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027
|
|
Abstract For low-resource speech recognition, a data selection strategy for semi-supervised learning with a large amount of unlabeled data is proposed and applied to both acoustic modeling and language modeling. A seed model is first trained on a small amount of transcribed data and then used to decode the unlabeled data. High-confidence sentences are selected from the decoded best candidate results by combining a confidence measure with perplexity, and these sentences are used to train the acoustic model and the language model. Furthermore, the decoded lattices are converted into multiple candidate texts for language model training. On a Japanese recognition task, the proposed method achieves a better recognition rate than data selection based on the confidence measure alone.
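
The selection step can be illustrated with the following minimal Python sketch: decoded 1-best hypotheses are kept only when the decoder confidence is high enough and the language-model perplexity is low enough. The data structure, field names, and threshold values are illustrative assumptions for exposition, not the paper's actual implementation.

# Sketch of combined confidence/perplexity data selection (assumed interface).
from dataclasses import dataclass
from typing import List


@dataclass
class Hypothesis:
    utt_id: str        # utterance identifier
    text: str          # 1-best decoded transcript
    confidence: float  # decoder confidence measure for the utterance
    perplexity: float  # language-model perplexity of the decoded transcript


def select_high_confidence(hyps: List[Hypothesis],
                           conf_threshold: float = 0.9,
                           ppl_threshold: float = 200.0) -> List[Hypothesis]:
    """Keep hypotheses that pass both the confidence and the perplexity test."""
    return [h for h in hyps
            if h.confidence >= conf_threshold and h.perplexity <= ppl_threshold]


if __name__ == "__main__":
    hyps = [
        Hypothesis("utt001", "decoded text a", confidence=0.95, perplexity=120.0),
        Hypothesis("utt002", "decoded text b", confidence=0.70, perplexity=300.0),
    ]
    selected = select_high_confidence(hyps)
    # The selected transcripts would then be added to the acoustic-model
    # and language-model training data.
    print([h.utt_id for h in selected])

In this sketch the two thresholds play the roles of the confidence and perplexity criteria described above; in practice their values would be tuned on held-out data.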
|
Received: 08 November 2017
|
|
Corresponding Author:
WANG Xilou (corresponding author), master student. His research interests include speech recognition.
|
About the authors: GUO Wu, Ph.D., associate professor. His research interests include speaker recognition and verification, and speech recognition. XIE Chuandong, master student. His research interests include speech recognition and keyword search.
|
|
|
|
|
|