Semi-supervised Acoustic Modeling Based on Perplexity Data Selection
XIE Chuandong, GUO Wu
National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027
Abstract For acoustic modeling of low-resource languages, a perplexity-based approach is proposed to select unsupervised data from decoding transcriptions and retrain the acoustic model. A large unsupervised corpus is decoded with an initial acoustic model trained on a small amount of labeled data, and the perplexity of each decoded transcription is computed against the training set. The selected data, those most similar to the labeled data, are then combined with the labeled data to train the acoustic model. To reduce the impact of errors in the decoded unsupervised data, the final network parameters of the deep-neural-network acoustic model are adjusted in the last training iteration using only the correctly labeled data. On the Swahili VLLP recognition task of the NIST 2015 open keyword search evaluation, the proposed approach improves the recognition rate compared with other methods.
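The selection step described above can be illustrated with a minimal Python sketch. It assumes a simple add-one-smoothed bigram language model estimated from the labeled transcripts; the paper's actual language model and the perplexity threshold are not specified here, so the function names and threshold value are illustrative only.

```python
import math
from collections import Counter

def train_bigram_lm(sentences):
    """Estimate an add-one-smoothed bigram LM from labeled transcripts."""
    unigrams, bigrams = Counter(), Counter()
    for sent in sentences:
        toks = ["<s>"] + sent.split() + ["</s>"]
        unigrams.update(toks)
        bigrams.update(zip(toks, toks[1:]))
    vocab = len(unigrams)  # vocabulary size for add-one smoothing
    return unigrams, bigrams, vocab

def perplexity(sentence, lm):
    """Per-token perplexity of one decoded hypothesis under the LM."""
    unigrams, bigrams, vocab = lm
    toks = ["<s>"] + sentence.split() + ["</s>"]
    logp = 0.0
    for prev, cur in zip(toks, toks[1:]):
        # Add-one smoothing handles bigrams unseen in the labeled data.
        p = (bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab)
        logp += math.log(p)
    n = len(toks) - 1
    return math.exp(-logp / n)

def select_utterances(decoded, lm, threshold):
    """Keep decoded utterances whose perplexity against the
    labeled-data LM is below the threshold, i.e. utterances
    most similar to the labeled training set."""
    return [u for u in decoded if perplexity(u, lm) <= threshold]
```

Selected utterances would then be pooled with the labeled data for retraining, with the final DNN parameters fine-tuned on the labeled data alone in the last iteration.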
Received: 21 October 2015
Corresponding Author:
GUO Wu, born in 1973, Ph.D., associate professor. His research interests include speech recognition and speaker recognition.
About the author: XIE Chuandong, born in 1990, master's student. His research interests include speech recognition and keyword retrieval.