Statistical Acoustic Model Based Unit Selection Algorithm for Speech Synthesis
LING Zhen-Hua, WANG Ren-Hua
iFly Speech Laboratory, Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei 230027, China
Abstract  A statistical acoustic model based unit selection algorithm for speech synthesis is proposed. During the training stage, acoustic models of context-dependent phonemes are trained on acoustic features extracted from the training data, such as spectral parameters and F0, together with the segmental and prosodic labels in the corpus. The hidden Markov model (HMM) is adopted as the model structure. During the synthesis stage, the optimal phoneme unit sequence is searched for in the speech corpus by maximizing the likelihood of its acoustic features under the sentence HMM constructed from the contextual information of the input text. Finally, the waveforms of the selected candidate units are concatenated and smoothed to produce the synthesized speech. Based on the proposed method, a Chinese speech synthesis system using initials and finals as the basic concatenation units is built. Listening test results show that the proposed method achieves better naturalness of synthesized speech than the conventional method.
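
The abstract describes unit selection as a search for the corpus unit sequence whose acoustic features are most likely under the sentence HMM, combined with concatenation smoothing. The following Python sketch illustrates that general idea only, under assumed interfaces; it is not the authors' implementation, and the names (Candidate, hmm_loglik, join_score, select_units) are hypothetical placeholders.

```python
# Illustrative sketch of likelihood-based unit selection (not the paper's code).
# Each target phoneme has candidate units from the corpus; a candidate is scored
# by the log-likelihood of its features under the context-dependent HMM, and the
# best overall sequence is found by a Viterbi-style dynamic programming search.
from dataclasses import dataclass
from typing import Callable, List, Sequence
import numpy as np

@dataclass
class Candidate:
    unit_id: int            # index of the unit in the speech corpus
    features: np.ndarray    # (T, D) matrix of spectral/F0 features for the unit

def select_units(
    candidates_per_phone: Sequence[List[Candidate]],
    hmm_loglik: Callable[[int, np.ndarray], float],      # log P(features | HMM of phone i), assumed given
    join_score: Callable[[Candidate, Candidate], float], # smoothness score for concatenating two units
) -> List[Candidate]:
    """Return the candidate sequence maximizing target log-likelihood plus join scores."""
    n = len(candidates_per_phone)
    best = [np.full(len(c), -np.inf) for c in candidates_per_phone]  # best cumulative score per candidate
    back = [np.zeros(len(c), dtype=int) for c in candidates_per_phone]  # backpointers

    # Initialize with the target likelihood of the first phone's candidates.
    for j, cand in enumerate(candidates_per_phone[0]):
        best[0][j] = hmm_loglik(0, cand.features)

    # Dynamic programming over phones and candidates.
    for i in range(1, n):
        for j, cand in enumerate(candidates_per_phone[i]):
            target = hmm_loglik(i, cand.features)
            scores = [best[i - 1][k] + join_score(prev, cand)
                      for k, prev in enumerate(candidates_per_phone[i - 1])]
            k_best = int(np.argmax(scores))
            best[i][j] = scores[k_best] + target
            back[i][j] = k_best

    # Trace back the optimal unit sequence.
    path = [int(np.argmax(best[-1]))]
    for i in range(n - 1, 0, -1):
        path.append(int(back[i][path[-1]]))
    path.reverse()
    return [candidates_per_phone[i][j] for i, j in enumerate(path)]
```

In the paper's setting the per-unit score would come from the context-dependent HMMs trained on spectral and F0 features, and the selected units' waveforms would then be concatenated and smoothed; the sketch abstracts both scoring functions away as callables.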
Received: 02 July 2007