模式识别与人工智能
Saturday, March 15, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2008, Vol. 21 Issue (3): 280-284    DOI:
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
Statistical Acoustic Model Based Unit Selection Algorithm for Speech Synthesis
LING Zhen-Hua, WANG Ren-Hua
iFly Speech Laboratory, Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei 230027

Download: PDF (408 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  A statistical acoustic model based unit selection algorithm for speech synthesis is proposed. During training stage, the acoustic models for contextual dependent phonemes are built up by using acoustic features extracted from the training data, such as spectral parameters, F0, and segmental and prosodic labels in the corpus. The hidden Markov model (HMM) is adopted as the model structure. During synthesis stage, the optimal phoneme unit sequence is searched in the speech corpus by maximizing the probabilistic likelihood between its acoustic features and the sentence HMM constructed with the contextual information of input text. Finally, the waveforms of the selected candidate units are concatenated and smoothed to produce the synthesized speech. Based on the proposed method, a Chinese speech synthesis system using initials and finals as the basic concatenation units is constructed. Results of listening test prove that the proposed method can achieve better naturalness of synthesized speech compared to the conventional method.
Key wordsSpeech Synthesis      Unit Selection      Statistical Acoustic Model      Hidden Markov Model (HMM)      Maximum Likelihood Criterion     
Received: 02 July 2007     
ZTFLH: TN912.33  
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
LING Zhen-Hua
WANG Ren-Hua
Cite this article:   
LING Zhen-Hua,WANG Ren-Hua. Statistical Acoustic Model Based Unit Selection Algorithm for Speech Synthesis[J]. , 2008, 21(3): 280-284.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2008/V21/I3/280
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn