模式识别与人工智能
Friday, Apr. 11, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2015, Vol. 28 Issue (8): 673-679    DOI: 10.16451/j.cnki.issn1003-6059.201508001
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
Restricted Boltzmann Machine Based Spectrum Modeling and Unit Selection Speech Synthesis Method
SONG Yang, LING Zhen-Hua, DAI Li-Rong
National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027

Download: PDF (467 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  A restricted Boltzmann machine based spectrum modeling and unit selection speech synthesis method is proposed. At the model training stage, the restricted Boltzmann machine is used to model spectral features with rich details, such as spectral envelopes and short-time spectral amplitudes, instead of using the single Gaussian model with diagonal variance and mel-cepstrum feature for spectral model in the traditional approach. Thus, the description capability of the acoustical model for spectral feature is improved. At the speech synthesis stage, the restricted Boltzmann machine model is adopted to calculate the log likelihoods of spectral feature of candidate sample, and a method of piecewise linear mapping is proposed to construct target cost function for unit selection. The experimental results indicate that the proposed method can effectively improve the naturalness of synthetic speech.
Key wordsSpeech Synthesis      Unit Selection      Hidden Markov Model      Restricted Boltzmann Machine     
Received: 25 April 2014     
ZTFLH: TN 912.33  
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
SONG Yang
LING Zhen-Hua
DAI Li-Rong
Cite this article:   
SONG Yang,LING Zhen-Hua,DAI Li-Rong. Restricted Boltzmann Machine Based Spectrum Modeling and Unit Selection Speech Synthesis Method[J]. , 2015, 28(8): 673-679.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201508001      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2015/V28/I8/673
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn