基于音素绑定码本映射的说话人声音转换方法

摘要
图/表
参考文献
相关文章 (7)

全文: PDF (681 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要介绍说话人声音转换系统框架,并对传统的基于码本映射的说话人声音转换方法进行讨论.指出传统的码本映射方法由于对谱的转换采用所有码本加权叠加,因此会产生转换后语音频谱平滑效应过重的问题,从而使转换后语音音质较差.为了克服这种问题,本文提出基于音素绑定的码本加权叠加方法来完成语音谱的转换,同时利用决策树来完成韵律的转换.实验表明,即使在数据量较少的情况下,该方法也能较好地完成说话人声音转换,并能得到较高的语音音质.

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	王子祥
	戴礼荣
	王玉平
	王仁华

关键词 ：声音转换, 码本映射, 决策树

Abstract：The voice conversion system framework is introduced in this paper. Further, the conventional codebook mapping method for voice conversion is discussed. This paper point out that the conventional codebook mapping method, which calculates the weighting coefficients based on whole codebooks, tends to generate overly smoothed effect on converted speech spectrum. So the converted speech quality is decreased greatly. To address this problem, a novel voice conversion method based on codebook mapping with phonemetied weighting is presented. And a new decision tree based prosodic conversion method is also proposed. The experiments show that the proposed methods can effectively convert speaker's individuality while maintaining high speech quality with only a small amount of training data.

Key words： Voice Conversion Codebook Mapping Decision Tree

收稿日期: 2005-01-07

ZTFLH:

TN391.42

作者简介: 王子祥,男,1982年生,硕士研究生,主要研究方向为说话人声音转换.戴礼荣,男,1962年生,博士,副教授,主要研究方向为语音信号处理、语音编码与通信、人机语声对话及DSP技术应用.E-mail: lrdai@ustc.edu.cn.王玉平,男,1983年生,硕士研究生,主要研究方向为语音合成.王仁华,男,1943年生,教授,博士生导师,主要研究方向为数字信号处理、语音通信、多媒体通信等.

引用本文:

王子祥，戴礼荣，王玉平，王仁华. 基于音素绑定码本映射的说话人声音转换方法[J]. 模式识别与人工智能, 2006, 19(3): 300-306. WANG ZiXiang, DAI LiRong, WANG YuPing, WANG RenHua. A Novel Voice Conversion Method Based on Codebook Mapping with PhonemeTied Weighting. , 2006, 19(3): 300-306.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/ 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2006/V19/I3/300