声纹识别中合成语音的鲁棒性

摘要
图/表
参考文献
相关文章 (1)

全文: PDF (542 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要随着以隐马尔科夫模型为基础的语音合成技术的发展，冒认者很容易利用该技术生成具有目标说话人特性的合成语音，这对现有的声纹识别系统构成巨大威胁。针对此问题，文中从统计学的角度分析自然语音与合成语音在实倒谱上的区别，并提出对合成语音具有鲁棒性的声纹识别系统。实验结果初步表明，相比于传统的声纹识别系统，在对自然语音的等错误率不变的情况下，该系统对合成语音的错误接受率由99。2%降为0。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	陈联武
	郭武
	戴礼荣

关键词 ：声纹识别, 合成语音, 实倒谱

Abstract：With the development of the hidden markov model (HMM) based speech synthesis technology, it is easy for impostors to produce synthetic speech with the specific speakers characteristics, which becomes an enormous threat to the existing speaker recognition system. In this paper, the difference between natural speech and synthetic speech is investigated on the real part of cepstrum. And a speaker recognition system is proposed which is robust against synthetic speech. Experimental results demonstrate that the false accept rate (FAR) for synthetic speech is zero in the proposed system, while that of the existing speaker recognition system is 99.2% with the equal error rate (EER) for natural speech unchanged.

Key words： Speaker Recognition Synthetic Speech Real Part of Cepstrum

收稿日期: 2010-07-22

ZTFLH:

TN912.3

基金资助:国家自然科学基金项目(No.60970161)、中央高校基本科研业务费专项项目(No.XD2100060001)资助

作者简介: 陈联武，男，1986年生，硕士研究生，主要研究方向为说话人识别.E-mail:clw2009@mail.ustc.edu.cn.郭武，男，1973年生，博士，讲师，主要研究方向为说话人与语种识别、确认.戴礼荣，男，1962年生，教授，博士生导师，主要研究方向为语音识别、信号处理.

引用本文:

陈联武，郭武，戴礼荣. 声纹识别中合成语音的鲁棒性[J]. 模式识别与人工智能, 2011, 24(6): 743-762. CHEN Lian-Wu, GUO Wu, DAI Li-Rong. Robust Speaker Recognition against Synthetic Speech. , 2011, 24(6): 743-762.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/ 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2011/V24/I6/743