Pattern Recognition and Artificial Intelligence (模式识别与人工智能)
2008, Vol. 21, Issue 1: 104-110
Research and Applications
A Pitch Detection Algorithm Based on Linear Prediction Residual Cepstrum
JIN XueCheng (金学成)1,2, WANG ZengFu (汪增福)1
1. Department of Automation, University of Science and Technology of China, Hefei 230027
2. Dispatching Center, Jiangxi Electric Power Corporation, Nanchang 330077

Abstract: A pitch detection algorithm based on the linear prediction (LP) residual cepstrum is proposed. The cepstrum of the LP residual of the speech signal serves as the feature for pitch detection. Three features, the residual cepstral peak, the short-time energy, and the short-time zero-crossing rate, are combined into a single voiced/unvoiced decision function, which simplifies the voicing decision and improves its accuracy. During pitch period detection, a peak relocation method based on the pitch continuity principle is introduced, which effectively reduces pitch doubling and pitch halving errors. Comparative experiments show that the proposed algorithm not only clearly outperforms the conventional cepstrum method but also performs better than the YIN algorithm and the multiscale wavelet method, both of which are regarded as effective.
Key words: Pitch Detection; Voiced/Unvoiced Classification; Linear Prediction (LP); Cepstrum
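As a rough illustration of the core idea in the abstract, the following sketch estimates a pitch value for one frame from the cepstrum of its LP residual, and also computes the two other features the abstract names (short-time energy and zero-crossing rate). It is only a minimal sketch under stated assumptions: the function names, the LP order of 12, the Hamming window, the 1024-point FFT, and the 60 to 400 Hz search range are illustrative choices, not values taken from the paper. The paper's voiced/unvoiced decision function and its peak relocation step are not specified on this page, so they are not reproduced here.

```python
import numpy as np


def lp_coefficients(frame, order):
    """Autocorrelation-method LP via the Levinson-Durbin recursion.

    Returns a with a[0] == 1, so the inverse filter is
    A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order.
    """
    n = len(frame)
    r = np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])
    r[0] = r[0] * (1.0 + 1e-9) + 1e-12  # tiny regularisation for silent frames
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1:0:-1]
        a[i] = k
        err *= 1.0 - k * k
    return a


def residual_cepstrum_pitch(frame, fs, order=12, nfft=1024,
                            f0_min=60.0, f0_max=400.0):
    """Return (f0 estimate in Hz, cepstral peak value) for one frame.

    All default parameter values are assumptions made for this sketch.
    """
    windowed = frame * np.hamming(len(frame))
    a = lp_coefficients(windowed, order)
    # Inverse-filter the frame with A(z) to obtain the LP residual.
    residual = np.convolve(windowed, a)[: len(frame)]
    # Real cepstrum: inverse FFT of the log magnitude spectrum.
    spectrum = np.abs(np.fft.rfft(residual, n=nfft))
    cepstrum = np.fft.irfft(np.log(spectrum + 1e-10))
    # Search for the peak only within the plausible pitch-period range.
    q_lo, q_hi = int(fs / f0_max), int(fs / f0_min)
    lag = q_lo + int(np.argmax(cepstrum[q_lo:q_hi + 1]))
    return fs / lag, float(cepstrum[lag])


def short_time_energy(frame):
    """Mean squared amplitude of the frame."""
    return float(np.mean(frame ** 2))


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    signs = np.signbit(frame)
    return float(np.mean(signs[1:] != signs[:-1]))
```

In a complete detector, the returned cepstral peak value would be combined with the energy and zero-crossing rate to decide whether the frame is voiced, and the per-frame estimates would then be smoothed using pitch continuity; how the paper does either is not described here.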
    
CLC number: TN912.3
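About the authors: JIN XueCheng, male, born in 1978, Ph.D. candidate. His research interests include speech signal processing, pattern recognition, and affective computing. E-mail: xcking@mail.ustc.edu.cn. WANG ZengFu, male, born in 1960, professor and Ph.D. supervisor. His research interests include stereo vision, biometric recognition, affective computing, and intelligent robots. E-mail: zfwang@ustc.edu.cn.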
Cite this article:
JIN XueCheng, WANG ZengFu. A Pitch Detection Algorithm Based on Linear Prediction Residual Cepstrum[J]. Pattern Recognition and Artificial Intelligence, 2008, 21(1): 104-110.
Link to this article:
http://manu46.magtech.com.cn/Jweb_prai/CN/ or http://manu46.magtech.com.cn/Jweb_prai/CN/Y2008/V21/I1/104