Abstract
A pitch detection algorithm based on the linear prediction (LP) residual cepstrum is presented. The cepstrum of the LP residual of the speech signal serves as the feature for pitch detection. A voiced/unvoiced decision function is constructed from three features: the residual cepstral peak, the short-time energy, and the short-time zero-crossing rate. This function simplifies the voicing decision and improves its accuracy. During pitch determination, a peak relocation method based on the continuity of pitch is introduced, which effectively reduces pitch-doubling and pitch-halving errors. Comparative experiments show that the proposed algorithm not only improves markedly on the conventional cepstrum method but also outperforms the YIN estimator and the multiscale wavelet method, two of the better-performing existing approaches.
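The core feature described above, the cepstrum of the LP residual, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `lpc_autocorr` and `residual_cepstrum_pitch`, the LP order of 12, the Hamming window, and the 60-400 Hz search range. The voiced/unvoiced decision function and the peak relocation step are not reproduced, because the abstract does not give their exact form.

```python
import numpy as np

def lpc_autocorr(frame, order):
    """LP coefficients a[0..order] (a[0] = 1), autocorrelation method."""
    n = len(frame)
    # Autocorrelation lags r[0..order]; zero lag sits at index n - 1.
    r = np.correlate(frame, frame, mode="full")[n - 1 : n + order]
    # Toeplitz normal equations R a = -r[1:], lightly regularised.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * r[0] * np.eye(order)
    a = np.linalg.solve(R, -r[1 : order + 1])
    return np.concatenate(([1.0], a))

def residual_cepstrum_pitch(frame, fs, order=12, f_min=60.0, f_max=400.0):
    """Return (pitch lag in samples, cepstral peak value) for one frame.

    Assumed parameters: order, f_min, and f_max are illustrative defaults.
    """
    windowed = frame * np.hamming(len(frame))
    a = lpc_autocorr(windowed, order)
    # Inverse filtering with A(z) removes the spectral envelope.
    residual = np.convolve(windowed, a)[: len(frame)]
    nfft = 1 << int(np.ceil(np.log2(2 * len(frame))))
    log_mag = np.log(np.abs(np.fft.rfft(residual, nfft)) + 1e-12)
    cepstrum = np.fft.irfft(log_mag, nfft)  # real cepstrum of the residual
    # Search only quefrencies that correspond to plausible pitch periods.
    q_lo, q_hi = int(fs / f_max), int(fs / f_min)
    lag = q_lo + int(np.argmax(cepstrum[q_lo : q_hi + 1]))
    return lag, float(cepstrum[lag])
```

For a frame `x` sampled at `fs` Hz, `lag, peak = residual_cepstrum_pitch(x, fs)` gives a candidate period, and `fs / lag` is the corresponding fundamental frequency. The returned peak value is the kind of quantity the abstract's decision function combines with short-time energy and zero-crossing rate.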
Key words
Pitch Detection /
Voiced/Unvoiced Classification /
Linear Prediction (LP) /
Cepstrum
References
[1] Hess W. Pitch Determination of Speech Signals. Berlin, Germany: Springer-Verlag, 1983
[2] Yang Xingjun, Chi Huisheng. Digital Processing of Speech Signals. Beijing, China: Publishing House of Electronics Industry, 1995 (in Chinese)
(杨行峻, 迟惠生. 语音信号数字处理. 北京: 电子工业出版社, 1995)
[3] Boersma P. Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-to-Noise Ratio of a Sampled Sound. Proc of the Institute of Phonetic Sciences of the University of Amsterdam, 1993, 17: 97-110
[4] Bao Changchun, Fan Changxin. Pitch Detection Algorithm Based on Normalized Cross-Correlation Function. Journal of China Institute of Communications, 1998, 19(10): 27-29, 31 (in Chinese)
(鲍长春, 樊昌信. 基于归一化互相关函数的基音检测算法. 通信学报, 1998, 19(10): 27-29, 31)
[5] Kobayashi H, Shimamura T. A Modified Cepstrum Method for Pitch Extraction // Proc of the IEEE Asia-Pacific Conference on Circuits and Systems. Chiangmai, Thailand, 1998: 299-302
[6] Ahmadi S, Spanias A S. Cepstrum-Based Pitch Detection Using a New Statistical V/UV Classification Algorithm. IEEE Trans on Speech and Audio Processing, 1999, 7(3): 333-338
[7] Zhang Wenyao, Xu Gang, Wang Yuguo. Circular AMDF and Pitch Estimation Based on It. Acta Electronica Sinica, 2003, 31(6): 886-890 (in Chinese)
(张文耀, 许刚, 王裕国. 循环AMDF及其语音基音周期估计算法. 电子学报, 2003, 31(6): 886-890)
[8] de Cheveigné A, Kawahara H. YIN: A Fundamental Frequency Estimator for Speech and Music. Journal of the Acoustical Society of America, 2002, 111(4): 1917-1930
[9] Kadambe S, Boudreaux-Bartels G F. Application of the Wavelet Transform for Pitch Detection of Speech Signals. IEEE Trans on Information Theory, 1992, 38(2): 917-924
[10] Li Xiangchun, Du Limin. A Pitch Detection Algorithm Using Multiscale Edges Feature Extraction. Acta Electronica Sinica, 2003, 31(10): 1500-1502 (in Chinese)
(李香春, 杜利民. 一种基于多尺度边缘特征提取的基音检测算法. 电子学报, 2003, 31(10): 1500-1502)
[11] Noll A M. Cepstrum Pitch Determination. Journal of the Acoustical Society of America, 1967, 41(2): 293-309
[12] Gu Liang, Liu Runsheng. High-Performance Mandarin Pitch Estimation. Acta Electronica Sinica, 1999, 27(1): 8-11 (in Chinese)
(顾良, 刘润生. 高性能汉语语音基音周期估计. 电子学报, 1999, 27(1): 8-11)