A Pitch Detection Algorithm Based on Linear Prediction Residual Cepstrum
JIN XueCheng1,2, WANG ZengFu1
1.Department of Automation, University of Science and Technology of China, Hefei 230027 2.Dispatching Center, Jiangxi Electric Power Corporation, Nanchang 330077
Abstract:An algorithm based on the linear prediction (LP) residual cepstrum for pitch detection is presented. The cepstrum of linear prediction residual of the speech signal is used to be the information for pitch determination. Voicing decisions are made based on a decision function consisting of prediction residual cepstral peak, energy and zerocrossing rate of shorttime segments of the speech signal. By this decision function the procedure of voicing decision is greatly simplified and the accuracy of voiced/unvoiced classification is improved significantly. Based on the consecution of pitch, a peak relocation method is introduced in the process of pitch determination to resolve the problems of pitch doubling and pitch halving. The results of the contrast experiment show that the proposed algorithm not only obtains a considerable improvement compared with the conventional cepstrum method, but also performs better than YIN estimator and multiscale wavelet method, which are effective admittedly.
金学成,汪增福. 基于线性预测残差倒谱的基音周期检测[J]. 模式识别与人工智能, 2008, 21(1): 104-110.
JIN XueCheng, WANG ZengFu. A Pitch Detection Algorithm Based on Linear Prediction Residual Cepstrum. , 2008, 21(1): 104-110.
[1] Hess W. Pitch Determination of Speech Signals. Berlin, Germany: SpringerVerlag, 1983 [2] Yang Xingjun, Chi Huisheng. Digital Processing of Speech Signals. Beijing, China: Publishing House of Electronics Industry, 1995 (in Chinese) (杨行峻, 迟惠生. 语音信号数字处理. 北京: 电子工业出版社, 1995) [3] Paul B. Accurate ShortTerm Analysis of the Fundamental Frequency and the HarmonicstoNoise Ratio of a Sampled Sound. Proc of the Institute of Phonetic Sciences of the University of Amsterdam, 1993, 17: 97110 [4] Bao Changchun, Fan Changxin. Pitch Detection Algorithm Based on Normalized CrossCorrelation Function. Journal of China Institute of Communications, 1998, 19(10): 2729,31 (in Chinese) (鲍长春, 樊昌信. 基于归一化互相关函数的基音检测算法. 通信学报, 1998, 19(10): 2729,31) [5] Kobayashi H, Shimamura T. A Modified Cepstrum Method for Pitch Extraction // Proc of the IEEE AsiaPacific Conference on Circuits and Systems. Chiangmai, Thailand, 1998: 299302 [6] Ahmadi S, Spanias A S. CepstrumBased Pitch Detection Using a New Statistical V/UV Classification Algorithm. IEEE Trans on Speech and Audio Processing, 1999, 7(3): 333338 [7] Zhang Wenyao, Xu Gang, Wang Yuguo. Circular AMDF and Pitch Estimation Based on It. Acta Electronica Sinica, 2003, 31(6): 886890 (in Chinese) (张文耀, 许 刚, 王裕国. 循环AMDF及其语音基音周期估计算法. 电子学报, 2003, 31(6): 866890) [8] de Cheveigné A, Kawahara H. YIN: A Fundamental Frequency Estimator for Speech and Music. Journal of the Acoustical Society of America, 2002, 111(4): 19171930 [9] Kadambe S, BoudreauxBartels G F. Application of the Wavelet Transform for Pitch Detection of Speech Signals. IEEE Trans on Information Theory, 1992, 38(2): 917924 [10] Li Xiangchun, Du Limin. A Pitch Detection Algorithm Using Multiscale Edges Feature Extraction. Acta Electronica Sinica, 2003, 31(10): 15001502 (in Chinese) (李香春, 杜利民. 一种基于多尺度边缘特征提取的基音检测算法. 电子学报, 2003, 31(10): 15001502) [11] Noll A M. Cepstrum Pitch Determination. Journal of the Acoustical Society of America, 1967, 41(2): 293309 [12] Gu Liang, Liu Runsheng. HighPerformance Mandarin Pitch Estimation. Acta Electronica Sinica, 1999, 27(1): 811 (in Chinese) (顾 良, 刘润生. 高性能汉语语音基音周期估计. 电子学报, 1999, 27(1): 811)