A Speaker Recognition Algorithm Based on Speech Component Unit
HUANG Chang-Cun, WANG Zeng-Fu
Department of Automation, University of Science and Technology of China, Hefei 230027
Abstract Linear prediction coefficients (LPC) are selected as features to construct the feature space. First, the training samples of each speaker enrolled in the system are clustered in the feature space with the k-means algorithm to form initial clusters. The initial clustering is then refined with the iterative Gaussian mixture model (GMM) algorithm to obtain the speech component unit representation of that speaker. Based on these speech component units, a text-independent speaker verification method, called the averaging method, and a text-independent speaker identification method are presented. Experimental results show that the proposed algorithm performs satisfactorily even on short utterances.
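The pipeline summarized in the abstract (k-means initialization followed by iterative GMM refinement, then scoring an utterance against each speaker's model) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: synthetic 2-D vectors stand in for LPC feature vectors, the mixtures use diagonal covariances, and the score is a generic average per-frame log-likelihood, which is not necessarily the authors' averaging method. All function names (`kmeans`, `fit_gmm`, `avg_log_likelihood`, `synth`) are hypothetical.

```python
import math
import random

def kmeans(points, k, iters=20, seed=0):
    """Plain k-means; returns k centroids used to initialise the mixture."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: sum(
                (a - b) ** 2 for a, b in zip(p, centroids[c])))
            groups[nearest].append(p)
        for c, g in enumerate(groups):
            if g:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(g) for col in zip(*g)]
    return centroids

def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian (assumed form)."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (a - m) ** 2 / v)
               for a, m, v in zip(x, mean, var))

def fit_gmm(points, k, em_iters=30, var_floor=1e-3, seed=0):
    """k-means initialisation, then EM refinement of weights, means, variances."""
    dim, n = len(points[0]), len(points)
    means = kmeans(points, k, seed=seed)
    variances = [[1.0] * dim for _ in range(k)]
    weights = [1.0 / k] * k
    for _ in range(em_iters):
        # E-step: posterior responsibility of each component for each frame
        resp = []
        for x in points:
            logs = [math.log(weights[c]) + log_gauss(x, means[c], variances[c])
                    for c in range(k)]
            top = max(logs)
            probs = [math.exp(v - top) for v in logs]
            total = sum(probs)
            resp.append([p / total for p in probs])
        # M-step: re-estimate each component from its weighted frames
        for c in range(k):
            nc = sum(r[c] for r in resp)
            if nc < 1e-8:  # skip a component that has lost all its support
                continue
            weights[c] = nc / n
            means[c] = [sum(r[c] * x[d] for r, x in zip(resp, points)) / nc
                        for d in range(dim)]
            variances[c] = [max(var_floor, sum(
                r[c] * (x[d] - means[c][d]) ** 2
                for r, x in zip(resp, points)) / nc) for d in range(dim)]
    return weights, means, variances

def avg_log_likelihood(points, model):
    """Average per-frame log-likelihood of an utterance under one model."""
    weights, means, variances = model
    total = 0.0
    for x in points:
        logs = [math.log(w) + log_gauss(x, m, v)
                for w, m, v in zip(weights, means, variances)]
        top = max(logs)
        total += top + math.log(sum(math.exp(v - top) for v in logs))
    return total / len(points)

def synth(centres, n, seed):
    """Synthetic 2-D stand-in for LPC feature vectors (assumption)."""
    rng = random.Random(seed)
    return [[rng.gauss(cx, 0.3), rng.gauss(cy, 0.3)]
            for cx, cy in centres for _ in range(n)]

# Enrol two hypothetical speakers, then identify a short test utterance.
speaker_a = synth([(0.0, 0.0), (3.0, 3.0)], 100, seed=1)
speaker_b = synth([(6.0, 0.0), (9.0, 3.0)], 100, seed=2)
model_a = fit_gmm(speaker_a, k=2)
model_b = fit_gmm(speaker_b, k=2)
test_utt = synth([(0.0, 0.0), (3.0, 3.0)], 10, seed=3)  # drawn like speaker A
scores = {"A": avg_log_likelihood(test_utt, model_a),
          "B": avg_log_likelihood(test_utt, model_b)}
identified = max(scores, key=scores.get)
print(identified, scores)
```

In this sketch, identification picks the enrolled model with the highest average score. A verification decision could compare the claimed speaker's score with a threshold, whose value would have to be tuned on held-out data.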
Received: 13 March 2007