Abstract:The poor performance of speaker verification system results from the channel mismatch and the lack of distinction between statistical models. A text-independent speaker verification method is proposed which combines the channel compensation based on factor analysis and the discriminative support vector machine (SVM) model. Gaussian mixture model (GMM) is used to make the speech parameter clustered and ascended, then the channel information of GMM mean super-vectors is wiped off by using factor analysis. The parameters, which are used as inputting parameters, are employed for the construction of SVM speaker verification system. The proposed method solves the problems of large samples, dimension raising and channel mismatch effectively when SVM is used for the text-independent speaker verification. Experimental results on NIST 06 male speaker corpus show that the proposed method improves system performance. Compared with the baseline system Gaussian mixture model-universal background model (GMM-UBM), GMM-SVM without channel compensation, the system improves the equal error rate (EER) more than 50%.Compared with the system factor analysis (FA)-GMM-UBM which uses channel compensation based on factor analysis without discriminative models, it also gets the improvement of EER by 15.8%.
[1] Reynolds D A, Quatieri T F, Dunn R B. Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing, 2000, 10(1): 19-41 [2] Vapnik V N.The Nature of Statistical Learning Theory. New York, USA: Springer-Verlag, 1995 [3] Reynolds D A. Channel Robust Speaker Verification via Feature Mapping // Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Hongkong, China, 2003, Ⅱ: 53-56 [4] Teunen R, Shahshahani B, Heck L. A Model-Based Transformational Approach to Robust Speaker Recognition // Proc of the 6th International Conference on Spoken Language Processing. Beijing, China, 2000, Ⅱ: 495-498 [5] Yin Shouchun, Rose R, Kenny P. A Joint Factor Analysis Approach to Progressive Model Adaptation in Text Independent Speaker Verification. IEEE Trans on Audio, Speech and Language Processing, 2007, 15(7): 1999-2010 [6] Matrouf D, Scheffer N, Fauve B, et al. A Straightforward and Efficient Implementation of the Factor Analysis Model for Speaker Verification [EB/OL]. [2007-08-29]. http://lia.univ-avignon.fr/fileadmin/documents/Users/Intranet/fich-art/987-Matrouf-IS07.pdf [7] Campbell W M, Sturim D E, Reynolds D A. Support Vector Machines Using GMM Supervectors for Speaker Verification. IEEE Signal Processing Letters, 2006, 13(5): 308-311 [8] The NIST Year 2006 Speaker Recognition Evaluation Plan [EB/OL]. [2006-03-08]. http://www.nist.gov/speech/tests/spk/2006/sre-06_evalplan-v9.pdf [9] Burget L, Matejka P, Schwarz P, et al. Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System. IEEE Trans on Audio, Speech and Language Processing, 2007, 15(7): 1979-1986