Abstract:A hidden markov model (HMM)-universal background model (UBM) algorithm for the voice print password system is proposed. Due to the sparseness of the enrollment data in the voice print password system, a mono-phone HMM-UBM is firstly trained by using the speaker-independent database. Then, the hypothesized speaker model is obtained by adapting the parameters of the UBM using the speaker’s training speech and the maximum a posteriori (MAP) estimation. The data sparseness problem is thus solved. The equal error rate (EER) of the proposed system is 6.8% on the IFIY-DESKTOP Ⅱ database.
[1] Compbell J P Jr.Speaker Recognition: A Tutorial.Proc of the IEEE,1997,85(9): 1437-1462 [2] Reynolds D A,Quatieri T F,Dunn R B.Speaker Verification Using Adapted Gaussian Mixture Models.Digital Signal Processing,2000,10(1/2/3): 19-41 [3] Boakye K,Peskin B.Text-Constrained Speaker Recognition on a Text-Independent Task [EB/OL].[2004-1-6].Http://www.icsi.berkeley.edu/ftp/pub/speech/papers/spkrodyssey04-kofi.pdf [4] Chen Yan,Hong Qingyang.Voiceprint Verification Based on Two-Level Decision HMM-UBM // Proc of the 1st International Conference on Information Science and Engineering.Nanjing,China,2009: 3356-3359 [5] Li Xiaohan,Huang Nanchen,Dai Beiqian,et al.Research on the HMM-UBM and Short Text Based Speaker Verification.Information and Control,2004,33(6): 762-764 (in Chinese) (李萧寒,黄南晨,戴蓓蒨,等.基于HMM-UBM和短语音的说话人身份确认.信息与控制,2004,33(6): 762-764) [6] Rabiner L R.A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition.Proc of the IEEE,1989,77(2): 257-286 [7] Buyuk O,Arslan L M.HMM-Based Text-Dependent Speaker Recognition with Handset-Channel Recognition // Proc of the 18th IEEE Signal Processing and Communications Applications Conference.Diyarbakir,Turkey,2010: 383-386 [8] Lamel L F,Rabiner L R,Rosenberg A E,et al.An Improved Endpoint Detector for Isolated Word Recognition.IEEE Trans on Acoustics,Speech and Signal Processing,1981,29(4): 777-785 [9] Hermansky H,Morgan N,Bayya A,et al.RASTA-PLP Speech Analysis.ICSI Technical Report,TR-91-069.Berkeley,USA: International Computer Science Institute,1991 [10] Myers C,Rabiner L R,Rosenberg A E.Performance Tradeoffs in Dynamic Time Warping Algorithms for Isolated Word Recognition.IEEE Trans on Acoustics,Speech and Signal Processing,1980,28(6): 623-635