模式识别与人工智能
Thursday, Apr. 3, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2013, Vol. 26 Issue (1): 1-5    DOI:
Orignal Article Current Issue| Next Issue| Archive| Adv Search |
Speaker Clustering of Telephone Speech Based on Front-End Factor Analysis
WU Kui,SONG Yan,DAI Li-Rong
Department of Electronic Engineering Information Science,University of Science and Technology of China,Hefei 230027

Download: PDF (335 KB)   HTML (0 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  The existing speaker clustering methods based on Gaussian mixture model (GMM) mainly obtain clusters′ GMMs by adapting from universal background model (UBM). However,this adaptive method suffers from the lack of data and results in poor models. In this paper,two factor analysis modeling methods based on eigenvoice (EV) space analysis and total variability (TV) space analysis respectively are explored. The two methods greatly reduce the number of estimated parameters when clusters′ GMMs are estimated by modeling variability space. The experimental results on two speakers telephone data in 2008 NIST Speaker Recognition Evaluation show that the two proposed methods achieve considerable reduction in speaker error rate compared to the baseline system using MAP adaptation,and the method based on TV space analysis obtains lower speaker error rate compared to the method based on EV space analysis.
Key wordsSpeaker Clustering      Eigenvoice Space      Total Variability Space      Cross Likelihood Ratio     
Received: 26 December 2011     
ZTFLH: TP391.42  
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
WU Kui
SONG Yan
DAI Li-Rong
Cite this article:   
WU Kui,SONG Yan,DAI Li-Rong. Speaker Clustering of Telephone Speech Based on Front-End Factor Analysis[J]. , 2013, 26(1): 1-5.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2013/V26/I1/1
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn