利用语音非线性特征改进说话人识别的性能

Abstract
Figure/Table
References
Related Citation (15)

Download: PDF (793 KB) HTML (1 KB)
Export: BibTeX | EndNote (RIS)

Abstract Chaotic characteristics in speech by calculating the maximum Lyapunov exponents of 38 Mandarin phonemes are presented. The physical significance of three nonlinear features of human speech, i.e. the largest Lyapunov exponent, the secondorder dynamical entropy, and the fractal dimension, is studied. A speaker recognition system based on the Gaussian mixture model is established. On the decision layer, the recognition results obtained from MFCC and nonlinear dynamics are combined in a serial manner to give an improved performance. The experimental result shows nonlinear dynamics coefficients can distinguish different speaker and aid speaker identification only by MFCC features.

Key words： Speaker Identification Chaos Maximum Lyapunov Exponent

Received: 11 May 2005

ZTFLH:

TP391

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	HOU LiMin
	DENG DeChun
	WANG ShuoZhong

Cite this article:

HOU LiMin,DENG DeChun,WANG ShuoZhong. Improvement of Speaker Identification Performance Using Nonlinear Features[J]. , 2006, 19(6): 776-781.

URL:

http://manu46.magtech.com.cn/Jweb_prai/EN/ OR http://manu46.magtech.com.cn/Jweb_prai/EN/Y2006/V19/I6/776