一种基于人耳听觉感知和子带补偿滤波的鲁棒语言辨识特征参数提取算法

摘要
图/表
参考文献
相关文章 (8)

全文: PDF (380 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要针对目前语言辨识系统所采用的特征参数没有充分考虑人耳听觉机制、鲁棒性较差的问题，提出一种符合人耳听觉感知特性的鲁棒语言辨识参数提取算法。该算法主要从两个方面提高特征参数的鲁棒性:在计算各子带能量时采用更符合人耳感知特性的Gammachirp滤波器组代替常用的三角滤波器组;为每一子带通道设计一个补偿滤波器。子带补偿滤波器的设计采用数据驱动的策略，通过补偿使得各子带滤波器输出信号的失真及环境噪音导致的失真同时达到最小。实验表明，文中所提出的特征在常见噪声环境下，性能均优于目前普遍使用的Mel频率倒谱系数特征及其衍生参数。关键词听觉感知，补偿滤波器，鲁棒性，语言辨识中图法分类号TN912。3ARobustFeatureParameterExtractionAlgorithmforLanguageIdentificationBasedonAudioPerceptionandSub-BandCompensationFilteringHUANGShan-Qi，ZHANGLing-Hai，QUDan(InstituteofInformationEngineering，InformationEngineeringUniversityofPLA，Zhengzhou450002)ABSTRACTIncurrentlanguageidentificationsystem，thecommonlyusedfeatureparametershavenotmadethebestuseofauditorycharacteristicsandhaveweakrobustnessincomplexenvironments。Anauditory-basedrobustfeatureextractionalgorithmisproposed。Eachsub-bandenergyoftheextractedauditoryfeaturesiscalculatedbyusingaGammachirpfilterbankinsteadofthecommonlyusedtrianglefilterbank。Thecompensationfilterusingdata-drivenanalysisforeachsub-bandoutputisobtainedbyaconstrainedoptimizationprocesswhichjointlyminimizestheenvironmentaldistortionaswellasthedistortioncausedbythefilteritself。ExperimentalresultsshowthatthefeatureoutperformstheMel-frequencycepstralcoefficientwidelyusedinnoisyenvironments。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	黄山奇
	张连海
	屈丹

关键词 ：听觉感知, 补偿滤波器, 鲁棒性, 语言辨识

Abstract：In current language identification system, the commonly used feature parameters have not made the best use of auditory characteristics and have weak robustness in complex environments. An auditory-based robust feature extraction algorithm is proposed. Each sub-band energy of the extracted auditory features is calculated by using a Gammachirp filter bank instead of the commonly used triangle filter bank. The compensation filter using data-driven analysis for each sub-band output is obtained by a constrained optimization process which jointly minimizes the environmental distortion as well as the distortion caused by the filter itself. Experimental results show that the feature outperforms the Mel-frequency cepstral coefficient widely used in noisy environments.

Key words： Audio Percetion Compensation Filter Robust Language Identification

收稿日期: 2010-10-25

ZTFLH:

TN912.3

基金资助:国家自然科学基金资助项目(No.61175017)

作者简介: 黄山奇，男，1984年生，硕士研究生，主要研究方向为语种识别、说话人识别，E-mail:luckyhil。com。张连海，男，1971年生，博士，副教授，主要研究方向为语音信号处理。屈丹，女，1974年生，博士，副教授，主要研究方向为语音信号处理及模式识别。

引用本文:

黄山奇，张连海，屈丹. 一种基于人耳听觉感知和子带补偿滤波的鲁棒语言辨识特征参数提取算法[J]. 模式识别与人工智能, 2012, 25(1): 166-171. HUANG Shan-Qi, ZHANG Ling-Hai, QU Dan. A Robust Feature Parameter Extraction Algorithm for Language Identification. , 2012, 25(1): 166-171.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/ 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2012/V25/I1/166