基于层次稀疏DBN的瓶颈特征提取方法<sup>*</sup>

doi:10.16451/j.cnki.issn1003-6059.201502010

摘要
图/表
参考文献
相关文章 (4)

全文: PDF (516 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要针对现有语音特征无法有效利用长时段语音和监督性类别信息，及现有瓶颈特征提取方法耗时过长等缺陷，提出基于层次结构稀疏深度可信神经网络的瓶颈特征提取方法.该方法将重叠组套索作为深度可信神经网络目标函数的稀疏正则项使用，从而构建训练速度更快的稀疏深度可信神经网络.然后利用层次结构的网络架构方式，将两个稀疏深度可信神经网络串联后使用，进一步增强瓶颈特征的判决能力.文中将此瓶颈特征应用于音素识别中，实验表明该特征的有效性.

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	王一
	杨俊安
	刘辉
	柳林

关键词 ：音素识别, 深度可信神经网络(DBN), 重叠组套索, 层次结构

Abstract：To overcome the drawbacks of original speech features that long temporal speeches and the supervised information can not be effectively utilized and the training time cost is high, a bottleneck feature extraction method based on hierarchical deep sparse belief network is presented. The overlapping group lasso is used as the sparse regularization constraint of the objective function of deep belief network to obtain a deep sparse belief network with a higher speed. To make full use of the hierarchical structure, two sparse deep belief networks are connected in series to enhance the discriminant ability of the bottleneck features. The experimental results on phoneme recognition task show that the proposed feature is effective.

Key words： Phoneme Recognition Deep Belief Network(DBN) Overlapping Group Lasso Hierarchical Structure

收稿日期: 2013-09-09

ZTFLH:

TN912.33

基金资助:国家自然科学基金项目(No.60872113)、安徽省自然科学基金项目(No.1208085MF94,1308085QF99)资助

作者简介: 王一(通讯作者)，男，1985年生，博士研究生，主要研究方向为语音识别、数据挖掘等.E-mail:wygggg@126.com.杨俊安，男，1965年生，教授，博士生导师，主要研究方向为信号处理、智能计算等.刘辉，男，1983年生，博士，讲师，主要研究方向为声信号分析与识别技术等.柳林，男，1982年生，硕士，工程师，主要研究方向为语音识别技术.

引用本文:

王一，杨俊安，刘辉，柳林. 基于层次稀疏DBN的瓶颈特征提取方法^*[J]. 模式识别与人工智能, 2015, 28(2): 173-180. WANG Yi, YANG Jun-An, LIU Hui, LIU Lin. Bottleneck Feature Extraction Method Based on Hierarchical Deep Sparse Belief Network. , 2015, 28(2): 173-180.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/10.16451/j.cnki.issn1003-6059.201502010 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2015/V28/I2/173