[1] LECUN Y, BENGIO Y. Convolutional Networks for Images, Speech, and Time Series // The Handbook of Brain Theory and Neural Networks. Cambridge, USA: The MIT Press, 1995.
[2] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet Classification with Deep Convolutional Neural Networks[C/OL]. [2017-11-20]. https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf.
[3] SIMONYAN K, ZISSERMAN A. Very Deep Convolutional Networks for Large-Scale Image Recognition[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1409.1556.pdf.
[4] SZEGEDY C, LIU W, JIA Y Q, et al. Going Deeper with Convolutions[C/OL]. [2017-11-20]. https://arxiv.org/pdf/1409.4842.pdf.
[5] HE K M, ZHANG X Y, REN S Q, et al. Deep Residual Learning for Image Recognition // Proc of the IEEE International Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2016: 770-778.
[6] DO M N, VETTERLI M. The Finite Ridgelet Transform for Image Representation. IEEE Transactions on Image Processing, 2003, 12(1): 16-28.
[7] CANDÈS E J, DONOHO D L. Curvelets, Multiresolution Representation, and Scaling Laws[J/OL]. [2017-11-20]. http://statweb.stanford.edu/~candes/papers/SPIE_Curvelets.pdf.
[8] DO M N, VETTERLI M. The Contourlet Transform: An Efficient Directional Multiresolution Image Representation. IEEE Transactions on Image Processing, 2005, 14(12): 2091-2106.
[9] DONOHO D L. Wedgelets: Nearly Minimax Estimation of Edges. The Annals of Statistics, 1999, 27(3): 859-897.
[10] LE PENNEC E, MALLAT S. Sparse Geometric Image Representations with Bandelets. IEEE Transactions on Image Processing, 2005, 14(4): 423-438.
[11] VELISAVLJEVIC V, BEFERULL-LOZANO B, VETTERLI M, et al. Directionlets: Anisotropic Multidirectional Representation with Separable Filtering. IEEE Transactions on Image Processing, 2006, 15(7): 1916-1933.
[12] HAMMOND D K, VANDERGHEYNST P, GRIBONVAL R. Wavelets on Graphs via Spectral Graph Theory. Applied and Computational Harmonic Analysis, 2011, 30(2): 129-150.
[13] BRUNA J, MALLAT S. Invariant Scattering Convolution Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(8): 1872-1886.
[14] 徐璨. 可逆深度卷积网络的构建. 硕士学位论文. 上海: 上海交通大学, 2017.
(XU C. The Construction of Reversible Deep Convolution Networks. Master Dissertation. Shanghai, China: Shanghai Jiao Tong University, 2017.)
[15] BAMBERGER R H, SMITH M J. A Filter Bank for the Directional Decomposition of Images: Theory and Design. IEEE Transactions on Signal Processing, 1992, 40(4): 882-893.
[16] CANDÈS E J, DEMANET L, DONOHO D, et al. Fast Discrete Curvelet Transforms. Multiscale Modeling & Simulation, 2006, 5(3): 861-899.
[17] ZHOU Z H, FENG J. Deep Forest: Towards an Alternative to Deep Neural Networks[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1702.08835.pdf.
[18] ZEILER M D, KRISHNAN D, TAYLOR G W, et al. Deconvolutional Networks // Proc of the IEEE International Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2010: 2528-2535.
[19] ZEILER M D, TAYLOR G W, FERGUS R. Adaptive Deconvolutional Networks for Mid and High Level Feature Learning // Proc of the IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2011: 2018-2025.
[20] DOSOVITSKIY A, BROX T. Inverting Visual Representations with Convolutional Networks[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1506.02753.pdf.
[21] ZHANG J, ZHONG P, CHEN Y, et al. L1/2-Regularized Deconvolution Network for the Representation and Restoration of Optical Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing, 2014, 52(5): 2617-2627.
[22] SIMON M, RODNER E. Neural Activation Constellations: Unsupervised Part Model Discovery with Convolutional Networks // Proc of the IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2015: 1143-1151.
[23] ZHANG Q S, WU Y N, ZHU S C. Interpretable Convolutional Neural Networks[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1710.00935.pdf.
[24] BAKIR G H, HOFMANN T, SCHÖLKOPF B, et al. Predicting Structured Data. Cambridge, USA: The MIT Press, 2007.
[25] MESHI O, SONTAG D, JAAKKOLA T, et al. Learning Efficiently with Approximate Inference via Dual Losses // Proc of the 27th International Conference on Machine Learning. New York, USA: Omnipress, 2010: 783-790.
[26] HAZAN T, URTASUN R. A Primal-Dual Message-Passing Algorithm for Approximated Large Scale Structured Prediction // Proc of the 23rd International Conference on Neural Information Processing Systems. Cambridge, USA: The MIT Press, 2010: 838-846.
[27] DOMKE J. Parameter Learning with Truncated Message-Passing // Proc of the IEEE International Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2011: 2937-2943.
[28] ROSS S, MUNOZ D, HEBERT M, et al. Learning Message-Passing Inference Machines for Structured Prediction // Proc of the IEEE International Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2011: 2737-2744.
[29] STOYANOV V, ROPSON A, EISNER J. Empirical Risk Minimization of Graphical Model Parameters Given Approximate Inference, Decoding, and Model Structure // Proc of the 14th International Conference on Artificial Intelligence and Statistics. New York, USA: ACM, 2011: 725-733.
[30] DAUMÉ III H, LANGFORD J, MARCU D. Search-Based Structured Prediction. Machine Learning, 2009, 75(3): 297-325.
[31] ROSS S, GORDON G J, BAGNELL J A. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning // Proc of the 14th International Conference on Artificial Intelligence and Statistics. New York, USA: ACM, 2011: 627-635.
[32] TOMPSON J, JAIN A, LECUN Y, et al. Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation[C/OL]. [2017-11-20]. https://arxiv.org/pdf/1406.2984.pdf.
[33] TOMPSON J, GOROSHIN R, JAIN A, et al. Efficient Object Localization Using Convolutional Networks // Proc of the IEEE International Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2015: 648-656.
[34] GAL Y, GHAHRAMANI Z. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning[C/OL]. [2017-11-20]. https://arxiv.org/pdf/1506.02142.pdf.
[35] CHU X, OUYANG W L, LI H S, et al. Structured Feature Learning for Pose Estimation // Proc of the IEEE International Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2016: 4715-4723.
[36] ZHENG S, JAYASUMANA S, ROMERA-PAREDES B, et al. Conditional Random Fields as Recurrent Neural Networks // Proc of the IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2015: 1529-1537.
[37] BELANGER D, MCCALLUM A. Structured Prediction Energy Networks[C/OL]. [2017-11-20]. https://arxiv.org/pdf/1511.06350.pdf.
[38] HAN S, POOL J, TRAN J, et al. Learning both Weights and Connections for Efficient Neural Network // CORTES C, LAWRENCE N D, LEE D D, et al., eds. Advances in Neural Information Processing Systems 28. Cambridge, USA: The MIT Press, 2015: 1135-1143.
[39] LI H, KADAV A, DURDANOVIC I, et al. Pruning Filters for Efficient ConvNets[C/OL]. [2017-11-20]. https://arxiv.org/pdf/1608.08710v1.pdf.
[40] WEN W, WU C P, WANG Y D, et al. Learning Structured Sparsity in Deep Neural Networks[C/OL]. [2017-11-20]. https://arxiv.org/pdf/1608.03665.pdf.
[41] MAIRAL J, BACH F, PONCE J. Sparse Modeling for Image and Vision Processing[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1411.3230.pdf.
[42] KIROS R, SALAKHUTDINOV R, ZEMEL R. Multimodal Neural Language Models // Proc of the 31st International Conference on Machine Learning. New York, USA: ACM, 2014: 595-603.
[43] VINYALS O, TOSHEV A, BENGIO S, et al. Show and Tell: A Neural Image Caption Generator[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1411.4555v1.pdf.
[44] KARPATHY A, LI F F. Deep Visual-Semantic Alignments for Generating Image Descriptions // Proc of the IEEE International Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2015: 3128-3137.
[45] DONAHUE J, HENDRICKS L A, GUADARRAMA S, et al. Long-Term Recurrent Convolutional Networks for Visual Recognition and Description. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4): 677-691.
[46] MAO J H, XU W, YANG Y, et al. Deep Captioning with Multimodal Recurrent Neural Networks(m-RNN)[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1412.6632.pdf.
[47] XU K, BA J, KIROS R, et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1502.03044.pdf.
[48] JIA X, GAVVES E, FERNANDO B, et al. Guiding Long-Short Term Memory for Image Caption Generation[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1509.04942.pdf.
[49] YOU Q Z, JIN H L, WANG Z W, et al. Image Captioning with Semantic Attention[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1603.03925.pdf.
[50] CHEN L, ZHANG H W, XIAO J, et al. SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1611.05594.pdf.
[51] LU J S, XIONG C M, PARIKH D, et al. Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning[J/OL]. [2017-11-20]. https://arxiv.org/pdf/1612.01887.pdf.
[52] WU Q, SHEN C H, LIU L Q, et al. What Value Do Explicit High Level Concepts Have in Vision to Language Problems? [J/OL]. [2017-11-20]. https://arxiv.org/pdf/1506.01144.pdf. |