Abstract: A semantic auto-encoder structure augmented with a relation network is proposed for zero-shot recognition, to address the projection domain shift problem and to improve the robustness of the distance-based similarity measures used in traditional zero-shot recognition models. The proposed algorithm builds the mapping between image visual features and semantic vectors with a semantic auto-encoder. The reconstructed vector is then concatenated with the ground-truth semantic vector of each candidate class and fed to a neural network, and the output scalar determines the predicted category. Experimental results show that, compared with traditional distance-based measures, the proposed algorithm achieves higher recognition rates on the public datasets AWA, CUB and ImageNet-2, and that its semantic-to-visual projection outperforms the reverse projection on some datasets.
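The pipeline described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the dimensions, data, and relation-module weights are toy values (a trained model would learn the relation weights with a regression loss), and the semantic auto-encoder is solved in the standard way via a Sylvester equation, projecting visual features into the semantic space and then scoring each (projection, class prototype) pair with a small network whose scalar output ranks the classes.

```python
import numpy as np
from scipy.linalg import solve_sylvester

rng = np.random.default_rng(0)

# Toy dimensions (illustrative only): 16-dim visual features,
# 5-dim semantic (attribute) vectors, 4 classes, 20 samples.
d, a, k, n = 16, 5, 4, 20
X = rng.normal(size=(d, n))        # visual features, one column per sample
labels = rng.integers(0, k, size=n)
P = rng.normal(size=(a, k))        # per-class semantic prototypes
S = P[:, labels]                   # per-sample semantic vectors

# Semantic auto-encoder: minimise
#   ||X - W.T @ S||^2 + lam * ||W @ X - S||^2,
# whose optimum solves the Sylvester equation
#   (S S^T) W + W (lam X X^T) = (1 + lam) S X^T.
lam = 1.0
A = S @ S.T
B = lam * (X @ X.T)
C = (1 + lam) * (S @ X.T)
W = solve_sylvester(A, B, C)       # W: (a, d), maps visual -> semantic

def relation_score(s_hat, proto, params):
    """Tiny relation module: score one (projection, prototype) pair.

    Weights are random here purely to show the wiring; in the paper's
    setting they would be trained end to end.
    """
    W1, b1, W2, b2 = params
    h = np.maximum(0.0, W1 @ np.concatenate([s_hat, proto]) + b1)  # ReLU
    return float(W2 @ h + b2)      # scalar relation score

hidden = 8
params = (rng.normal(size=(hidden, 2 * a)), np.zeros(hidden),
          rng.normal(size=hidden), 0.0)

# Classify one test sample: project it to semantic space, score it
# against every class prototype, and predict the argmax.
s_hat = W @ X[:, 0]
scores = np.array([relation_score(s_hat, P[:, c], params) for c in range(k)])
pred = int(np.argmax(scores))
print("relation scores:", np.round(scores, 3), "predicted class:", pred)
```

The key design point the abstract emphasizes is that the fixed distance metric (e.g. Euclidean or cosine) of traditional zero-shot pipelines is replaced by the learned `relation_score`, so the similarity measure itself adapts to the data.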