Automatic Image Annotation Combining Semantic Neighbors and Deep Features
KE Xiao, ZHOU Mingke, NIU Yuzhen
College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116; Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing, Fuzhou University, Fuzhou 350116
Abstract: In traditional image annotation methods, manually selecting features is time-consuming and laborious, and traditional label propagation algorithms ignore semantic neighbors, so neighbors that are visually similar but semantically dissimilar are retrieved and the annotation results are degraded. To solve these problems, an automatic image annotation method combining semantic neighbors and deep features is proposed. Firstly, a unified and adaptive deep feature extraction framework is constructed based on a deep convolutional neural network. Then, the training set is divided into semantic groups and the neighborhood image sets of the unannotated images are constructed. Finally, the contribution value of each label of the neighborhood images is calculated according to the visual distance, and the keywords are obtained by ranking these contribution values. Experiments on benchmark datasets show that, compared with traditional combined features, the proposed deep feature has a lower dimension and a better effect. The problem of visually similar but semantically dissimilar neighbors in visual nearest-neighbor annotation methods is alleviated, and the algorithm effectively improves annotation accuracy and the number of correctly predicted tags.
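The annotation procedure outlined in the abstract can be illustrated with the following minimal Python sketch. It assumes Euclidean distance over the CNN features and inverse-distance weighting of label contributions; the function and parameter names (annotate, k_per_group, top_k) are hypothetical and are not taken from the paper.

import numpy as np

def annotate(query_feat, train_feats, train_labels, group_ids, k_per_group=3, top_k=5):
    """Predict keywords for an unannotated image (illustrative sketch).

    query_feat   : (d,) deep CNN feature of the query image
    train_feats  : (n, d) deep CNN features of the training images
    train_labels : list of n label sets (one set of keywords per image)
    group_ids    : (n,) semantic-group index of each training image
    """
    # Visual distance between the query and every training image.
    dists = np.linalg.norm(train_feats - query_feat, axis=1)

    # Build the neighborhood set: the k nearest images from each semantic group.
    neighbor_idx = []
    for g in np.unique(group_ids):
        members = np.where(group_ids == g)[0]
        nearest = members[np.argsort(dists[members])[:k_per_group]]
        neighbor_idx.extend(nearest.tolist())

    # Accumulate each label's contribution, weighted inversely by visual distance.
    contribution = {}
    for i in neighbor_idx:
        w = 1.0 / (dists[i] + 1e-8)
        for label in train_labels[i]:
            contribution[label] = contribution.get(label, 0.0) + w

    # Keywords with the largest accumulated contribution become the annotations.
    ranked = sorted(contribution, key=contribution.get, reverse=True)
    return ranked[:top_k]

In this sketch, drawing the same number of neighbors from every semantic group is what keeps semantically related images in the neighborhood even when they are not the visually closest ones, which is the intent behind combining semantic neighbors with visual distance.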