窗口锚定的偏移受限动态蛇形卷积网络航拍小目标检测

doi:10.16451/j.cnki.issn1003-6059.202408001

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (6317 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要为了从小目标有限特征中获取关键的有效信息,提升小目标的定位能力和检测精度,文中提出窗口锚定的偏移受限动态蛇形卷积网络航拍小目标检测方法.首先,构造偏移受限动态蛇形卷积,在不同方位动态偏移,受限蛇形卷积核自适应地关注不同大小和形状的特征区域,使特征提取聚焦于微小局部结构,促进小目标特征的捕获.然后,采用双阶段多尺度特征融合方法,对不同层阶特征图进行特征对齐、融合和注入,增强底层细节信息与高层语义信息的融合,并强化不同尺寸目标信息传输,提高小目标的检测能力.与此同时,设计窗口锚定的边界框回归损失函数,基于辅助边界框和最小点距离进行边界回归,获得准确的回归结果,提高小目标的定位能力.最后,在3个航拍数据集上的实验表明,文中方法对小目标的检测性能有不同程度的改善和提高.

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	张荣国
	秦震
	胡静
	王丽芳
	刘小君

关键词 ：小目标检测, 特征提取, 特征融合, 多尺度特征, 边界框回归损失函数

Abstract：To obtain the key and effective information from limited features of small targets and improve the localization ability and detection accuracy of small targets, a window anchored offset constrained dynamic snake convolutional network for aerial small target detection is proposed. Firstly, the offset constrained dynamic snake convolution is constructed. By dynamical offsetting in different directions, the constrained snake convolution kernel adaptively focuses on feature regions of different sizes and shapes, making feature extraction concentrate on tiny local structures and thereby facilitating the capture of small target features. Secondly, by employing two-stage multi-scale feature fusion method, feature alignment fusion and injection are performed on different layer-order feature maps to enhance the fusion of the underlying detail information and the high-level semantic information, and strengthen the transmission of target information of different sizes. Thus, the detection capability of the method for small targets is improved. Meanwhile, the window anchored bounding box regression loss function is designed. The function performs the bounding regression based on the auxiliary bounding box and the minimum point distance to achieve more accurate regression results and enhance the small target localization capability of the model. Finally, comparative experiments on three aerial photography datasets show that the proposed method makes the improvements with different degrees in small target detection performance.

Key words： Small Object Detection Feature Extraction Feature Fusion Multi-scale Features Bounding Box Regression Loss Function

收稿日期: 2024-04-07

ZTFLH:

TP391.41

基金资助:国家自然科学基金项目(No.52375178)、山西省自然科学基金项目(No.202203021211206,202203021211189)、山西省教育厅项目(No.2022YJJG192)、太原科技大学研究生创新项目(No.SY2023039)资助

通讯作者: 张荣国,博士,教授,主要研究方向为图像处理、计算机视觉、模式识别.E-mail:rg_zh@163.com.

作者简介: 秦震,硕士研究生,主要研究方向为图像处理、计算机视觉.E-mail:1076212321@qq.com. 胡静,博士,教授,主要研究方向为图像处理、模式识别.E-mail:279641292@qq.com. 王丽芳,博士,副教授,主要研究方向为图像处理、计算机视觉.E-mail:wanglifang@tyust.edu.cn. 刘小君,博士,教授,主要研究方向为现代设计理论与方法、模式识别.E-mail:liuxjunhf@163.com.

引用本文:

张荣国, 秦震, 胡静, 王丽芳, 刘小君. 窗口锚定的偏移受限动态蛇形卷积网络航拍小目标检测[J]. 模式识别与人工智能, 2024, 37(8): 663-677. ZHANG Rongguo, QIN Zhen, HU Jing, WANG Lifang, LIU Xiaojun. Window Anchored Offset Constrained Dynamic Snake Convolutional Network for Aerial Small Target Detection. Pattern Recognition and Artificial Intelligence, 2024, 37(8): 663-677.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/10.16451/j.cnki.issn1003-6059.202408001 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2024/V37/I8/663