基于深度重塑的航拍目标检测增强网络

doi:10.16451/j.cnki.issn1003-6059.202407007

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (2486 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要针对航拍图像目标检测中存在的复杂背景对检测的干扰、小目标的细节丢失及检测效率的高需求等问题,文中提出深度重塑增强网络(Depth-Reshaping Enhanced Network, DR-ENet).首先,采用空间深度重塑技术取代传统下采样方法,减少特征提取中的信息损失,增强对细节的捕获能力.然后,提出可变形空间金字塔池化方法,增强网络对目标形状变化的适应性和在复杂背景中目标识别的能力.同时,注意力解耦检测头增强针对各检测任务的学习效果.最后,为了同时兼顾密集小目标和复杂背景的特点,构建小型航拍数据集PORT.在3个公开航拍数据集及PORT数据集上的测试表明DR-ENet有一定的性能提升,说明其在航拍图像目标检测中的有效性和高效性.

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	付天怡
	杨本翼
	董红斌
	邓宝松

关键词 ：航拍图像, 计算机视觉, 深度学习, 目标检测, 特征提取

Abstract：To address the issues of complex background interference, loss of fine details in small objects and the high demand for detection efficiency in aerial image object detection, a depth-reshaping enhanced network(DR-ENet) is proposed. Firstly, the traditional downsampling methods are replaced by spatial depth-reshaping techniques to reduce information loss during feature extraction and enhance the ability of the network to capture details. Then, a deformable spatial pyramid pooling method is designed to enhance the adaptability of network to object shape variations and its ability to recognize in complex backgrounds. Simultaneously, an attention decoupling detection head is proposed to enhance the learning effectiveness for different detection tasks. Finally, a small-scale aerial dataset , PORT, is constructed to simultaneously consider the characteristics of dense small objects and complex backgrounds. Experiments on three public aerial datasets and PORT dataset demonstrate that DR-ENet achieves performance improvement, proving its effectiveness and high efficiency in aerial image object detection.

Key words： Aerial Image Computer Vision Deep Learning Object Detection Feature Extraction

收稿日期: 2024-04-19

ZTFLH:

TP391

基金资助:国家自然科学基金项目(No.61472095,62303486,42201501,61902423)、黑龙江省自然科学基金项目(No.KY10600200048)资助

通讯作者: 董红斌,博士,教授,主要研究方向为人工智能、多智能体系统.E-mail:donghongbin@hrbeu.edu.cn.

作者简介: 付天怡,博士研究生,主要研究方向为深度学习、计算机视觉.E-mail:futianyi@hrbeu.edu.cn.杨本翼,博士,助理研究员,主要研究方向为计算机视觉.E-mail:yangbenyi000@163.com.邓宝松,博士,研究员,主要研究方向为无人系统技术及应用.E-mail:dbs@nudt.edu.cn.

引用本文:

付天怡, 杨本翼, 董红斌, 邓宝松. 基于深度重塑的航拍目标检测增强网络[J]. 模式识别与人工智能, 2024, 37(7): 652-662. FU Tianyi, YANG Benyi, DONG Hongbin, DENG Baosong. Depth-Reshaping Based Aerial Object Detection Enhanced Network. Pattern Recognition and Artificial Intelligence, 2024, 37(7): 652-662.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/10.16451/j.cnki.issn1003-6059.202407007 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2024/V37/I7/652