模式识别与人工智能
Friday, Apr. 4, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence  2024, Vol. 37 Issue (2): 162-171    DOI: 10.16451/j.cnki.issn1003-6059.202402005
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
Model for Small Object Detection in Aerial Photography Based on Low Dimensional Image Feature Fusion
CAI Fenghuang1, ZHANG Jiaxiang1, HUANG Jie1
1. College of Electrical Engineering and Automation, Fuzhou University, Fuzhou 350108

Download: PDF (2828 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  To address the challenges of significant changes in the field of view and complex spatiotemporal information in unmanned aerial vehicle aerial image target detection, a model for small object detection in aerial photography based on low dimensional image feature fusion is presented grounded on the YOLOv5(you only look once version 5) architecture. Coordinate attention is introduced to improve the inverted residuals of MobileNetV3, thereby increasing the spatial dimension information of images while reducing parameters of the model. The YOLOv5 feature pyramid network structure is improved to incorporate feature images from shallow networks. The ability of the model to represent low-dimensional effective information of images is enhanced, and consequently the detection accuracy of the proposed model for small objects is improved. To reduce the impact of complex background in the image, the parameter-free average attention module is introduced to focus on both spatial attention and channel attention. VariFocal Loss is adopted to reduce the weight proportion of negative samples in the training process. Experiments on VisDrone dataset demonstrate the effectiveness of the proposed model. The detection accuracy is effectively improved while the model complexity is significantly reduced.
Key wordsYou Only Look Once Version 5(YOLOv5)      Small Target Detection      Attention Mechanism      Loss Function     
Received: 16 October 2023     
ZTFLH: TP181  
Fund:National Natural Science Foundation of China(No.92367109), Young Scientists Fund of National Natural Science Foundation of China(No.62301163)
Corresponding Authors: HUANG Jie, Ph.D., professor. His research interests include pa-ttern recognition, intelligent system and multi-agent system.   
About author:: CAI Fenghuang, Ph.D., professor. His research interests include machine learning and power electronic converter control. ZHANG Jiaxiang, Master student. His research interests include intelligent vision and unmanned system.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
CAI Fenghuang
ZHANG Jiaxiang
HUANG Jie
Cite this article:   
CAI Fenghuang,ZHANG Jiaxiang,HUANG Jie. Model for Small Object Detection in Aerial Photography Based on Low Dimensional Image Feature Fusion[J]. Pattern Recognition and Artificial Intelligence, 2024, 37(2): 162-171.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202402005      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2024/V37/I2/162
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn