模式识别与人工智能
Saturday, May. 3, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence  2024, Vol. 37 Issue (12): 1107-1120    DOI: 10.16451/j.cnki.issn1003-6059.202412006
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
LKDD-Net: Lightweight Keypoint and Deformable Descriptor Extraction Network
FANG Baofu1,2, ZHANG Keao1,2, WANG Hao1,2, YUAN Xiaohui3
1. School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601
2. Key Laboratory of Knowledge Engineering with Big Data of Ministry of Education of China, Hefei University of Technology, Hefei 230009
3. Department of Computer Science and Engineering, University of North Texas, Denton, Texas 76201, United States

Download: PDF (5169 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Keypoint extraction is a crucial step in visual simultaneous localization and mapping(VSLAM). Existing deep learning based keypoint extraction methods suffer from low efficiency and fail to meet real-time requirements. Furthermore, they do not provide the geometric invariance required by descriptors. To address this issue, a lightweight keypoint and deformable descriptor extraction network(LKDD-Net) is proposed. A lightweight network module is introduced in the backbone network to improve the efficiency of feature extraction, and then the deformable convolution module is applied to the descriptor decoder to extract deformable descriptors. LKDD-Net is capable of simultaneously obtaining both keypoint locations and deformable descriptors. To study the effectiveness of LKDD-Net, a visual odometry system based on LKDD-Net is designed. Experiments on HPatches public dataset and TUM public dataset show that LKDD-Net can run in real-time on GPUs with keypoint extraction time being as low as 8.3 ms, while maintaining high accuracy in various scenarios. The performance of the visual odometry system composed of LKDD-Net is superior to traditional vision and VSLAM systems based on deep learning keypoint extraction. The proposed method successfully tracks all six sequences in TUM public dataset, demonstrating stronger robustness.
Key wordsKeypoint Extraction      Descriptor      Lightweight Network      Visual Odometry     
Received: 08 November 2024     
ZTFLH: TP 399  
Fund:Natural Science Foundation of Anhui Province(No.2308085MF203), University Synergy Innovation Program of Anhui Province(No.GXXT-2022-055), Major Project of Key Laboratory of Flight Techniques and Flight Safety of CAAC(No.FZ2022ZZ02)
Corresponding Authors: FANG Baofu, Ph.D., associate professor. His research interests include intelligent robot systems.   
About author:: ZXHANG Keao, Master student. His research interests include visual SLAM
WANG Hao, Ph.D., professor. His research interests include distributed intelligent systems and robots.
YUAN Xiaohui, Ph.D., professor. His research interests include computer vision, machine learning and artificial intelligence.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
FANG Baofu
ZHANG Keao
WANG Hao
YUAN Xiaohui
Cite this article:   
FANG Baofu,ZHANG Keao,WANG Hao等. LKDD-Net: Lightweight Keypoint and Deformable Descriptor Extraction Network[J]. Pattern Recognition and Artificial Intelligence, 2024, 37(12): 1107-1120.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202412006      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2024/V37/I12/1107
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn