|
|
Improved Object Detection Method of Micro-operating System Based on Region Convolutional Neural Network |
PENG Gang1, YANG Shiqi1, HUANG Xinhan1, SU Hao1 |
1.MOE Key Laboratory of Image Processing and Intelligent Control, School of Automation, Huazhong University of Science and Technology, Wuhan 430074 |
|
|
Abstract In micro-operating system, traditional object detection method cannot detect the objects with partial occlusion and multiple poses, and thus an improved faster region convolutional neural network(Faster RCNN) is adopted to solve the problem. On the basis of original Faster RCNN, deep residual network exhibiting excellent performance in image classification is introduced as the framework of the algorithm, and online hard example mining strategy to enhance the performance by alleviating the imbalance between positive and negative examples is employed. The experimental results manifest that the proposed method can detect objects with partial occlusion and multiple poses effectively. The proposed method shows strong adaptability to environment, responds quickly compared with traditional methods, and thus the practicality of it is verified.
|
Received: 18 September 2017
|
|
Fund:Supported by National High Technology Research and Development Program of China(No.2008AA8041302), National Natural Science Foundation of China(No.60275013) |
About author:: PENG Gang(Corresponding author), Ph.D., associate professor. His research interests include robotics and intelligent manufacturing.YANG Shiqi, master student. His research interests include computer vision and deep learning.HUANG Xinhan, bachelor, professor. His research interests include intelligent robot and pattern recognition.SU Hao, master student. His research interests include intelligent robot and deep learning. |
|
|
|
[1] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet Classification with Deep Convolutional Neural Networks // PEREIRA F, BURGES C J C, BOTTOU L, et al., eds. Advances in Neural Information Processing Systems 25. Cambridge, USA: The MIT Press, 2012: 1097-1105. [2] GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2014: 580-587. [3] GIRSHICK R. Fast R-CNN // Proc of the IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2015: 1440-1448. [4] REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[C/OL]. [2017-08-21]. https://arxiv.org/pdf/1506.01497.pdf. [5] REDMON J, DIVVALA S, GIRSHICK R, et al. You Only Look Once: Unified, Real-Time Object Detection // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2016: 779-788. [6] LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single Shot Multibox Detector // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2016: 21-37. [7] HE K M, ZHANG X Y, REN S Q, et al. Deep Residual Learning for Image Recognition // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2016: 770-778. [8] SHRIVASTAVA A, GUPTA A, GIRSHICK R. Training Region-Based Object Detectors with Online Hard Example Mining // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2016: 761-769. [9] DENG J, DONG W, SOCHER R, et al. ImageNet: A Large-Scale Hierarchical Image Database // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2009: 248-255. [10] SIMONYAN K, ZISSERMAN A. Very Deep Convolutional Networks for Large-Scale Image Recognition[J/OL]. [2017-08-21]. https://arxiv.org/pdf/1409.1556v6.pdf. [11] SZEGEDY C, LIU W, JIA Y Q, et al. Going Deeper with Convolutions // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2015. DOI: 10.1109/CVPR.2015.7298594. [12] IOFFE S, SZEGEDY C. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift[C/OL]. [2017-08-21]. https://arxiv.org/pdf/1502.03167v3.pdf. [13] VEIT A, WILBER M J, BELONGIE S. Residual Networks Behave Like Ensembles of Relatively Shallow Networks // LEE D D, SUGIYAMA M, LUXBURG U V, et al., eds. Advances in Neural Information Processing Systems 29. Cambridge, USA: The MIT Press, 2016: 550-558. [14] HE K M, ZHANG X Y, REN S Q, et al. Identity Mappings in Deep Residual Networks // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2016: 630-645. [15] UIJLINGS J R R, VAN DE SANDE K E A, GEVERS T, et al. Selective Search for Object Recognition. International Journal of Computer Vision, 2013, 104(2): 154-171. [16] HE K M, ZHANG X Y, REN S Q, et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2014: 346-361. [17] SU J, HUANG X H, WANG M. Pose Detection of Partly Covered Target in Micro-Vision System // Proc of the 10th World Congress on Intelligent Control and Automation. Washington, USA: IEEE, 2012: 4721-4725. |
|
|
|