Abstract:In the existing pedestrian detection algorithms, the pedestrian detection is considered as a supervised learning problem of two classes, pedestrian and background. Thus, the pedestrian and the background in the video are distinguished. However, the problem of variable poses and heavy occlusion can not be solved by these algorithms effectively. In this paper, a pedestrian detection algorithm based on graph cuts and density clustering is proposed. The pedestrian detection is regarded as an unsupervised learning problem of multiple classes. At the training stage, the multilevel histogram of oriented gradient-local binary pattern(HOG-LBP) features are firstly calculated for each of training samples. Then, different weights are assigned to each image block of the multilevel HOG-LBP features. To distinguish the different parts of pedestrian and assign weight, the image sample is segmented by the block-based graph cuts algorithm. Finally, the density clustering approach is used to classify the positive and negative samples into multiple cluster center respectively. At the testing stage, the distance between the multilevel HOG-LBP of test sample and every cluster center is calculated, and the five shortest distances are voted to classify the test sample. Experiments show that the proposed algorithm can handle the pose variations and partial occlusions effectively. Moreover, with the increase of training samples, the results of the proposed algorithm can be comparable to that of the state-of-the-art pedestrian detection algorithms.
曾成斌,刘继乾. 基于图切割和密度聚类的视频行人检测算法*[J]. 模式识别与人工智能, 2017, 30(7): 588-587.
ZENG Chengbin, LIU Jiqian. Pedestrian Detection on Videos Based on Graph Cuts and Density Clustering. , 2017, 30(7): 588-587.
[1] DALAL N, TRIGGS B. Histograms of Oriented Gradients for Human Detection // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2005: 886-893. [2] WANG X Y, HAN T X, YAN S C. An HOG-LBP Human Detector with Partial Occlusion Handling // Proc of the 12th IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2009: 32-39. [3] OJALA T, PIETIK INEN M, M ENP T. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Transactions on Pattern Analysis and Machine Inte-lligence, 2002, 24(7): 971-987. [4] FELZENSZWALB P F, GIRSHICK R B, MCALLESTER D, et al. Object Detection with Discriminatively Trained Part-Based Models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(9): 1627-1645. [5] GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[C/OL]. [2017-01-25]. https://arxiv.org/pdf/1311.2524.pdf. [6] GIRSHICK R. Fast R-CNN // Proc of the International Conference on Computer Vision. Washington, USA: IEEE, 2015: 1440-1448. [7] TIAN Y L, LUO P, WANG X G, et al. Pedestrian Detection Aided by Deep Learning Semantic Tasks // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2015: 5079-5087. [8] HOSANG J, OMRAN M, BENENSON R, et al. Taking a Deeper Look at Pedestrians // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2015: 4073-4082. [9] ZHANG S S, BENENSON R, OMRAN M, et al. How Far Are We from Solving Pedestrian Detection? // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2016: 1259-1267. [10] RODRIGUEZ A, LAIO A. Machine Learning. Clustering by Fast Search and Find of Density Peaks. Science, 2014, 344(6191): 1492-1496. [11] MA H D, ZENG C B, LING C X. A Reliable People Counting System via Multiple Cameras. ACM Transactions on Intelligent Systems and Technology, 2012, 3(2): 67-83. [12] KOLMOGOROV V, ZABIH R. What Energy Functions Can Be Minimized via Graph Cuts? IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(2): 147-159. [13] BOYKOV Y Y, JOLLY M P. Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images // Proc of the 8th IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2001: 105-112. [14] VEKSLER O. Star Shape Prior for Graph-Cut Image Segmentation // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2008: 454-467. [15] BANKO M, BRILL E. Scaling to Very Large Corpora for Natural Language Disambiguation // Proc of the 39th Annual Meeting on Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2001: 26-33. [16] ZHOU E, CAO Z M, YIN Q. Naive-Deep Face Recognition: Touching the Limit of LFW Benchmark or Not?[C/OL]. [2017-01-25]. https://arxiv.org/pdf/1501.04690v1.pdf. [17] AHONEN T, HADID A, PIETIKAINEN M. Face Description with Local Binary Patterns: Application to Face Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006, 28(12): 2037-2041. [18] BOYKOV Y, KOLMOGOROV V. An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(9): 1124-1137. [19] WOJEK C, WALK S, SCHIELE B. Multi-cue Onboard Pedestrian Detection // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2009: 794-801. [20] ESS A, LEIBE B, VAN Gool L. Depth and Appearance for Mobile Scene Analysis // Proc of the 11th IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2007. DOI: 10.1109/ICCV.2007.4409092. [21] DOLLAR P, WOJEK C, SCHIELE B, et al. Pedestrian Detection: An Evaluation of the State of the Art. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(4): 743-761. [22] SCHWARTZ W R, KEMBHAVI A, HARWOOD D, et al. Human Detection Using Partial Least Squares Analysis // Proc of the 12th IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2009: 24-31.