基于随机蕨丛的双层视频分割算法<sup>*</sup>

摘要
图/表
参考文献
相关文章 (6)

全文: PDF (604 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要提出一种基于随机蕨丛的双层视频分割算法，实现对单目视频的自动分割.算法在对视频运动特征进行聚类的基础上，构造视频运动特征字典，通过随机蕨丛对运动特征进行建模.在此基础上利用条件随机场约束视频颜色、运动特征以及邻域关系，通过graph-cut算法求解出全局最优的分割结果.在实验中采用多种环境的视频数据对本文算法的有效性进行测试，并与其他分割算法的结果进行比较.

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	褚一平
	陈勤
	黄叶珏
	郑河荣

关键词 ：双层视频分割, 随机蕨丛, 条件随机场

Abstract：A random ferns based method is proposed for bilayer video segmentation with the capability of segmenting monocular video automatically. Motion feature dictionary is constructed by clustering the motion features of the video, and the motion features are modeled by random ferns. The video colors, motion features and neighboring relationships are constrained by using conditional random fields. The graph-cut algorithm is adopted for solving globally optimal segmentation results. The experimental results demonstrate the validity of the proposed algorithm, and the results of the proposed method are compared with other algorithms on different video data.

Key words： Bilayer Video Segmentation Random Fern Conditional Random Field

收稿日期: 2008-09-04

ZTFLH:

TP391

基金资助:浙江省教育厅资助项目(No.Y200805048)

作者简介: 褚一平，男，1978年生，博士，主要研究方向为智能识别、信息安全.E-mail: hzcyp@yahoo.com.cn.陈勤，男，1962年生，教授，主要研究方向为计算机视觉、机器学习.黄叶珏，女，1978年生，硕士，主要研究方向为计算机图形图像处理.郑河荣，男，1971年生，博士研究生，副教授，主要研究方向为计算机图形图像处理.

引用本文:

褚一平，陈勤，黄叶珏，郑河荣. 基于随机蕨丛的双层视频分割算法^*[J]. 模式识别与人工智能, 2009, 22(3): 463-467. CHU Yi-Ping, CHEN Qin, HUANG Ye-Jue, ZHENG He-Rong. Bilayer Video Segmentation Based on Random Ferns. , 2009, 22(3): 463-467.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/ 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2009/V22/I3/463

[1] Ridder C, Munkelt O, Kirchner H. Adaptive Background Estimation and Foreground Detection Using Kalman Filtering // Proc of the International Conference on Recent Advances in Mechatronics. Istanbul, Turkey, 1995: 193-199
[2] Toyama K, Krumm J, Brummit B, et al. Wallflower: Principles and Practice of Background Maintenance // Proc of the International Conference on Computer Vision. Corfu, Greece, 1999, Ⅰ: 255-261
[3] Yang Tao, Li S Z, Pan Quan, et al. Real-Time and Accurate Segmentation of Moving Objects in Dynamic Scene // Proc of the 2nd ACM International Workshop on Video Surveillance and Sensor Networks. New York, USA, 2004: 136-143
[4] Stauffer C, Grimson W E L. Learning Patterns of Activity Using Real-Time Tracking. IEEE Trans on Pattern Analysis and Machine Intelligence, 2000, 22(8): 747-757
[5] Stenger B, Ramesh V, Paragios N, et al. Topology Free Hidden Markov Models: Application to Background Modeling // Proc of the International Conference on Computer Vision. Vancouver, Canada, 2001, Ⅰ: 294-301
[6] Elgammal A, Duraiswami R, Harwood D, et al. Background and Foreground Modeling Using Nonparametric Kernel Density Estimation for Visual Surveillance. Proc of the IEEE, 2002, 90(7): 1151- 1163
[7] Chen Rui, Deng Yu, Xiang Shimin, et al. A Non-Parametric Foreground / Background Segmentation Method by Fusion of Intensity and Edge Feature. Journal of Computer-Aided Design & Computer Graphics, 2005, 17(6): 1278-1284 (in Chinese)
(陈睿,邓宇,向世明,等.结合强度和边界信息的非参数前景/背景分割方法.计算机辅助设计与图形学学报, 2005, 17(6): 1278-1284)
[8] Migdal J, Grimson E. Background Subtraction Using Markov Thresholds // Proc of the IEEE Workshop on Motion and Video Computing. Breckenridge, USA, 2005: 58-65
[9] Xu Wei, Zhou Yue, Gong Yihong, et al. Background Modeling Using Time Dependent Markov Random Field with Image Pyramid [EB/OL]. [2005- 01- 31]. http://www.soe.ucsc.edu/~tao/pps/Motion95.pdf
[10] Yaser S, Mubarak S. Bayesian Modeling of Dynamic Scenes for Object Detection. IEEE Trans on Pattern Analysis and Machine Intelligence, 2005, 27(11): 1778-1792
[11] Wang Yang, Ji Qiang. A Dynamic Conditional Random Field Model for Object Segmentation in Image Sequences // Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego, USA, 2005, Ⅰ: 264-270
[12] Criminisi A, Cross G, Blake A, et al. Bilayer Segmentation of Live Video // Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, USA, 2006: 53-60
[13] Yin Pei, Criminisi A, Winn J, et al. Tree-Based Classifiers for Bilayer Video Segmentation // Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Minneapolis, USA, 2007: 1-8
[14] Friedman J, Hastie T, Tibshirani R. Additive Logistic Regression: A Statistical View of Boosting. Annals of Statistics, 2000, 28(2): 337-407
[15] Ozuysal M, Fua P, Lepetit V. Fast Keypoint Recognition in Ten Lines of Code // Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Minneapolis, USA, 2007: 9-12
[16] Lepetit V, Fua P. Keypoint Recognition Using Randomized Trees. IEEE Trans on Pattern Analysis and Machine Intelligence, 2006, 28(9): 1465-1479
[17] Bosch A, Zisserman A, Muoz X. Image Classification Using Random Forests and Ferns // Proc of the IEEE 11th International Conference on Computer Vision. Riode Janeiro, Brazil, 2007: 1-8
[18] Shotton J, Winn J, Rother C, et al. TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation // Proc of the 9th European Conference on Computer Vision. Graz, Austria, 2006, Ⅰ: 1-15
[19] Viola P, Jones M. Robust Real-Time Object Detection. International Journal of Computer Vision, 2004, 57(2): 137-154
[20] Boykov Y, Veksler O, Zabih R. Fast Approximate Energy Minimization via Graph Cuts. IEEE Trans on Pattern Analysis and Machine Intelligence, 2001, 23(11): 1222-1239