基于潜在Dirichlet分布的图像分层表示模型

摘要
图/表
参考文献
相关文章 (6)

全文: PDF (569 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要现有的图像分层表示方法严格局限于前馈型方式，不能较好地解决局部模糊性等问题。基于此，文中提出一种学习和推断层次结构所有分层的概率模型，它考虑递归的概率分解过程，通过推导得到金字塔式多层结构的潜在Dirichlet分布的衍生模型。该模型存在两个重要特性:增加表示层可提高平面模型的性能;采用全Bayesian概率方法优于其前馈型实现形式。在标准识别数据集上的实验结果表明，与现有的分层表示方法相比，该模型表现出较好性能。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	贾振华
	斯庆巴拉

关键词 ：图像分层表示, 前馈, 概率模型, 潜在Dirichlet分布(LDA)

Abstract：The existing image hierarchical representation methods are strict in feed-forward style, and therefore it is not able to solve problems like local ambiguities well. In this paper, a probabilistic model is proposed to learn and deduce all layers of the hierarchy together. Specifically, a recursive probabilistic decomposition process is taken into account, and a generative model based on latent Dirichlet allocation with pyramidal multilayer structure is derived. Two important properties of the proposed probabilistic model are demonstrated: adding an additional representation layer to improve the performance of the flat model and adopting a full Bayesian approach which is better than a feed-forward implementation of the model. Experimental results on a standard recognition dataset show that the proposed method outperforms the existing hierarchical approaches, and it improves the classification and the learning accuracy with better performance.

Key words： Image Hierarchical Representation Feed-Forward Probabilistic Model Latent Dirichlet Allocation (LDA)

收稿日期: 2012-08-14

ZTFLH:

TP 391

基金资助:河北省廊坊市科学技术支撑项目(No.2010011007)资助

作者简介: 贾振华(通讯作者)，男，1969年生，硕士，副教授，主要研究方向为计算机图形图像处理、模式识别、概率统计论等.E-mail: jzh_1969@126.com.斯庆巴拉，女，1974年生，硕士，讲师，主要研究方向为模式识别、数据挖掘、算法分析与设计.

引用本文:

贾振华，斯庆巴拉. 基于潜在Dirichlet分布的图像分层表示模型[J]. 模式识别与人工智能, 2013, 26(12): 1146-1153. JIA Zhen-Hua, SIQING Ba-La. Image Hierarchical Representation Model Based on LDA. , 2013, 26(12): 1146-1153.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/ 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2013/V26/I12/1146

[1] Lowe D G. Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 2004, 60(2): 91-110
[2] Ahmed A, Yu Kai, Xu Wei, et al. Training Hierarchical Feed-Forward Visual Recognition Models Using Transfer Learning from Pseudo-Tasks // Proc of the 10th European Conference on Computer Vision. Marseille, France, 2008: 69-82
[3] Lazebnik S, Schmid C, Ponce J. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA, 2006: 2169-2178
[4] Yang Jianchao, Yu Kai, Gong Yihong, et al. Linear Spatial Pyramid Matching Using Sparse Coding for Image Classification // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Miami, USA, 2009: 1794-1801
[5] Olshausen B A, Field D J. Sparse Coding with an Over-Complete Basis Set: A Strategy Employed by V1? Vision Research, 1997, 37(23): 3311-3325
[6] Boureau Y L, Bach F, LeCun Y, et al. Learning Mid-Level Features for Recognition // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. San Francisco, USA, 2010: 2559-2566
[7] Fritz M, Black M J, Bradski G R, et al. An Additive Latent Feature Model for Transparent Object Recognition // Proc of the 23rd An-nual Conference on Neural Information Processing Systems. Vancouver, Canada, 2009: 558-566
[8] Mutch J, Lowe D G. Object Class Recognition and Localization Using Sparse Features with Limited Receptive Fields. International Journal of Computer Vision, 2008, 80(1): 45-47
[9] Rolls E, Deco G. Computational Neuroscience of Vision. Oxford, UK: Oxford University Press, 2002
[10] Lee H, Grosse R, Ranganath R, et al. Convolutional Deep Belief Networks for Scalable Unsupervised Learning of Hierarchical Repre-
sentations // Proc of the 26th Annual International Conference on Machine Learning. Montreal, Canada, 2009: 609-616
[11] Ranzato M A, Huang Fujie, Boureau Y L, et al. Unsupervised Learning of Invariant Feature Hierarchies with Applications to Object Recognition[EB/OL].[2012-05-30].http://www.cs.nyu.edu/~ylan/files/publi/ranzato-cvpr-07.pdf
[12] Serre T, Wolf L, Bileschi S, et al. Robust Object Recognition with Cortex-Like Mechanisms. IEEE Trans on Pattern Analysis and Machine Intelligence, 2007, 29(3): 411-426
[13] Sivic J, Russell B C, Efros A A, et al. Discovering Objects and Their Locations in Images // Proc of the 10th IEEE International Conference on Computer Vision. Beijing, China, 2005, I: 370-377
[14] Blei D M, Ng A Y, Jordan M I. Latent Dirichlet Allocation. Journal of Machine Learning Research, 2003, 3(1): 993-1022
[15] Blei D M, Griffiths T L, Jordan M I, et al. Hierarchical Topic Models and the Nested Chinese Restaurant Process[EB/OL]. [2012-07-25]. http://machinelearning.wustl.edu/mlpapers/paper_files/NIPS2003_AA03.pdf
[16] Ferguson T S. A Bayesian Analysis of Some Nonparametric Problems. The Annals of Statistics, 1973, 1(3): 209-230
[17] Heinrich G. Parameter Estimation for Text Analysis[EB/OL]. [2012-07-25]. http://www.arbylon.net/publications/text-est.pdf
[18] Li Feifei, Fergus R, Perona P. Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories. Computer Vision and Image Understanding, 2004, 106(1): 59-70
[19] Kavukcuoglu K, Sermanet P, Boureau Y L, et al. Learning Convolutional Feature Hierarchies for Visual Recognition[EB/OL]. [2012-07-25]. http://yann.lecun.com/exdb/publis/pdf/koray-nips-10.pdf
[20] Fidler S, Boben M, Leonardis A. Similarity-Based Cross-Layered Hierarchical Representation for Object Categorization[EB/OL].[2012-06-10].http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4587409&tag=1