模式识别与人工智能
Friday, May. 2, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2013, Vol. 26 Issue (10): 909-915    DOI:
Orignal Article Current Issue| Next Issue| Archive| Adv Search |
Image Representation Based on Multiple Visual Codebooks
SONG Yan, JIANG Bing, DAI Li-Rong
iFlytek Speech Laboratory, Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei 230027

Download: PDF (424 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  The effectiveness of the image representation based on bag-of-visual words(BoW) model is majorly limited by the quantization error. To address this issue, an improved image representation based on multiple visual codebooks is proposed in this paper, which considers both visual codebook construction and feature coding. The proposed method specifically consists of 1) multiple visual codebooks construction, in which the compact and complementary visual codebooks are iteratively generated; 2) image representation, in which the visual words are firstly selected from each individual visual codebook, then the coding coefficients are determined by using the regularized linear regression method, and finally the image is represented by combining the spatial pyramid structure. The experimental results on several benchmark image classification datasets demonstrate the consistent and significant improvement of the proposed method.
Key wordsImage Classification      Visual Codebook      Clustering Analysis      Image Representation     
Received: 20 August 2012     
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
SONG Yan
JIANG Bing
DAI Li-Rong
Cite this article:   
SONG Yan,JIANG Bing,DAI Li-Rong. Image Representation Based on Multiple Visual Codebooks[J]. , 2013, 26(10): 909-915.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2013/V26/I10/909
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn