模式识别与人工智能
Tuesday, Apr. 22, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2019, Vol. 32 Issue (4): 336-344    DOI: 10.16451/j.cnki.issn1003-6059.201904006
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
Fine-Grained Visual Classification Based on Sparse Bilinear Convolutional Neural Network
MA Li1, WANG Yongxiong1,2
1.School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093
2.Shanghai Engineering Research Center of Assistive Devices, Shanghai 200093

Download: PDF (766 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  The overfitting problem of bilinear convolutional neural network(B-CNN) for fine-grained visual recognition is caused by the large number of parameters and its complex structure. In this paper, a sparse B-CNN is proposed to handle the problem. Firstly, a scaling factor is introduced into each feature channel of B-CNN, and regularization of sparsity is applied to the scaling factors during the training. Then, the feature channels in B-CNN with low contribution to the final classification are identified by small scaling factors. Finally, these channels are pruned in a certain proportion to prevent overfitting and increase the significance of key features. The learning of sparse B-CNN is weakly supervised and end-to-end. The verification experiments on FGVC-aircraft, Stanford dogs and Stanford cars fine-grained image datasets show that the accuracy of sparse B-CNN is higher than that of the original B-CNN. Moreover, compared with other advanced algorithms for fine-grained visual recognition, the performance of sparse B-CNN is same or even better.


Key wordsFine-Grained Visual Recognition      Bilinear Convolutional Neural Network(B-CNN)     
Overfitting
      Network Sparsity      Network Pruning     
Received: 03 January 2019     
ZTFLH: TP 391  
Fund:Supported by National Natural Science Foundation of China(No.61673276,61703277)
About author:: MA Li, master student. His research interests include computer vision and image processing.WANG Yongxiong(Corresponding author), Ph.D., professor. His research inte-rests include intelligent robot and vision.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
MA Li
WANG Yongxiong
Cite this article:   
MA Li,WANG Yongxiong. Fine-Grained Visual Classification Based on Sparse Bilinear Convolutional Neural Network[J]. , 2019, 32(4): 336-344.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201904006      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2019/V32/I4/336
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn