模式识别与人工智能
Saturday, March 15, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2020, Vol. 33 Issue (11): 1004-1012    DOI: 10.16451/j.cnki.issn1003-6059.202011005
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
Vision Based Important Change Detection Method for Web Pages
SHI Cunhui1,2, YU Xiaoming1, LIU Yue1, JIN Xiaolong1,2, CHENG Xueqi1,2
1. Key Laboratory of Network Data Science and Technology,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190;
2. School of Computer Science and Technology,University of Chinese Academy of Sciences,Beijing 100049

Download: PDF (1265 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Duplicate Web indexes of Web crawling can be reduced effectively by detecting important changes and determining changes of essential content in Web pages.Therefore,a vision based detection method is proposed to detect changes in different semantic regions of the page and compress the page into a low dimensional vector representation.The proposed method is utilized to understand the difference of semantic importance in different regions from the perspective of users.Compared with the existing methods,the proposed method is independent of the analysis of HTML,and thus it is suitable for new media,such as mobile Internet.Experiments show the effectiveness of the proposed method.
Key wordsWeb Content      Change Detection      Visual Feature      Low Dimensional Vector     
Received: 12 August 2020     
ZTFLH: TP391  
Corresponding Authors: SHI Cunhui,Ph.D.candidate,engineer.His research interests include network science,information reco-mmendation and event extraction.   
About author:: YU Xiaoming,Ph.D.,senior engineer.His research interests include Internet search and mining.LIU Yue,Ph.D.,associate professor.Her research interests include text mining,Web search,complex network analysis and social computing.JIN Xiaolong,Ph.D.,professor.His research interests include knowledge graph and knowledge engineering.CHENG Xueqi,Ph.D.,professor.His research interests include big data analysis and mining.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
SHI Cunhui
YU Xiaoming
LIU Yue
JIN Xiaolong
CHENG Xueqi
Cite this article:   
SHI Cunhui,YU Xiaoming,LIU Yue等. Vision Based Important Change Detection Method for Web Pages[J]. , 2020, 33(11): 1004-1012.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202011005      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2020/V33/I11/1004
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn