模式识别与人工智能
Saturday, March 15, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2018, Vol. 31 Issue (3): 275-282    DOI: 10.16451/j.cnki.issn1003-6059.201803009
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
Word Embedding Based Chinese News Event Detection and Representation
ZHANG Bin1, HU Linmei1, HOU Lei1, LI Juanzi1
1.Knowledge Engineering Group, Department of Computer Science and Technology, Tsinghua University, Beijing 100084

Download: PDF (790 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Existing methods of event detection are mainly based on traditional TF-IDF document representation with high dimension and sparse semantics, leading to low efficiency and accuracy. Thus, they are not suitable for large-scale online news event detection. A document representation method based on word embedding is proposed in this paper. By the document representation method, the document representation dimension is reduced, the semantic sparse problem is alleviated and the efficiency and accuracy of document similarity calculation are enhanced. Based on the document representation method, a dynamic online clustering method is proposed for online news event detection. Based on the dynamic online clustering method, both the accuracy and the recall of event detection are improved. Experiments on the standard dataset TDT4 and a real dataset show that the proposed adaptive online event detection method significantly improves the performance of event detection in both efficiency and accuracy compared with the state-of-the-art methods.

Key wordsWord Embedding      Event Detection      Dynamic Online Clustering     
Received: 26 September 2017     
ZTFLH: TP 391  
Fund:Supported by National Basic Research Program of China(973 Program)(No.2014CB340504), Key Program of National Natural Science Foundation of China(No.61533018,61661146007), Fund of Online Education Research Center of Ministry of Educa-tion of China(No.2016ZD102), Tsinghua-NUS NEXT Joint Research Center Program
Corresponding Authors: HOU Lei, Ph.D.. His research interests include news and user-generated content analysis and semantic web.   
About author:: ZHANG Bin, master student. His research interests include news mining.HU Linmei, Ph.D.candidate. Her research interests include text mining and natural language processing.LI Juanzi, Ph.D., professor. Her research interests include data mining, semantic web and knowledge graph.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
ZHANG Bin
HU Linmei
HOU Lei
LI Juanzi
Cite this article:   
ZHANG Bin,HU Linmei,HOU Lei等. Word Embedding Based Chinese News Event Detection and Representation[J]. , 2018, 31(3): 275-282.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201803009      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2018/V31/I3/275
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn