模式识别与人工智能
Sunday, Apr. 13, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2019, Vol. 32 Issue (2): 133-143    DOI: 10.16451/j.cnki.issn1003-6059.201902005
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
Entity Relations Extraction in Chinese Domain Based on Distant Supervision with Multi-feature Fusion
WANG Bin1, GUO Jianyi1, 2, XIAN Yantuan1, 2, WANG Hongbin1, 2, YU Zhengtao1, 2
1.Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500;
2.Key Laboratory of Intelligent Information Processing, Kunming University of Science and Technology, Kunming 650500

Download: PDF (974 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Aiming at the extraction of Chinese domain entity relationship from unlabeled text, a hybrid method of domain entity attribute extraction based on distant supervision is proposed. The structured relational three tuples in the knowledge base are applied to obtain the training corpus automatically from the natural language text. Due to the large amount of noise in the annotation data of distant supervision method, the latent Dirichlet allocation(LDA) topic model for topic keyword extraction is adopted, and then the similarity calculation with relationship type and keyword pattern matching for denoising are performed. Finally, the part-of-speech feature, the dependency feature and the phrase syntax tree feature are extracted, and the relationship extraction model is trained. Experiments show that the method fusing three features produces higher F value and better extraction performance.
Key wordsDistant Supervision      Entity Relation Extraction      Domain Knowledge Base      Feature Fusion      Latent Dirichlet Allocation Topic Model     
Received: 15 October 2018     
ZTFLH: TP 391.1  
Fund:Supported by National Natural Science Foundation of China(No.61562052,61363044,61462054)
About author:: (WANG Bin, master student. His research interests include natural language proce-ssing.) (GUO Jianyi(Corresponding author), master, professor. Her research interests include pattern recognition, natural language processing, information extraction and knowledge acquisition.)(XIAN Yantuan, Ph.D. candidate, lectu-rer. His research interests include machine translation, information retrieval and information extraction.) (WANG Hongbin, Ph.D. candidate. His research interests include intelligent information system, natural language processing and information retrieval.) (YU Zhengtao, Ph.D., professor. His research interests include machine translation, natural language processing and information retrieval.)
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
WANG Bin
GUO Jianyi
XIAN Yantuan
WANG Hongbin
YU Zhengtao
Cite this article:   
WANG Bin,GUO Jianyi,XIAN Yantuan等. Entity Relations Extraction in Chinese Domain Based on Distant Supervision with Multi-feature Fusion[J]. , 2019, 32(2): 133-143.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201902005      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2019/V32/I2/133
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn