模式识别与人工智能
Friday, Apr. 11, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2017, Vol. 30 Issue (1): 54-63    DOI: 10.16451/j.cnki.issn1003-6059.201701006
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
Relation Extraction Method Combining Clause Level Distant Supervision and Semi-supervised Ensemble Learning
YU Xiaokang1, CHEN Ling1, GUO Jing1, CAI Yaya1, WU Yong2, WANG Jingchang2
1. College of Computer Science and Technology, Zhejiang University, Hangzhou 310027
2. Zhejiang Hongcheng Computer System Co.,Ltd., Hangzhou 310053

Download: PDF (956 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Aiming at noisy data in training data and the insufficient use of negative instances in traditional distant supervision relation extraction methods, a relation extraction method combining clause level distant supervision and semi-supervised ensemble learning is proposed. Firstly, the relation instance set is generated by distant supervision. Secondly, based on clause identification, a denoising algorithm is used to reduce the wrongly labeled data in the relation instance set. Thirdly, the lexical features are extracted from relation instances and are transformed into distributed vectors to establish feature dataset. Finally, all positive data and part of negative data in feature dataset are chosen to form labeled dataset, and the other part of negative data are chosen to form unlabeled dataset. A relation classifier is trained through improved semi-supervised ensemble learning algorithm. Experiments show that compared with baseline methods the proposed method achieves higher accuracies and recall.
Key wordsRelation Extraction      Distant Supervision      Clause Identification      Noise Reduction      Semi supervised Ensemble Learning     
Received: 19 September 2016     
ZTFLH: TP 311  
Fund:Supported by National Natural Science Foundation of China(No.61332017,60703040), Major Science and Technology Project of Zhejiang Province(No.2015C33002,2013C01046,2011C13042), Project of China Knowledge Centre for Engineering Sciences and Technology(No.CKCEST-2014-1-5)
About author:: YU Xiaokang, born in 1990, master student. His research interests include text mining. CHEN Ling(Corresponding author), born in 1977, Ph.D., associate professor. His research interests include pervasive computing and database. GUO Jing, born in 1988, Ph.D. candidate. His research interests include text mining. CAI Yaya, born in 1990, master student. Her research interests include pervasive computing. WU Yong, born in 1977, engineer. His research interests include cloud computing. WANG Jingchang, born in 1977, master, senior engineer. His research interests include cloud computing.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
YU Xiaokang
CHEN Ling
GUO Jing
CAI Yaya
WU Yong
WANG Jingchang
Cite this article:   
YU Xiaokang,CHEN Ling,GUO Jing等. Relation Extraction Method Combining Clause Level Distant Supervision and Semi-supervised Ensemble Learning[J]. , 2017, 30(1): 54-63.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201701006      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2017/V30/I1/54
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn