模式识别与人工智能
Friday, May. 2, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2019, Vol. 32 Issue (4): 369-375    DOI: 10.16451/j.cnki.issn1003-6059.201904010
Ressarches and Applications Current Issue| Next Issue| Archive| Adv Search |
Character-Based Disconnected Recurrent Neural Network for Name Nationality Identification
ZHANG Yusha1, ZHANG Liming2, JIANG Shengyi2
1.School of Electronic Information, Hunan Institute of Information Technology, Changsha 410151
2.Eastern Language Processing Center, Guangdong University of Foreign Studies, Guangzhou 510006

Download: PDF (697 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Personal name is viewed as a strong indicator of inferring the nationality of the user. Generally, personal names reveal the differentiation and correlation of naming conventions among different nationalities. In the current research, personal name features are extracted by cutting off name strings into a set of independent n-gram units, while subtle relationships between characters are not explored. Therefore, a character-based disconnected recurrent neural network is proposed to capture subtle features among personal names in this paper. Concretely, a set of fragments is derived from name strings by order using a slice window. Then, long short-term memory units are utilized to learn information of each fragment, and they are aggregated via mean-pooling operation to obtain the whole name representation for nationalities prediction of users. Disconnected fragments enable model to focus on subtle features among different personal names. Experiments on Olympic dataset and Aminer dataset show that the proposed model outperforms the existing models and the performance is satisfactory.


Key wordsNationality Identification      User Profiling      Character Modeling      Recurrent Neural Network     
Received: 24 January 2019     
ZTFLH: TP 391  
Fund:Supported by National Natural Science Foundation of China(No.61572145), The Thirteenth Five-Year Plan Project of Educational Science in Hunan Province(No.XJK18CGD044)
About author:: ZHANG Yusha, master, associate profe-ssor. Her research interests include data mining and natural language processing.ZHANG Liming(Corresponding author), master student. His research interests include natural language processing.JIANG Shengyi, Ph.D, professor. His research interests include data mining and natural language processing.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
ZHANG Yusha
ZHANG Liming
JIANG Shengyi
Cite this article:   
ZHANG Yusha,ZHANG Liming,JIANG Shengyi. Character-Based Disconnected Recurrent Neural Network for Name Nationality Identification[J]. , 2019, 32(4): 369-375.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201904010      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2019/V32/I4/369
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn