Character-Based Disconnected Recurrent Neural Network for Name Nationality Identification
ZHANG Yusha1, ZHANG Liming2, JIANG Shengyi2
1.School of Electronic Information, Hunan Institute of Information Technology, Changsha 410151 2.Eastern Language Processing Center, Guangdong University of Foreign Studies, Guangzhou 510006
Abstract:Personal name is viewed as a strong indicator of inferring the nationality of the user. Generally, personal names reveal the differentiation and correlation of naming conventions among different nationalities. In the current research, personal name features are extracted by cutting off name strings into a set of independent n-gram units, while subtle relationships between characters are not explored. Therefore, a character-based disconnected recurrent neural network is proposed to capture subtle features among personal names in this paper. Concretely, a set of fragments is derived from name strings by order using a slice window. Then, long short-term memory units are utilized to learn information of each fragment, and they are aggregated via mean-pooling operation to obtain the whole name representation for nationalities prediction of users. Disconnected fragments enable model to focus on subtle features among different personal names. Experiments on Olympic dataset and Aminer dataset show that the proposed model outperforms the existing models and the performance is satisfactory.
[1] LIU W, RUTHS D. What′s in a Name? Using First Names as Features for Gender Inference in Twitter // Proc of the AAAI Spring Symposium: Analyzing Microtext. Palo Alto, USA: AAAI Press, 2013: 10-16. [2] KARIMI F, WAGNER C, LEMMERICH F, et al. Inferring Gender from Names on the Web: A Comparative Evaluation of Gender Detection Methods // Proc of the 25th International Conference Companion on World Wide Web. New York, USA: ACM, 2016: 53-54. [3] WOOD-DOUGHTY Z, ANDREWS N, MARVIN R, et al. Predicting Twitter User Demographics from Names Alone // Proc of the 2nd Workshop on Computational Modeling of People′s Opinions, Personality, and Emotions in Social Media. Stroudsburg, USA: ACL, 2018: 105-111. [4] AMBEKAR A, WARD C, MOHAMMED J, et al. Name-Ethnicity Classification from Open Sources // Proc of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA: ACM, 2009: 49-58. [5] CHANG J, ROSENN I, BACKSTROM L, et al. Epluribus: Ethni-city on Social Networks // Proc of the 4th International Conference on Weblogs and Social Media. Palo Alto, USA: AAAI Press, 2010: 18-25. [6] MISLOVE A, LEHMANN S, AHN Y Y, et al. Understanding the Demographics of Twitter Users // Proc of the 5th International AAAI Conference on Weblogs and Social Media. Palo Alto, USA: AAAI Press, 2012: 554-557. [7] TREERATPITUK P, GILES C L. Name-Ethnicity Classification and Ethnicity-Sensitive Name Matching // Proc of the 26th AAAI Conference on Artificial Intelligence. Palo Alto, USA: AAAI Press, 2012: 1141-1147. [8] ANDREW H J. What′s in a Name? A Method for Extracting Information about Ethnicity from Names. Political Analysis, 2015, 23(2): 212-224. [9] BERGSMA S, DREDZE M, VAN DURME B, et al. Broadly Improving User Classification via Communication-Based Name and Location Clustering on Twitter // Proc of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg, USA: ACL, 2013: 1010-1019. [10] HUANG W Y, WEBER I, VIEWEG S. Inferring Nationalities of Twitter Users and Studying International Linking // Proc of the 25th ACM Conference on Hypertext and Social Media. New York, USA: ACM, 2014: 237-242. [11] PENNACCHIOTTI M, POPESCU A. A Machine Learning Approa-ch to Twitter User Classification // Proc of the 5th AAAI International Conference on Weblogs and Social Media. Palo Alto, USA: AAAI Press, 2011: 281-288. [12] BHARGAVA A, KONDRAK G. Language Identification of Names with SVMs // Proc of the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Stroudsburg, USA: ACL, 2010: 693-696. [13] KRIZHEVSKY A, SUTSKEVER L, HINTON G E. ImageNet Cla-ssification with Deep Convolutional Neural Networks // PEREIRA F, BURGES C J C, BOTTOU L, et al., eds. Advances in Neural Information Processing Systems 25. Cambridge, USA: The MIT Press, 2012: 1106-1114. [14] SIMONYAN K, ZISSERMAN A. Very Deep Convolutional Networks for Large-Scale Image Recognition[C/OL]. [2018-02-14]. https://arxiv.org/pdf/1409.1556.pdf. [15] MIKOLOV T, CHEN K, CORRADO G, et al. Efficient Estimation of Word Representations in Vector Space[C/OL]. [2018-02-14]. https://arxiv.org/pdf/1301.3781.pdf. [16] PENNINGTON J, SOCHER R, MANNING C D. Glove: Global Vectors for Word Representation // Proc of the Conference on Empirical Methods in Natural Language Processing. Stroudsburg, USA: ACL, 2014: 1532-1543. [17] PEROZZI B, AL-RFOU R, SKIENA S. DeepWalk: Online Lear-ning of Social Representations // Proc of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, USA: ACM, 2014: 701-710. [18] TANG J, MU M, WANG M Z, et al. LINE: Large-Scale Information Network Embedding // Proc of the 24th International Confe-rence on World Wide Web. New York, USA: ACM, 2015: 1067-1077. [19] YE J T, HAN S C, HU Y F, et al. Nationality Classification Using Name Embeddings // Proc of the ACM Conference on Information and Knowledge Management. New York, USA: ACM, 2017: 1897-1906. [20] HOCHREITER S, SCHMIDHUBER J. Long Short-Term Memory. Neural Computation, 1997, 9(8). DOI: 10.1162/neco.1997.9.8.1735. [21] CHUNG J, GULCEHRE C, CHO K H, et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling[C/OL]. [2018-02-11]. https://arxiv.org/pdf/1412.3555.pdf. [22] LEE J, KIM H, KO M, et al. Name Nationality Classification with Recurrent Neural Networks // Proc of the 26th International Joint Conference on Artificial Intelligence. Melbourne, Australia: IJCAI, 2017: 2081-2087. [23] WANG B X. Disconnected Recurrent Neural Networks for Text Categorization // Proc of the 56th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, USA: ACL, 2018: 2311-2320. [24] KIM Y. Convolutional Neural Networks for Sentence Classification // Proc of the Conference on Empirical Methods in Natural Language Processing. Stroudsburg, USA: ACL, 2014: 1746-1751.