Pattern Recognition and Artificial Intelligence (模式识别与人工智能)
  2020, Vol. 33 Issue (6): 530-541    DOI: 10.16451/j.cnki.issn1003-6059.202006006
Surveys and Reviews
An Overview of Natural Language Processing for Indonesian and Malay
JIANG Shengyi1,2, LI Shanshan1,2, FU Sihui1, LIN Nankai1,2
1. School of Information Science and Technology, Guangdong University of Foreign Studies, Guangzhou 510006
2. Guangzhou Key Laboratory of Multilingual Intelligent Processing, Guangdong University of Foreign Studies, Guangzhou 510006

Abstract  As Indonesian and Malay become more widely used, processing the massive volume of text in these two languages is increasingly important. Extensive research has been conducted on Indonesian and Malay. However, as low-resource languages, they have drawn less attention than widely studied languages, so deep learning methods cannot yet be fully exploited. This paper analyzes and summarizes research on Indonesian and Malay morphological analysis, syntactic parsing, machine translation, spelling checking, and related tasks. In most of the reported work, algorithms cannot be compared objectively because their corpus scales and evaluation metrics differ. Finally, problems and future directions for natural language processing of Indonesian and Malay are discussed in light of the open language resources available in various fields.
Key words: Indonesian; Malay; Agglutinative Language; Low-Resource Language; Natural Language Processing
Received: 26 March 2020     
ZTFLH: TP 312  
Fund: National Natural Science Foundation of China (No. 61572145), Science and Technology Program of Guangzhou (No. 202002030227)
Corresponding author: JIANG Shengyi, Ph.D., professor. His research interests include data mining and natural language processing.
About the authors: LI Shanshan, master student. Her research interests include data mining and natural language processing. FU Sihui, master student. Her research interests include natural language processing. LIN Nankai, master student. His research interests include data mining and natural language processing.
Cite this article:
JIANG Shengyi, LI Shanshan, FU Sihui, et al. An Overview of Natural Language Processing for Indonesian and Malay[J]. Pattern Recognition and Artificial Intelligence, 2020, 33(6): 530-541.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202006006      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2020/V33/I6/530
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence