Design and Implementation of Off-Line Handwritten Document Recognition System of Manchu Manuscript
ZHAO Ji (1), LI JingJiao (2), ZHANG GuangYuan (2), WAN Jie (1)
1. School of Computer Science and Engineering, Anshan Science and Technology University, Anshan 114044
2. School of Information Science and Engineering, Northeastern University, Shenyang 110004
Abstract: A system model is established for the off-line recognition of handwritten Manchu manuscripts. First, digital image processing methods are used to preprocess the target document and extract its words. Next, each extracted word is decomposed into stroke units, which are identified by a statistical pattern recognition method to obtain a stroke sequence. The stroke sequence is then converted into a root sequence, and a fuzzy identification method is used to output Manchu-Roman characters. A Hidden Markov Model is also used to post-process the recognition results of individual words and raise the recognition rate. Experimental results show that the recognition rate and the self-adaptability of the system increase substantially on the basis of single-font stroke learning and two-word co-occurrence probability statistics drawn from a large corpus.
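The HMM post-processing step can be illustrated with a minimal sketch. It assumes the recognizer returns a few scored candidates for each word and that the corpus supplies two-word co-occurrence probabilities; it then selects the best word sequence with the Viterbi algorithm. The function name `viterbi_postprocess`, the `floor` probability for unseen word pairs, and the sample candidates and scores are all hypothetical and are not taken from the paper.

```python
import math


def viterbi_postprocess(candidates, bigram, floor=1e-6):
    """Pick the most likely word sequence from per-position candidate lists.

    candidates: list of lists of (word, recognizer_score), score in (0, 1].
    bigram: dict mapping (previous_word, word) -> co-occurrence probability.
    floor: assumed probability for word pairs missing from the corpus.
    """
    if not candidates:
        return []
    # best[i][w] = (log-score of the best path ending in w at position i,
    #               previous word on that path)
    best = [{w: (math.log(s), None) for w, s in candidates[0]}]
    for position in candidates[1:]:
        column = {}
        for w, s in position:
            # Choose the predecessor that maximizes path score + transition.
            prev, score = max(
                ((p, ps + math.log(bigram.get((p, w), floor)))
                 for p, (ps, _) in best[-1].items()),
                key=lambda item: item[1],
            )
            column[w] = (score + math.log(s), prev)
        best.append(column)
    # Backtrack from the best-scoring final word.
    word = max(best[-1], key=lambda w: best[-1][w][0])
    path = [word]
    for i in range(len(best) - 1, 0, -1):
        word = best[i][word][1]
        path.append(word)
    return path[::-1]


# Illustrative input only: the scores and the bigram value are made up.
candidates = [
    [("manju", 0.6), ("manja", 0.4)],
    [("gisan", 0.55), ("gisun", 0.45)],
]
bigram = {("manju", "gisun"): 0.3}
result = viterbi_postprocess(candidates, bigram)
print(result)
```

In this made-up example the recognizer alone would prefer "gisan" for the second word, but the co-occurrence statistics favor the pair "manju gisun", so the post-processed sequence differs from the per-word best guess. Working in log space avoids numerical underflow on long word sequences.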
ZHAO Ji, LI JingJiao, ZHANG GuangYuan, WAN Jie. Design and Implementation of Off-Line Handwritten Document Recognition System of Manchu Manuscript [J]. Pattern Recognition and Artificial Intelligence, 2006, 19(6): 801-805.