Computer Science ›› 2015, Vol. 42 ›› Issue (7): 300-304. doi: 10.11896/j.issn.1002-137X.2015.07.064

• Graphics, Image and Pattern Recognition •

Research on a Highly Compact Chinese Handwriting Recognition Method for Continuous Overlaid Writing

苏统华,戴洪良,张 健,马培军,邓胜春   

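  1. School of Software, Harbin Institute of Technology, Harbin 150001; School of Software, Harbin Institute of Technology, Harbin 150001; School of Materials Science and Engineering, Harbin Institute of Technology, Harbin 150001; School of Software, Harbin Institute of Technology, Harbin 150001; School of Software, Harbin Institute of Technology, Harbin 150001
  • Publication date: 2018-11-14  Release date: 2018-11-14
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61203260), the Heilongjiang Postdoctoral Fund (LBH-Q13066), and the Harbin Institute of Technology Research and Innovation Fund (HIT.NSRIF.2015083).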

Study on High Compact Recognition Method for Continuously Overlaid Chinese Handwriting

SU Tong-hua, DAI Hong-liang, ZHANG Jian, MA Pei-jun and DENG Sheng-chun   

  • Online: 2018-11-14  Published: 2018-11-14

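Abstract: Continuous handwriting recognition is the core of Chinese handwriting input technology, and entering Chinese text naturally and quickly has long been a goal of pattern recognition and of artificial intelligence more broadly. This paper proposes a method for recognizing continuously overlaid Chinese characters that effectively overcomes the limits of small screens. The method builds on a decoding framework that integrates segmentation and recognition: it first applies an over-segmentation algorithm to the input writing trajectory, then uses a novel perceptron algorithm to determine character boundaries, and finally decodes the path using multiple kinds of contextual information from the character classification model, the geometric model, and the language model. To suit different types of mobile terminals, a method for efficiently compressing the character classification model is also proposed, which reduces the storage and memory used during character recognition. The recognition method has been deployed on the Android platform and evaluated in large-scale tests. The experimental results confirm its performance and efficiency.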

Keywords: pattern recognition, continuous overlaid Chinese handwriting, stroke classification, classifier compression, beam search

Abstract: Continuous handwriting recognition is the primary bottleneck for Chinese handwritten character input methods. Entering Chinese text naturally and quickly is a fundamental goal of pattern recognition and of artificial intelligence more broadly. A novel recognition method is proposed for overlaid Chinese handwriting. It follows a segmentation-recognition integrated framework. First, an over-segmentation algorithm partitions the handwriting trajectory. Then a perceptron algorithm locates the candidate character boundaries. Finally, multiple contexts, including the character recognition score, the geometric score, and the linguistic score, are used to decode the optimal recognition path. To suit different mobile terminals, a compression algorithm is proposed that makes the character classifier compact, reducing both its memory footprint and its disk storage. The method has been ported to the Android platform, enabling overlaid Chinese handwriting input on smartphones, and tested on a large set of overlaid Chinese handwriting samples. Experimental results verify the effectiveness and efficiency of the method. It also runs smoothly on smartphones, where overlaid handwriting input makes handwriting entry markedly more efficient.

Key words: Pattern recognition, Overlaid Chinese handwriting, Stroke classification, Classifier compression, Beam search
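
The decoding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives neither the scoring functions nor the perceptron boundary detector, so everything here is assumed for illustration. That includes the function name `beam_decode`, the callbacks `classify`, `geometry`, and `language`, the weights, and the toy scores. The sketch assumes over-segmentation has already produced `n_segments` primitive pieces, and it runs a beam search over the ways of merging consecutive pieces into characters, adding up weighted classifier, geometric, and language scores along each path.

```python
from typing import Callable, Dict, List, Tuple

Candidate = Tuple[str, float]  # (character label, classifier log-score); hypothetical type


def beam_decode(
    n_segments: int,
    classify: Callable[[int, int], List[Candidate]],
    geometry: Callable[[int, int], float],
    language: Callable[[str, str], float],
    max_merge: int = 3,
    beam_width: int = 5,
    weights: Tuple[float, float, float] = (1.0, 0.5, 0.5),
) -> Tuple[str, float]:
    """Return the best (text, score) over groupings of segments into characters."""
    w_cls, w_geo, w_lm = weights
    # beams[i] keeps the top partial hypotheses that consume exactly the first i segments.
    beams: List[List[Tuple[float, str]]] = [[] for _ in range(n_segments + 1)]
    beams[0] = [(0.0, "")]
    for end in range(1, n_segments + 1):
        hyps: List[Tuple[float, str]] = []
        # A candidate character spans segments [start, end), at most max_merge of them.
        for start in range(max(0, end - max_merge), end):
            geo = geometry(start, end)
            for score, text in beams[start]:
                prev = text[-1] if text else "<s>"
                for char, cls in classify(start, end):
                    total = (score + w_cls * cls + w_geo * geo
                             + w_lm * language(prev, char))
                    hyps.append((total, text + char))
        hyps.sort(key=lambda h: h[0], reverse=True)
        beams[end] = hyps[:beam_width]  # prune to the beam width
    if not beams[n_segments]:
        raise ValueError("no complete path covers all segments")
    best_score, best_text = beams[n_segments][0]
    return best_text, best_score


# Toy lattice over three segments; all labels and scores are made up.
TOY: Dict[Tuple[int, int], List[Candidate]] = {
    (0, 1): [("一", -1.0)],
    (1, 2): [("丨", -2.0)],
    (0, 2): [("十", -0.5)],
    (2, 3): [("口", -0.3)],
    (1, 3): [("中", -3.0)],
    (0, 3): [("田", -4.0)],
}


def toy_classify(start: int, end: int) -> List[Candidate]:
    return TOY.get((start, end), [])


def toy_geometry(start: int, end: int) -> float:
    return 0.0  # placeholder: a real geometric model would score size and position


def toy_language(prev: str, char: str) -> float:
    return 0.0  # placeholder: a real language model would score the character pair


if __name__ == "__main__":
    print(beam_decode(3, toy_classify, toy_geometry, toy_language))
```

In this toy lattice the path that merges the first two segments into one character and keeps the third as a second character has the highest total score, so it is the one returned. Keeping only `beam_width` hypotheses per position bounds the work per segment, which matters on memory-constrained mobile devices.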

