计算机科学 ›› 2013, Vol. 40 ›› Issue (1): 273-276.
• 图形图像与模式识别 • 上一篇 下一篇
王刚,靳彦青,刘立柱,储瑞来
出版日期:
发布日期:
Online:
Published:
摘要: 针对目前基于统计特征和符号匹配的识别方法对字体较敏感的问题,提出一种基于多特征融合的东亚文种 识别算法。该算法首先分析并提取高频形状特征、排版特征以及字符复杂度特征,然后采用模糊集贴近度准则进行识 别。实验结果表明,该算法具有较高的识别准确率,并对不同字体具有较强的鲁棒性。
关键词: 文种识别,多特征,字符复杂度特征,贴近度
Abstract: Script identification has important applications in the field of document image information retrieval. An east asiatic script identification approach was proposed based on multi feature. Compared to traditional identification method based on statistical characteristics and symbols matching, the algorithm first analyzes and extracts the token shape matching features,layoutfeatures and character complexity features,and then uses closeness degree of fuzzy sets to i- dentify. The experimental results show that the algorithm has higher recognition accuracy and strong robustness to dif- ferent fonts.
Key words: Script identification, Multi feature, Character complexity features, Closeness degree
王刚,靳彦青,刘立柱,储瑞来. 基于多特征融合的东亚文种识别[J]. 计算机科学, 2013, 40(1): 273-276. https://doi.org/
0 / / 推荐
导出引用管理器 EndNote|Reference Manager|ProCite|BibTeX|RefWorks
链接本文: https://www.jsjkx.com/CN/
https://www.jsjkx.com/CN/Y2013/V40/I1/273
Cited