计算机科学 ›› 2013, Vol. 40 ›› Issue (1): 273-276.

• 图形图像与模式识别 • 上一篇    下一篇

基于多特征融合的东亚文种识别

王刚,靳彦青,刘立柱,储瑞来   

  1. (解放军信息工程大学 郑州450002);(国家数字交换系统工程技术研究中心 郑州450002);(南京理工大学 南京210094)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Fast Asian Script Identification Based on Multi-feature

  • Online:2018-11-16 Published:2018-11-16

摘要: 针对目前基于统计特征和符号匹配的识别方法对字体较敏感的问题,提出一种基于多特征融合的东亚文种 识别算法。该算法首先分析并提取高频形状特征、排版特征以及字符复杂度特征,然后采用模糊集贴近度准则进行识 别。实验结果表明,该算法具有较高的识别准确率,并对不同字体具有较强的鲁棒性。

关键词: 文种识别,多特征,字符复杂度特征,贴近度

Abstract: Script identification has important applications in the field of document image information retrieval. An east asiatic script identification approach was proposed based on multi feature. Compared to traditional identification method based on statistical characteristics and symbols matching, the algorithm first analyzes and extracts the token shape matching features,layoutfeatures and character complexity features,and then uses closeness degree of fuzzy sets to i- dentify. The experimental results show that the algorithm has higher recognition accuracy and strong robustness to dif- ferent fonts.

Key words: Script identification, Multi feature, Character complexity features, Closeness degree

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!