Computer Science ›› 2022, Vol. 49 ›› Issue (11A): 211100038-6.doi: 10.11896/jsjkx.211100038

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Multi-font Printed Uyghur-Kazakh-Kirghiz Keyword Image Recognition

SARDAR Parhat, ABDURAHMAN Kadir, ALIMJAN Yasin   

  1. School of Information Management,Xinjiang University of Finance and Economics,Urumqi 830012,China
  • Online:2022-11-10 Published:2022-11-21
  • About author:SARDAR Parhat,born in 1984,Ph.D.His main research interest includes text and image information retrieval.
    ALIMJAN Yasin,born in 1985,Ph.D.His main research interests include programming language and formal system.
  • Supported by:
    National Natural Science Foundation of China(61662073),2020 Xinjiang Uyghur Autonomeus Ragion Tianchi Doctor Plan Project and Xinjiang University of Finance and Economics School Level Scientific Research Foundation Project(2022XGC022,2022XGC049).

Abstract: Aiming at the problems of single font type,small size of recognition data,indistinguishable recognition fields and lack of research on Kazakh and Kirghiz printed character recognition,a multi-font printed Uyghur-Kazakh-Kirghiz keyword recognition method based on convolutional neural network(CNN) is proposed.Firstly,aiming at the problem of lack of Uyghur-Kazakh-Kirghiz printed image corpus,based on image synthesis technique,a Uyghur-Kazakh-Kirghiz keyword image data set including 32 font type is constructed.Secondly,using data augmentation technology to add different level of noise,rotation and distortion effects on these images to further reflect the natural scene features of the data set.Thirdly,using a multi-layer CNN network to train the image recognition model on this data set,and obtaining the recognition accuracy over 96.5%,and the accuracy of about 96% is obtained in the actual print image recognition task including 3 commonly used fonts.This method has fewer pre-proces-sing steps and it outperforms previous recognition approaches within the classical machine learning framework.Experimental results show that the recognition method based on synthetic image data can better realize the task of multi-font printed Uyghur-Kazakh-Kirghiz image recognition.

Key words: Uyghur-Kazakh-Kirghiz, OCR, Image synthesis, Convolutional neural network, Keyword image recognition

CLC Number: 

  • TP391
[1]DOERMANN D.The Indexing and Retrieval of Document Ima-ges:A Survey[J].Computer Vision and Image Understanding,1998,70(3):287-298.
[2]ALAEI F,ALAEI A,BLUMENSTEIN M,et al.A Brief Reviewof Document Image Retrieval Methods:Recent Advances[C]//2016 International Joint Conference on Neural Networks(IJCNN).IEEE,2016:3500-3507.
[3]LI L.Research on document image classification and retrievalmethod based on convolutional neural network[D].Huazhong:Huazhong University of Science and Technology,2017.
[4]NOCE L,GALLO I,ZAMBERLETTI A,et al.Embedded Textual Content for Document Image Classification with Convolutional Neural Networks[C]//Acm Symposium on Document Engineering.ACM,2016:165-173.
[5]DAS A,ROY S,BHATTACHARYA U.Document Image Classifification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks[J].ar-Xiv:1801.09321,2018.
[6]AUDEBERT N,HEROLD C,SLIMANI K,et al.Multimodal Deep Networks for Text and Image-based Document Classification[J].arXiv:1907.06370,2019.
[7]BAGADKAR S L,MALIK L G.Review on Extraction Techniques for Images,Textlines and Keywords From Document Image[C]//IEEE International Conference on Computational Intelligence & Computing Research.IEEE,2015:1-3.
[8]JIANG Y X,DING S C,WU P.A Study on the Classification of Features of Multi-Modal Information Based on BiLSTM-VGG16[J].Information Studies:Theory & Application,2021,44(11):180-186.
[9]HARLEY W A,UFKES A,DERPANIS K G.Evaluation ofDeep Convolutional Nets for Document Image Classification and Retrieval[J].arXiv:1502.07058,2015.
[10]AN Y H,DONG W Z.Research on Segmentation Method of Adhesive Characters based on Recognition Feedback[J].Journal of Hebei Academy of Sciences,2008,25(2):34-38.
[11]SHIN C,DOERMANN D.Structural Similarity for Document Image Classification and Retrieval[J].Pattern Recognition Letters,2014,43(1):119-126.
[12]ZHI T,HUANG W,TONG H.Detecting Text in Natural Image with Connectionist Text Proposal Network[J].arXiv:1609.03605,2015.
[13]RANJAN V,HARIT G,JAWAHAR C V.Enhancing WordImage Retrieval in Presence of Font Variations[C]//Proceedings of the 2014 22nd International Conference on Pattern Recognition.IEEE Computer Society,2014:2709-2714.
[14]CHEN Q,YUAN B S,LI X,et al.Research on Printed Uyghur Character Recognition based on Template Matching[J].Computer Technology and Development,2012,22(4):119-122.
[15]BAI Y H.Printed Uyghur Word Recognition[D].Xi’an:Xi’an University of Electronic Science and Technology,2014.
[16]LANG X.Printed Uyghur Word Recognition based on Segmentation[D].Xi’an:Xi’an University of Electronic Science and Technology,2015.
[17]WANG X D.Research and Application of Key Technologies for Printed Uyghur Character Recognition[D].Xi’an:Xi’an University of Electronic Science and Technology,2017.
[18]YU L,YASIN A.Printed Uyghur Character Recognition Method based on HOG Feature and MLP Classifier[J].Micro-computer Application,2017,33(6):30-33.
[19]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImageNet Classification with Deep Convolutional Neural Networks[J].Neural Information Processing Systems,2012,25:1106-1114.
[20]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:Towards Real-time Object Detection with Region Proposal Networks[C]//IEEE Transactions on Pattern Analysis and Machine Intelligence.2017:1137-1149.
[21]HUBEL D H,WEISEL T N.Receptive Fields,Binocular Intera-ction and Functional Architecture in the Cat’s Visual Cortex[J].The Journal of Physiology,1962,160(1):106-154.
[22]SARDAR P,MIJIT A,ASKAR H.Research on Keyword Extraction of Uyghur-Kazakh Text based on Stem Unit[J].Computer Engineering and Science,2020,42(1):131-137.
[23]CHRIS T,TONY M.Analysis of Convolutional Neural Net-works for Document Image Classifification[J].arXiv:1708-03273,2017.
[1] ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[2] CHEN Yong-quan, JIANG Ying. Analysis Method of APP User Behavior Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(8): 78-85.
[3] ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[4] DAI Zhao-xia, LI Jin-xin, ZHANG Xiang-dong, XU Xu, MEI Lin, ZHANG Liang. Super-resolution Reconstruction of MRI Based on DNGAN [J]. Computer Science, 2022, 49(7): 113-119.
[5] LIU Yue-hong, NIU Shao-hua, SHEN Xian-hao. Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(7): 127-131.
[6] XU Ming-ke, ZHANG Fan. Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition [J]. Computer Science, 2022, 49(7): 132-141.
[7] WU Zi-bin, YAN Qiao. Projected Gradient Descent Algorithm with Momentum [J]. Computer Science, 2022, 49(6A): 178-183.
[8] YANG Yue, FENG Tao, LIANG Hong, YANG Yang. Image Arbitrary Style Transfer via Criss-cross Attention [J]. Computer Science, 2022, 49(6A): 345-352.
[9] YANG Jian-nan, ZHANG Fan. Classification Method for Small Crops Combining Dual Attention Mechanisms and Hierarchical Network Structure [J]. Computer Science, 2022, 49(6A): 353-357.
[10] ZHANG Jia-hao, LIU Feng, QI Jia-yin. Lightweight Micro-expression Recognition Architecture Based on Bottleneck Transformer [J]. Computer Science, 2022, 49(6A): 370-377.
[11] WANG Jian-ming, CHEN Xiang-yu, YANG Zi-zhong, SHI Chen-yang, ZHANG Yu-hang, QIAN Zheng-kun. Influence of Different Data Augmentation Methods on Model Recognition Accuracy [J]. Computer Science, 2022, 49(6A): 418-423.
[12] SUN Jie-qi, LI Ya-feng, ZHANG Wen-bo, LIU Peng-hui. Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation [J]. Computer Science, 2022, 49(6A): 434-440.
[13] ZHAO Zheng-peng, LI Jun-gang, PU Yuan-yuan. Low-light Image Enhancement Based on Retinex Theory by Convolutional Neural Network [J]. Computer Science, 2022, 49(6): 199-209.
[14] ZHANG Wen-xuan, WU Qin. Fine-grained Image Classification Based on Multi-branch Attention-augmentation [J]. Computer Science, 2022, 49(5): 105-112.
[15] ZHAO Ren-xing, XU Pin-jie, LIU Yao. ECG-based Atrial Fibrillation Detection Based on Deep Convolutional Residual Neural Network [J]. Computer Science, 2022, 49(5): 186-193.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!