Computer Science, 2022, Vol. 49, Issue (11): 163-169. doi: 10.11896/jsjkx.210900225
HE Huang-xing (何皇兴)1, CHEN Ai-guo (陈爱国)1, WANG Jiao-long (王蛟龙)2
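Abstract: Handwritten document images often suffer from complex degradations such as uneven illumination, ink bleed-through, paper deterioration, and shadows. To address the poor OCR results obtained after binarizing document images with complex backgrounds, this paper proposes a binarization method that integrates improved background estimation with local adaptive thresholding. First, a local adaptive method produces a binarized image with high recall. Next, an improved background estimation method produces a binarized image with high precision. Finally, a connected-component-based method combines the two images into the final result. Comparative experiments using four evaluation metrics on the DIBCO2013 and DIBCO2016 handwritten datasets show that the overall performance of the proposed method exceeds that of classic algorithms such as Otsu, Wolf, Niblack, Sauvola, Singh, and Howe.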
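The abstract does not spell out the exact integration rule, so the following is only a minimal sketch of one plausible connected-component fusion, under a stated assumption: a connected component of the high-recall image is kept only if the high-precision image confirms at least one of its pixels. The function name `integrate_by_components`, the 0/1 list-of-rows image format, and the use of 8-connectivity are all illustrative choices, not details taken from the paper.

```python
from collections import deque


def integrate_by_components(high_recall, high_precision):
    """Fuse two binary images (lists of rows, 1 = ink, 0 = background).

    Assumed rule (not from the paper): keep every 8-connected foreground
    component of `high_recall` that contains at least one foreground
    pixel of `high_precision`; discard all other components as noise.
    """
    rows, cols = len(high_recall), len(high_recall[0])
    seen = [[False] * cols for _ in range(rows)]
    result = [[0] * cols for _ in range(rows)]

    for r in range(rows):
        for c in range(cols):
            if high_recall[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first search collects one connected component.
            component = []
            confirmed = False
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                component.append((y, x))
                if high_precision[y][x] == 1:
                    confirmed = True
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and not seen[ny][nx]
                                and high_recall[ny][nx] == 1):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            # Copy the component only if the precise image supports it.
            if confirmed:
                for y, x in component:
                    result[y][x] = 1
    return result
```

Under this assumed rule, the high-recall image supplies complete stroke shapes while the high-precision image acts as a seed that filters out components caused by stains or shadows. A real implementation would more likely rely on an image-processing library for the labeling step.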