计算机科学 ›› 2020, Vol. 47 ›› Issue (11A): 231-235.doi: 10.11896/jsjkx.191000128
田献珍1, 孙立强2, 田振中1
TIAN Xian-zhen1, SUN Li-qiang2, TIAN Zhen-zhong1
摘要: 借助于计算机将大量规则的文档碎片重建修复,可以极大地提高工作效率,降低人工成本,因此该方面的工作受到学术界的普遍关注。目前,形状规则的英文碎片匹配主要面临3个方面的问题:1)碎片特征提取困难;2)拼接效率低;3)拼接精确度低。针对问题一,通过一系列数据统计处理,排除英文字母高低不一的干扰因素,提取每行字符的标准像素高度作为碎片的特征向量;针对问题二,通过建立优化模型,在保证每类碎片个数相同的前提下,使用蚁群算法进行横向快速聚类;针对问题三,通过对字符8邻域内的像素灰度值进行统计,建立两幅碎片的距离函数,并通过蚁群算法进行匹配及精确聚类。最后,以2013年全国高教杯数学建模的B题附件5的碎片为实验对象,验证该方法的可行性和有效性。
中图分类号:
[1] LIU J G,WU Z P,LIU S Q,et al.A merging algorithm for images based on segmentation of feature regions[J].Journal of Xidian University,2002,29(6):768-771. [2] HE P F,ZHOU Z T,HU D W.Reconstruction of the Ripped-Up Documents Based on Ant Colony Optimization[J].Computer Engineering & Science,2011,33(7):67-73. [3] BISWAS A,BHOWMICK P,BHATTACHARYA B B.Reconstruction of torn documents using contour maps[C]//Proceedings of the 2005 IEEE International Conference on Image Processing.Piscat away:IEEE,2005,3:517-520. [4] ZHAO K Y,SHU Y,DUAN X.Re-assembly algorithm of fragments based on literal characteristics of scrapped paper[J].Journal Computer Applications,2014,34(S2):271-273,309. [5] LIU Q J,CHEN P,WANG Z Y.Algorithm Design on Scraps of Paper Splicing Based on Text Feature[J].Research and Exploration in Laboratory,2016,35(11):110-113. [6] LIU Q J,YU J X,WANG Z Y. Algorithm Design on Scraps of Paper Splicing Based on Grey Level[J]. Research and Exploration in Laboratory,2016,35(7):16-19. [7] ZHOU Y F,WANG S J,HUANG Y B. Double-sided shreds restoration based on English letters feature[J].Journal of Image and Graphics,2015,20(1):85-94. [8] PAIXAO T M,BERRIEL R F,BOERES M C S,et al.A deep learning-based compatibility score for reconstruction of strip-shredded text documents[C]//31st SIBGRAPI Conference on Graphics,Patterns and Images (SIBGRAPI).2018. |
[1] | 崔彤彤, 王桂玲, 高晶. 基于1DCNN-LSTM的船舶轨迹分类方法 Ship Trajectory Classification Method Based on 1DCNN-LSTM 计算机科学, 2020, 47(9): 175-184. https://doi.org/10.11896/jsjkx.191000162 |
[2] | 蓝章礼, 申德兴, 曹娟, 张玉欣. 一种基图像提取和内容无关图像重构方法研究 Content-independent Method for Basis Image Extraction and Image Reconstruction 计算机科学, 2020, 47(6A): 226-229. https://doi.org/10.11896/JsJkx.200160009 |
[3] | 杨旭华,沈敏. 基于特征向量局部相似性的社区检测算法 Community Detection Algorithm Based on Local Similarity of Feature Vectors 计算机科学, 2020, 47(2): 58-64. https://doi.org/10.11896/jsjkx.181202433 |
[4] | 万卓昊,徐冬冬,梁生,黄保华. 基于N-Gram的SQL注入检测研究 Study on SQL Injection Detection Based on N-Gram 计算机科学, 2019, 46(7): 108-113. https://doi.org/10.11896/j.issn.1002-137X.2019.07.017 |
[5] | 马李昕, 李凤坤. 一种轻量级的车牌字符识别算法 Light-weight Recognition Algorithm of Vehicle License Plate Characters 计算机科学, 2019, 46(6A): 239-241. |
[6] | 赵宁博, 刘伟, 罗嵘, 胡顺仁. 无线传感器节点工作模式转换策略优化模型 Optimization Model of Working Mode Transformation Strategies for Wireless Sensor Nodes 计算机科学, 2019, 46(5): 44-49. https://doi.org/10.11896/j.issn.1002-137X.2019.05.006 |
[7] | 孙雪强, 黄旻, 张桂峰, 赵宝玮, 丛麟骁. 基于改进SIFT的多光谱图像匹配算法 Multispectral Image Matching Algorithm Based on Improved SIFT 计算机科学, 2019, 46(4): 280-284. https://doi.org/10.11896/j.issn.1002-137X.2019.04.044 |
[8] | 包宗铭, 龚声蓉, 钟珊, 燕然, 戴兴华. 基于双向KNN排序优化的行人再识别算法 Person Re-identification Algorithm Based on Bidirectional KNN Ranking Optimization 计算机科学, 2019, 46(11): 267-271. https://doi.org/10.11896/jsjkx.181001861 |
[9] | 罗殊彦, 朱怡安, 曾诚. 嵌入式异构多核处理器核间的通信性能评估与优化 Performance Evaluation and Optimization of Inter-cores Communication for Heterogeneous Multi-core Processor Unit 计算机科学, 2018, 45(6A): 262-265. |
[10] | 赵澄, 陈君新, 姚明海. 基于SVM分类器的XSS攻击检测技术 XSS Attack Detection Technology Based on SVM Classifier 计算机科学, 2018, 45(11A): 356-360. |
[11] | 陆亿红,张振宁,杨雄. 一种基于节点特征向量的复杂网络社团发现算法 Community Structure Detection Algorithm Based on Nodes’ Eigenvectors 计算机科学, 2017, 44(Z6): 419-423. https://doi.org/10.11896/j.issn.1002-137X.2017.6A.094 |
[12] | 李浩君,杜兆宏,邱飞岳. 基于混合遗传算法的任务驱动分组优化研究 Optimized Research for Task-driven Grouping Based on Hybrid Genetic Algorithm 计算机科学, 2017, 44(Z6): 105-108. https://doi.org/10.11896/j.issn.1002-137X.2017.6A.022 |
[13] | 林江豪,周咏梅,阳爱民,陈锦. 基于语义相似度的情感特征向量提取方法 Extraction Method of Sentimental Feature Vector Based on Semantic Similarity 计算机科学, 2017, 44(10): 296-301. https://doi.org/10.11896/j.issn.1002-137X.2017.10.053 |
[14] | 王科特,王力生,廖新考. 基于多核处理器的K线程低能耗的任务调度优化算法 K-threaded Low Energy-consuming Task Scheduling Optimization Algorithm Based on Multi-core Processors 计算机科学, 2015, 42(2): 18-23. https://doi.org/10.11896/j.issn.1002-137X.2015.02.004 |
[15] | 齐乃新,曹立佳,杨小冈,李冰. 基于方向约束的改进SIFT匹配算法 Improved SIFT Matching Algorithm Based on Orientation Constraint 计算机科学, 2014, 41(Z6): 125-128. |
|