Computer Science ›› 2015, Vol. 42 ›› Issue (2): 185-190. doi: 10.11896/j.issn.1002-137X.2015.02.040

• Software and Database Technology •

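Structural Difference Recognition and Dispelling in Schema Matching

DU Xiao-kun, LI Guo-hui and LI Yan-hong

  1. College of Computer Science, South-Central University for Nationalities, Wuhan 430074; School of Computer Science, Huazhong University of Science and Technology, Wuhan 430074; College of Computer Science, South-Central University for Nationalities, Wuhan 430074
  • Online: 2018-11-14 Published: 2018-11-14
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61173049), the Natural Science Foundation of Hubei Province (2014CFB915), and the Fundamental Research Funds for the Central Universities (CZQ14015).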

Abstract: Schema matching is a key problem in active research areas such as dataspaces and the Semantic Web. Existing approaches operate on schema elements: they derive the semantics of each element from its own information, its structural information, and its data, and then select semantically similar elements as matches, with good results. However, large differences between schemas in element information and structural information seriously hinder the extraction of semantics. This paper analyzes the causes of structural differences between schemas, summarizes several common forms of such differences, and gives corresponding detection and dispelling algorithms to eliminate them. Experiments show that dispelling these differences before matching significantly improves the accuracy of the matching results.

Key words: Schema matching, Structural information, Structural difference

