李婷,程海涛.模糊XML关键字近似查询方法研究[J].计算机科学,2017,44(9):216-221, 226
模糊XML关键字近似查询方法研究
Research of Approximate Keyword Query on Fuzzy XML Documents
投稿时间:2016-08-04  修订日期:2016-12-13
DOI:10.11896/j.issn.1002-137X.2017.09.040
中文关键词:  XML,关键字,近似查询,模糊,可能性
英文关键词:XML,Keyword,Approximate query,Fuzzy,Possibility
基金项目:
作者单位
李婷 东北大学计算机科学与工程学院 沈阳110819 
程海涛 东北大学计算机科学与工程学院 沈阳110819 
摘要点击次数: 232
全文下载次数: 111
中文摘要:
      在精确XML文档上的关键字查询方法的研究大多是基于LCA语义或者其变种语义(SLCA,ELCA等)开展的,将包含所有关键字的最紧致XML子树片段作为查询结果返回。但是这些基于LCA语义产生的查询结果中通常包含了大量的冗余信息,现实世界中存在着大量的不确定和模糊信息,因而如何从模糊XML文档中搜索到高质量的关键字查询结果是一个需要研究的问题。针对模糊XML文档上的关键字近似查询方法进行研究,通过引入最小连接树(MCT)的概念,提出在模糊XML文档上关键字查询的所有GDMCTs问题,并给出解决这一问题的基于栈的算法All fuzzy GDMCTs,该算法可以得到满足用户指定的子树大小阈值和可能性阈值条件的所有GDMCTs结果。实验表明,该算法在模糊XML文档上能够得到较高质量的关键字查询结果。
英文摘要:
      The study of keyword queries on crisp XML documents is carried out mainly based on the LCA semantics or its variant semantics (SLCA,ELCA),and the most compact XML subtrees containing all keywords are returned as the query results.However,the generated results based on the LCA semantics always contain a lot of redundant information,and there are a lot of uncertainty and fuzzy information exist in the real world.How to search the high quality results of keyword queries on fuzzy XML documents is an issue need to be studied.Aiming at investigating the method of approximate keyword query on fuzzy XML documents,firstly the concept of minimum connecting tree was introduced,All GDMCTs problem of keyword queries on fuzzy XML documents was proposed,and a stack based algorithm All fuzzy GDMCTs was given to solve the problem.The algorithm can get all the GDMCTs results satisfying the given subtree size threshold and possibility threshold.Experimental results show that the algorithm can get the high quality results of keyword queries on fuzzy XML documents.
查看全文  查看/发表评论  下载PDF阅读器