计算机科学 ›› 2013, Vol. 40 ›› Issue (6): 178-182.

• 软件与数据库技术 • 上一篇    下一篇

基于关键字之间结构关系的XML查询结果排序方法

任建华,周建,孟祥福,魏珂   

  1. 辽宁工程技术大学电子与信息工程学院 葫芦岛125105;沈阳炮兵学院通信指挥系 沈阳111000;辽宁工程技术大学电子与信息工程学院 葫芦岛125105;辽宁工程技术大学电子与信息工程学院 葫芦岛125105
  • 出版日期:2018-11-16 发布日期:2018-11-16
  • 基金资助:
    本文受国家青年科学基金项目(61003162)资助

Results Ranking Approach of XML Keyword Search Based on Keyword’s Structural Relationships

REN Jian-hua,ZHOU Jian,MENG Xiang-fu and WEI Ke   

  • Online:2018-11-16 Published:2018-11-16

摘要: 非空结果的XML关键字查询中,多个查询关键字之间必然存在联系,这种联系可以通过SLCA(最紧致片段)的结构关系获得。基于SLCA的结构关系,提出了一种推测多个关键字内在联系的XML关键字查询结果排序方法:通过LISA II算法获得SLCA;根据SLCA的结构信息推测出各个关键字之间的内在结构关系,得到所有关键字组成的关系树;然后根据关系树中各关键字对查询结点的严格程度得到对应SLCA的重要程度,据此得到有序的SLCA并输出。该方法利用了XML文档的结构信息对查询结果进行排序。实验结果和分析表明,提出的方法具有较高的准确率,能够较好地满足当前用户的需求和偏好。

关键词: 关键字查询,SLCA,小枝查询,结果排序,准确率

Abstract: If the answer of an XML multi-keywords search is not empty,there would be some specific relationships between these keywords and such relationships can be speculated by SLCA (the smallest lowest common ancestor).This paper proposed an XML keywords query results ranking approach based on these relationships:the approach obtains the SLCAs by the LISA II algorithm,leverages the structures of SLCAs to speculate the interior structural relationships of keywords and to obtain the relationship tree.Then,the importance of each SLCA can be estimated by the strict degree of keywords to the query node in the relationship tree.The SLCAs are ranked according to their importance and the ordered SLCAs are treated as the ranked XML keywords query results.The experimental results demonstrate that the approach presented in this paper has the high precision,and can efficiently meet the user’s needs as well.

Key words: Keywords search,SLCA,Twig query,Results ranking,Precision

[1] Spink A,Jansen B J,Wolfram D,et al.From e-sex to e-com-merce:web search changes [J].IEEE computer,2002,35(3):107-109
[2] 黄静,陆嘉恒,孟小峰.高效的XML关键字查询改写和结果生成技术[J].计算机研究与发展,2010,47(5):841-848
[3] Liu Z,Chen Y.Identifying meaningful return information forXML keyword search [C]∥Proceedings of the ACM SIGMOD Conference.2007:329-340
[4] 周军锋,孟小峰,张新,等.XML数据流上基于关键字的多查询处理[J].计算机研究与发展,2007,44(5):392-397
[5] 郭文琪,温馨,王鹏,等.Ropeway:基于语义相关的XML关键字搜索引擎[J].计算机研究与发展,2010,47(Suppl.):470-474
[6] 许建军,汪卫,施伯乐.一种基于XLCA的XML关键字搜索方法[J].小型微型计算机系统,2008,29(1) 52-56
[7] Li L,Lee M L,Hsu W E,et al.A prüfer based approach to process top-k queries in XML [C]∥Proceedings of the DEXA Conference.2009:348-355
[8] Li J X,Liu C F,et al.Efficient top-k search across heterogeneous XML data sources [C]∥Proceedings of the DASFAA Conference,LNCS 4947.2008:314-329
[9] Bao Z F,Ling T W,Chen B,et al.Effective XML keywordsearch with relevance oriented ranking [C]∥Proceedings of the ICDE Conference.2009:517-528
[10] 张雷.XML关键字查询中最紧致片段问题的研究[D]. 济南:山东大学,2009
[11] Sun C,Chan C Y,Goenka A K.Multiway SLCA-based keyword search in XML data [C]∥International World Wide Web Conference Committee (IW3C2).2007:1043-1052
[12] Lu J,Ling T W,Chan C Y,et al.From region encoding to extended Dewey:on efficient processing of XML twig pattern matching [C]∥Proceedings of the VLDB Conference.2005:193-204
[13] Xu Y,Papakonstantinou Y.Efficient keyword search for smallest LCAs in XML database [C]∥Proceedings of the ACM SIGMOD Conference.2005:776-787
[14] 孔令波,唐世渭,杨冬青,等.XML 信息检索中最小子树根结点问题的分层算法[J].软件学报,2007,18(4):919-932
[15] IBM Corporation XML data generator [EB/OL].http://www.alphaworks.ibm.com/tech/xmlgenerator,2010-08
[16] http://www.amazon.cn/[EB/OL].2010-06
[17] Su W,Wang J,Huang Q,et al.Query result ranking over e-commerce web databases [C]∥Proceedings of the ACM CIKM Conference.2006:575-584

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!