Computer Science ›› 2013, Vol. 40 ›› Issue (3): 232-237.

• Software and Database Technology •

Parallel Top-k Keyword Search Algorithm in Probabilistic XML Documents

Zhou Xiaoping, Shi Yimin, Zhang Jun

  1. (School of Information Science and Technology, Dalian Maritime University, Dalian 116026)
  • Online: 2018-11-16 Published: 2018-11-16

Abstract: Probabilistic XML is an effective way to describe uncertain data, and Dewey encoding is an important technique for encoding the keyword indexes of XML documents. During keyword search over large probabilistic XML documents, however, frequently comparing the Dewey codes of keyword index entries is very time-consuming. To address this problem, the probabilistic XML document is partitioned into several fragments, a Dewey encoding strategy for keyword indexes suited to the characteristics of probabilistic XML documents is designed, and a parallel top-k keyword search algorithm, PTKS (Parallel Top-k Keyword Search Algorithm), is proposed. Experimental results show that PTKS improves the time efficiency of keyword search over probabilistic XML documents, and that the improvement is especially significant when the document structure is highly complex.

Key words: Probabilistic XML, Smallest lowest common ancestor (SLCA), XML document partition, Dewey code, Parallel search
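
To illustrate why Dewey codes are compared so often during keyword search, the following is a minimal sketch in Python of the standard prefix-based operations on Dewey codes. It is not the PTKS algorithm: it ignores node probabilities, document partitioning, top-k ranking, and parallelism, and it handles only two keywords with a naive pairwise scan. All function names are hypothetical and chosen for illustration only.

```python
from typing import Iterable, List, Tuple

# A Dewey code identifies a node by the path of child positions from the root,
# e.g. "1.2.3" is the 3rd child of the 2nd child of the root "1".
Dewey = Tuple[int, ...]


def parse_dewey(label: str) -> Dewey:
    """Convert a dotted label such as '1.2.3' into a tuple of integers."""
    return tuple(int(part) for part in label.split("."))


def lca(a: Dewey, b: Dewey) -> Dewey:
    """The lowest common ancestor is the longest common prefix of two codes."""
    prefix = []
    for x, y in zip(a, b):
        if x != y:
            break
        prefix.append(x)
    return tuple(prefix)


def is_ancestor_or_self(a: Dewey, b: Dewey) -> bool:
    """True when code a is a prefix of code b."""
    return len(a) <= len(b) and b[: len(a)] == a


def slca(nodes_kw1: Iterable[Dewey], nodes_kw2: Iterable[Dewey]) -> List[Dewey]:
    """Naive SLCA for two keywords (illustrative assumption, not PTKS).

    Collect every pairwise LCA, then keep only those that have no proper
    descendant among the collected LCAs.
    """
    list2 = list(nodes_kw2)
    candidates = {lca(a, b) for a in nodes_kw1 for b in list2}
    return sorted(
        c
        for c in candidates
        if not any(c != d and is_ancestor_or_self(c, d) for d in candidates)
    )


if __name__ == "__main__":
    kw1 = [parse_dewey("1.1.1"), parse_dewey("1.2.1")]
    kw2 = [parse_dewey("1.1.2"), parse_dewey("1.3")]
    print(slca(kw1, kw2))
```

Even in this small sketch, every candidate requires a prefix comparison against the other codes, so the number of comparisons grows quickly with the size of the keyword index lists, which is the cost the abstract describes as time-consuming.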
