计算机科学 ›› 2010, Vol. 37 ›› Issue (1): 184-188.

• 软件工程与数据库技术 • 上一篇    下一篇

基于空间填充曲线网格划分的最近邻查询算法

徐红波,郝忠孝   

  1. (哈尔滨理工大学计算机科学与技术学院 哈尔滨150080);(哈尔滨工业大学计算机科学与技术学院 哈尔滨150001)
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受黑龙江省自然科学基金(F200601)资助。

Nearest-neighbor Query Algorithm Based on Grid Partition of Space-filling Curve

XU Hong-bo,HAO Zhong-xiao   

  • Online:2018-12-01 Published:2018-12-01

摘要: 在建树过程中,R树存在最小边界矩形之间重叠的现象。当数据量较大时,重叠现象尤为严重,基于R树最近部查询算法的性能急剧恶化。针对该问题,利用空间填充曲线的降低维度特性和数据聚类特性,提出一种基于网格划分最近邻查询算法。该算法将整个数据空间划分成大小相等、互不重叠的网格,对网格中的点进行线性排序之后,只需要访问查询点所在网格中的点及其周边部近网格中的点,就能够获得最近部。在Hilbert曲线、Z曲线和Gray曲线上实现3种最近部查询算法,在映射算法和数据聚类特性上实验比较3种曲线之间的性能差异。实验结果表明,算法的查询性能明显优于顺序扫描算法和基于R树的最近邻查询算法。

关键词: 空间填充曲线,网格划分,最近邻,降维

Abstract: As the overlap between minimum bounding rectangles in the directory of R-tree is increasing very rapidly with growing number of the data, the performance of the nearest neighbor query algorithm based on R-tree deteriorates rapidly. To avoid the problem, the paper presented a nearest-neighbor query algorithm based on grid partition of space filling curve. Space-filling curve has the properties of dimension reduction and data clustering. Using space-filling curve,the algorithm divides 2D space into equal-size grids, and map points in grids into linear space. It only visits points in the grid which query point lies in and points in near grids of ctuery point. The paper implemented nearest neighbor query algorithms based on Hilbert curve, Z curve and Gray curve, and compared the difference of mapping algorithm and data clustering between three curves. Experimental results indicate that the algorithm is better than brutcforce algorithm and near-neighbor cauery algorithm based on R-tree.

Key words: Nearest neighbors,Grid,Space-filling curve,Reduction of dimensionahty

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!