计算机科学 ›› 2022, Vol. 49 ›› Issue (1): 166-174.doi: 10.11896/jsjkx.201000186

• 数据库&大数据&数据科学 • 上一篇    下一篇

星型高影响的空间co-location模式挖掘

马董, 李新源, 陈红梅, 肖清   

  1. 云南大学信息学院 昆明650504
  • 收稿日期:2020-10-30 修回日期:2021-03-25 出版日期:2022-01-15 发布日期:2022-01-18
  • 通讯作者: 陈红梅(hmchen@ynu.edu.cn)
  • 作者简介:md0301@mail.ynu.edu.cn
  • 基金资助:
    国家自然科学基金(61662086,61966036);云南省创新团队项目(2018HC019)
    This work was supported by the National Natural Science Foundation of China(61662086,61966036) and Project of Innovative Research Team of Yunnan Province(2018HC019).

Mining Spatial co-location Patterns with Star High Influence

MA Dong, LI Xin-yuan, CHEN Hong-mei, XIAO Qing   

  1. School of Information Science and Engineering,Yunnan University,Kunming 650504,China
  • Received:2020-10-30 Revised:2021-03-25 Online:2022-01-15 Published:2022-01-18
  • About author:MA Dong,born in 1992,master.His main research interests include spatial data mining and so on.
    CHEN Hong-mei,born in 1976,Ph.D,associate professor.Her research in-terests include database and spatial data mining.
  • Supported by:
    Joint Funds of the National Natural Science Foundation of China(U1611263).

摘要: 空间co-location模式是其实例在空间邻域内频繁并置出现的一组空间特征集。传统的空间co-location模式挖掘方法通常假设空间实例相互独立,并采用参与度作为模式有趣性的唯一度量指标,没有考虑不同特征或相同特征不同实例在空间邻域内所产生的影响差异,因此挖掘的结果往往缺乏相关性和可解释性。文中提出了一种星型高影响的空间co-location模式及挖掘方法,能够有效发现自身影响高且在邻域范围内也具有一定影响的空间co-location模式。首先,定义了度量模式影响的两个指标:模式影响参与度和模式影响占有度。其次,提出了挖掘星型高影响co-location模式的基础挖掘算法和剪枝策略。最后,通过在大量的真实和合成数据集上进行实验,分析了挖掘算法的效率和挖掘效果。实验结果表明,所提出的星型高影响co-location模式的度量方法和挖掘算法能够挖掘出较强相关性的co-location模式。

关键词: 高影响模式, 空间co-location模式, 空间数据挖掘, 星型影响

Abstract: The spatial co-location pattern is a group of spatial features whose instances are frequently collocated in the spatial neighborhood.Traditional spatial co-location pattern mining methods usually assume that the spatial instances are independent each other,and use participation index (PI) to measure the patterns.They don't consider the influence of different features or different instances of the same feature so that the mining results are often lack of relevance and interpretability.This paper proposes the spatial co-location pattern with star high influence which has influence in the neighborhood,and its mining method.Firstly,this paper defines two indicators to measure the influence of the pattern:influence participation index (IPI) and influence occupancy index (IOI).Secondly,a basic algorithm and pruning strategies for mining co-location patterns with star high influence are proposed.Finally,the experimental results on real and synthetic data sets show that the proposed method can discover the strong relevant co-location patterns.

Key words: High influence pattern, Spatial co-location pattern, Spatial data mining, Star influence

中图分类号: 

  • TP391
[1]WANG L Z,CHEN H M.Spatial Pattern Mining Theory and Methods[M].Beijing:Science Press,2014:2-4.
[2]AN S,YANG H,WANG J,et al.Mining urban recurrent congestion evolution patterns from GPS equipped vehicle mobility data[J].Information Sciences,2016,373:515-526.
[3]WU C F,CAI L,LI J,et al.Frequent Pattern Mining of Residents' Travel Based on Multi-source Location Data[J].Compu-ter Science,2021,48(7):155-163.
[4]SUN T X,ZHAO Y L,LIAN Z W,et al.Mobility Pattern Mi-ning for People Flow Based on Spatio-Temporal Data[J].Computer Science,2020,47(10):91-96.
[5]AKBARI M,SAMADAZDEGAN F,WEIBEL R.A generic regional spatio-temporal co-occurrence pattern mining model:a case study for air pollution[J].Journal of Geographical Systems,2015,17(3):249-274.
[6]HUANG Y,SHEKHAR S,XIONG H.Discovering co-location patterns from spatial data sets:a general approach[J].IEEE Transactions on Knowledge and Data Engineering,2004,16(12):1472-1485.
[7]YOO J S,SHEKHAR S.A join-less approach for mining spatial colocation patterns[J].IEEE Transactions on Knowledge and Data Engineering,2006,18(10):1323-1337.
[8]YOO J S,BOULWARE D,KIMMEY D.A parallel spatial co-location mining algorithm based on MapReduce[C]//2014 IEEE International Congress on Big Data.IEEE,2014:25-31.
[9]YANG P,WANG L,WANG X.A parallel spatial co-locationpattern mining approach based on ordered clique growth[C]//International Conference on Database Systems for Advanced Applications.Cham:Springer,2018:734-742.
[10]LU Y,WANG L Z,ZHANG X F.Mining frequent co-location patterns from uncertain data[J].Journal of Frontiers of Compu-ter Science and Technology,2009,3(6):656-664.
[11]OUYANG Z P,WANG L Z,CHEN H M.Mining spatial co-location patterns for fuzzy objects[J].Chinese Journal of Computers,2011,34(10):1947-1955.
[12]FANG Y,WANG L,HU T.Spatial co-location pattern mining based on density peaks clustering and fuzzy theory[C]//Proceedings of the 2018 Asia-Pacific Web(APWeb)and Web-Age Information Management (WAIM) Joint International Confe-rence on Web and Big Data,LNCS 10988.Cham:Springer,2018:298-305.
[13]ZENG X,YANG J.Co-location patterns mining with time constraint[J].Computer Science,2016,43(2):293-296.
[14]QIAN F,YIN L,HE Q,et al.Mining spatio-temporal co-location patterns with weighted sliding window[C]//2009 IEEE International Conference on Intelligent Computing and Intelligent Systems.IEEE,2009,3:181-185.
[15]WANG L,BAO X,CHEN H,et al.Effective lossless condensed representation and discovery of spatial co-location patterns[J].Information Sciences,2018,436:197-213.
[16]YAO X,PENG L,YANG L,et al.A fast space-saving algorithm for maximal co-location pattern mining[J].Expert Systems with Applications,2016,63:310-323.
[17]BAO X,WANG L,ZHAO J.Mining top-k-size maximal co-location patterns[C]//2016 International Conference on Computer,Information and Telecommunication Systems (CITS).IEEE,2016:1-6.
[18]FANG Y,WANG L,WANG X,et al.Mining co-location patterns with dominant features[C]//International Conference on Web Information Systems Engineering.Cham:Springer,2017:183-198.
[19]WANG L,BAO X,ZHOU L,et al.Maximal sub-prevalent co-loca-tion patterns and efficient mining algorithms[C]//Proceedings of the 2017 International Conference on Web Information Systems Engineering,LNCS 10569.Cham:Springer,2017:199-214.
[20]WANG L,BAO X,ZHOU L,et al.Mining maximal sub-prevalent co-location patterns[J].World Wide Web,2019,22(5):1971-1997.
[21]YANG S S,WANG L Z,LU J L,et al.Primary Exploration for Mining Spatial High Utility Co-location Patterns[J].Journal of Chinese Mini-Micro Computer Systems,2014,35(10):2302-2307.
[1] 刘新斌, 王丽珍, 周丽华.
MLCPM-UC:一种基于模式实例分布均匀系数的多级co-location模式挖掘算法
MLCPM-UC:A Multi-level Co-location Pattern Mining Algorithm Based on Uniform Coefficient of Pattern Instance Distribution
计算机科学, 2021, 48(11): 208-218. https://doi.org/10.11896/jsjkx.201000097
[2] 周剑云,王丽珍,杨增芳.
基于加权欧氏距离的空间Co-location模式挖掘算法研究
Algorithm of Mining Spatial Co-location Patterns Based on Weighted Euclidean Distance
计算机科学, 2014, 41(Z6): 425-428.
[3] 崔阳,杨炳儒.
超图在数据挖掘领域中的几个应用
Application of Hypergraph in Data Mining
计算机科学, 2010, 37(6): 220-222.
[4] 胡彩平 秦小麟.
空间数据挖掘研究综述

计算机科学, 2007, 34(5): 14-19.
[5] 郭平 范丽 叶莲.
空间规则的可视化解释

计算机科学, 2004, 31(5): 169-171.
[6] 何彬彬 方涛 郭达志.
基于不确定性的空间聚类

计算机科学, 2004, 31(11): 196-198.
[7] 甄彤 范艳峰.
基于Agent的分布式空间数据挖掘模型及实现

计算机科学, 2004, 31(10): 96-97.
[8] 肖予钦 景宁 吴秋云 钟志农.
空间数据挖掘关键问题研究

计算机科学, 2003, 30(9): 49-53.
[9] 文俊浩 李立新 吴中福 吴红艳.
基于邻接关系的空间趋势检测算法研究

计算机科学, 2003, 30(12): 123-125.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!