Computer Science ›› 2019, Vol. 46 ›› Issue (5): 150-156.doi: 10.11896/j.issn.1002-137X.2019.05.023

Special Issue: Database Technology

Previous Articles     Next Articles

DFTS:A Top-k Skyline Query for Large Datasets

WEI Liang1, LIN Zi-yu1, LAI Yong-xuan2,3   

  1. (School of Information Science and Engineering,Xiamen University,Xiamen,Fujian 361005,China)1
    (School of Software,Xiamen University,Xiamen,Fujian 361005,China)2
    (Shenzhen Research Institute,Xiamen University,Shenzhen,Guangdong 518000,China)3
  • Received:2018-07-12 Revised:2018-09-15 Published:2019-05-15

Abstract: Top-k Skyline query combines the features of Top-k query and Skyline,which can find the best object in the datasets.However,the available methods can not fit to large datasets well.An efficient Top-k Skyline query method called DFTS was proposed,which can perform well for large datasets.DFTS involves three steps.Firstly,the degreescore function is used to rank the dataset,and a large quantity of objects with low ranking will be filtered out.Secondly,DFTS makes a Skyline query upon the candidates and generates a Skyline subset.Finally,top-k objects with high ran-king will be selected from the Skyline subset as the final result.Through these steps,DFTS can significantly reduce the time cost.It is proved that the results of DFTS satisfy the demand of Top-k Skyline query.Extensive experimental results show that DFTS can achieve much better performance for large datasets than state-of-the-art methods.

Key words: Apache Spark, Skyline, Top-k

CLC Number: 

  • TP311
[1]BÖRZSÖNYI S,KOSSMANN D,STOCKER K.The Skyline operator[J].Data Engineering,2001,1:433-442.
[2]CHAN C Y,JAGADISH H V,TAN K L,et al.Finding k-dominant skylines in high dimensional space[C]∥Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data.Chicago:ACM,2006:503-514.
[3]SHARIFZADEH M,SHAHABI C.The spatial skyline queries[C]∥Proceedings of the 32nd International Conference on Very Large Data Bases.Seoul,Korea:VLDB Endowment,2006:751-762.
[4]MAN L Y,MAMOULIS N.Efficient processing of top-k dominating queries on multi-dimensional data[C]∥Proceedings of the 33rd International Conference on Very Large Data Bases.Vienna:VLDB Endowment,2007:483-494.
[5]ENDRES M,PREISINGER T.Behind the skyline[J].Procee-dings of DBKDA,2015,15.
[6]PREISINGER T,ENDRES M.Looking for the best,but not too many of them:multi-level and top-k skylines[J].Int.J.Adv.Softw,2015,8(3):4.
[7]SIDDIQUE M A,TIAN H,MORIMOTO Y.Selecting Representative Objects from Large Database by Using K-Skyband and Top-k Dominating Queries in MapReduce Environment[M]∥Advanced Data Mining and Applications.Springer International Publishing,2014:560-572.
[8]CHOMICKI J,GODFREY P,GRYZ J,et al.Skyline with presorting:Theory and optimizations[M]∥Intelligent Information Processing and Web Mining.Springer,Berlin,Heidelberg,2005:595-604.
[9]GODFREY P,SHIPLEY R,GRYZ J.Maximal vector computation in large data sets[C]∥Proceedings of the 31st International Conference on Very Large Data Bases.Trondheim:VLDB Endowment,2005:229-240.
[10]LEE K C K,ZHENG B,LI H,et al.Approaching the skyline in Z order[C]∥Proceedings of the 33rd International Conference on Very Large Data Bases.Vienna:VLDB Endowment,2007:279-290.
[11]LEE K C K,LEE W C,ZHENG B,et al.Z-SKY:an efficientskyline query processing framework based on Z-order[J].The VLDB Journal,2010,19(3):333-362.
[12]CHAUDHURI S,DALVI N,KAUSHIK R.Robust cardinality and cost estimation for skyline operator[C]∥22nd International Conference on Data Engineering (ICDE’06).Atlanta:IEEE,2006:64-64.
[13]ZHANG Z,YANG Y,CAI R,et al.Kernel-based skyline cardinality estimation[C]∥Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data.Providence:ACM,2009:509-522.
[14]TAn K L,Eng P K,Ooi B C.Efficient Progressive Skyline Computation[C]∥ International Conference on Very Large Data Bases.Rome:Morgan Kaufmann Publishers Inc.2001:301-310.
[15]KOSSMANN D,RAMSAK F,ROST S.Shooting stars in the sky:An online algorithm for skyline queries[C]∥Proceedings of the 28th International Conference on Very Large Data Bases.Hong Kong:VLDB Endowment,2002:275-286.
[16]PAPADIAS D,TAO Y,FU G,et al.Progressive skyline computation in database systems[J].Acm Transactions on Database Systems,2005,30(1):41-82.
[17]PAPADIAS D,TAO Y,FU G,et al.An optimal and progressive algorithm for skyline queries[C]∥Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data.San Diego:ACM,2003:467-478.
[1] YU Jian-ye, QI Yong, WANG Bao-zhuo. Distributed Combination Deep Learning Intrusion Detection Method for Internet of Vehicles Based on Spark [J]. Computer Science, 2021, 48(6A): 518-523.
[2] ZHU Run-ze, QIN Xiao-lin, LIU Jia-chen. Study on Why-not Problem in Skyline Query of Road Network Based on Query Object [J]. Computer Science, 2021, 48(6): 57-62.
[3] WANG Yan, HAN Xiao, ZENG Hui, LIU Jing-xin, XIA Chang-qing. Task Migration Node Selection with Reliable Service Quality in Edge Computing Environment [J]. Computer Science, 2020, 47(10): 240-246.
[4] ZHOU Jian-gang, QIN Xiao-lin, ZHANG Ke-heng, XU Jian-qiu. Dynamic Skyline Query for Multiple Mobile Users Based on Road Network [J]. Computer Science, 2019, 46(9): 73-78.
[5] QI Yu-dong,HE Cheng,SI Wei-chao. Cloud Resource Selection Algorithm by Skyline under MapReduce Frame [J]. Computer Science, 2018, 45(6A): 411-414.
[6] SUN Zhi, SUN Xue-jiao. Survey of Skyline Processing in P2P Environments [J]. Computer Science, 2018, 45(11A): 63-70.
[7] DONG Lei-gang, LIU Guo-hua and CUI Xiao-wei. PPQ:Finding Combinatorial Skyline Based on Partition [J]. Computer Science, 2018, 45(1): 267-272.
[8] DAI Hua, YE Qing-qun, YANG Geng, XIAO Fu and HE Rui-liang. Overview of Secure Top-k Query Processing in Two-tiered Wireless Sensor Networks [J]. Computer Science, 2017, 44(5): 6-13.
[9] TU Bing, PAN Jian-wu, WU Jian-hui, ZENG Xiang and CAO Xu. Research on Skyline Detection Based on Region Covariance and Median Filtering Algorithm [J]. Computer Science, 2017, 44(3): 313-317.
[10] YU Wei, ZHENG Ji-ping, WANG Hai-xiang, WANG Yong-ge, CHEN Jia-liang and JIANG Shun-qing. Spatial Skyline Queries:Applications,Research and Challenges [J]. Computer Science, 2017, 44(2): 1-16.
[11] LI Qing, XIAO Ying-yuan, WANG Xiao-ye and LI Yu-kun. Clustering Architecture-based Skyline Query Processing in Wireless Sensor Networks [J]. Computer Science, 2017, 44(10): 177-181.
[12] ZHENG Shi-min, QIN Xiao-lin, LIU Liang and ZHOU Qian. Algorithm for Top-K Keyword Query in Data Streams [J]. Computer Science, 2016, 43(8): 142-147.
[13] CUI Wen-xiang, XIAO Ying-yuan, HAO Gang, WANG Hong-ya and DENG Hua-feng. MapReduce-based Skyline Query Processing Algorithm [J]. Computer Science, 2016, 43(6): 35-38.
[14] GUO Chang-you, ZHENG Xue-feng and GAO Xiu-lian. Top-k Query Calculation of Uncertain Data Based on Uncertainty Theory [J]. Computer Science, 2016, 43(3): 225-230.
[15] ZHAO Fa-xin and JIN Yi-fu. Study on Skyline Query for Vague Database [J]. Computer Science, 2015, 42(8): 236-239.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!