Computer Science ›› 2019, Vol. 46 ›› Issue (10): 202-208.doi: 10.11896/jsjkx.180901623

• Software & Database Technology • Previous Articles     Next Articles

Data Replicas Distribution Transition Strategy in Cloud Storage System

WU Xiu-guo, LIU Cui   

  1. (School of Management Science and Engineering,Shandong University of Finance and Economics,Jinan 250014,China)
  • Received:2018-09-03 Revised:2019-01-29 Online:2019-10-15 Published:2019-10-21

Abstract: Replication is a common method used to improve the data access reliability and system fault tolerance in the cloud storage system.It is one of the most important topics to reschedule the replicas distribution dynamically according to the changes of users’ requirements and environment in the replicas management.However,current replicas redistribution strategies mostly focus on the new replicas schemes,such as replicas number and their placements,on the pre-mise that it can be completed automatically,without taking into account the task scheduling problem in practice.In fact,data replica distribution transition is a complex scheduling problem involving data replicas migration and deletion among data centers.In addition,the required disk space and time of different scheduling strategies have large differences,lea-ding to big difference in cost and efficiency.In this way,this paper first proposed a data replicas distribution transition model and feasibility analysis in the cloud storage environment.Also a minimum-cost data replicas distribution transition problem definition was presented,and then its complexity was proven based on 0-1 Knapsack problem.Besides random strategy,three transition strategies (MTCF,MOCF and MTCFSD) were given from minimum-cost view.In the end,a series of experiments were performed on CloudSim simulation platform.The results show that nearly 60% of transmission number and 50% of transmission cost are reduced compared with other methods,indicating the proposed method’sreliability and effectiveness,so as to further improve the cloud storage system performance.

Key words: Cloud storage system, Cost, Distribution transition, Replicas, Tasks scheduling

CLC Number: 

  • TP391
[1]SUN X,LI Q Z,ZHAO P,et al.An optimized replica distribution method for peer-to-peer network[J].Chinese Journal of Computers,2014,37(6):1424-1434.(in Chinese)
孙新,李庆洲,赵璞,等.对等网络中一种优化的副本分布方法[J].计算机学报,2014,37(6):1424-1434.
[2]POLATO I,RÉ R,GOLDMAN A,et al.A comprehensive view of Hadoop research-a systematic literature review[J].Journal of Network & Computer Applications,2014,46:1-25.
[3]HAMROUNI T,SLIMANI S,CHARRADA F B.A survey of dynamic replication and replica selection strategies based on data mining techniques in data grids[J].Engineering Applications of Artificial Intelligence,2016,48(C):140-158.
[4]WU X.Data sets replicas placements strategy from cost-effective view in the cloud[J].Scientific Programming,2016(11):1-13.
[5]SAADAT N,RAHMANI A M.PDDRA:A new pre-fetching based dynamic data replication algorithm in data grids[J].Future Generation Computer Systems,2012,28(4):666-681.
[6]YANG X L,QIAN C,ZHU F X.Evaluation method of big data service based on cloud computing[J].Computer Science,2018,45(5):295-299.(in Chinese)
阳小兰,钱程,朱福喜.基于云计算的大数据服务资源评价方法[J].计算机科学,2018,45(5):295-299.
[7]LIU W,PENG S,DU W,et al.Security-aware intermediate data placement strategy in scientific cloud workflows[J].Knowledge &Information Systems,2014,41(2):423-447.
[8]LI W,YANG Y,YUAN D.Ensuring Cloud data reliability with minimum replication by proactive replica checking[J].IEEE Transactions on Computers,2016,65(5):1494-1506.
[9]GRACE R K,MANIMEGALAI R.Dynamic replica placement and selection strategies in data grids-A comprehensive survey[J].Journal of Parallel & Distributed Computing,2014,74(2):2099-2108.
[10]WU X G.Minimum cost data replicas distribution based on dynamic programming in the cloud storage system[J].Computer Engineering,2017,43(7):29-37.(in Chinese)
吴修国.云存储系统中基于动态规划的最小开销数据副本布局研究[J].计算机工程,2017,43(7):29-37.
[11]LIU G,LI J,XU J.An Improved Min-Min Algorithm in Cloud Computing[C]//Proceedings of the 2012 International Conference of Modern Computer Science and Applications.Springer Berlin Heidelberg,2013:47-52.
[12]MA J,LI W,FU T,et al.A Novel Dynamic Task Scheduling Algorithm Based on Improved Genetic Algorithm in Cloud Computing[M]//Wireless Communications,Networking and Applications.Springer India,2016:184-186.
[13]TANG Y T,HUANG J,XIAO Q.Task scheduling algorithm for MapReduce based on DAG[J].Computer Science,2014,41(S1):42-46.(in Chinese)
唐一韬,黄晶,肖球.一种基于DAG的MapReduce任务调度算法[J].计算机科学,2014,41(S1):42-46.
[14]KHULLER S,KIM Y A,MALEKIAN A.Improved Approximation Algorithms for Data Migration[J].Algorithmica,2012,63(1/2):347-362.
[15]LU P,ZHANG L,LIU X,et al.Highly efficient data migration and backup for big data applications in elastic optical inter-data-center networks[J].Network IEEE,2015,29(5):36-42.
[16]ANDRONIKOU V,MAMOURAS K,TSERPES K,et al.Dy-namic Qos-aware data replication in grid environments based on data “importance”[J].Future Generation Computer Systems,2012,28(3):544-553.
[17]ATENIESE G,BURNS R,CURTMOLA R,et al.Provable data possession at untrusted stores[C]//Proceedings of ACM Conference on Computer and Communications Security.New York:ACM Press,2007:598-609.
[18]LOUKOPOULOS T,TZIRITAS N,LAMPSAS P,et al.Implementing replica placements:feasibility and cost minimization[C]//Proceedings of Parallel and Distributed Processing Symposium.New York:IEEE Press,2007:1-10.
[19]CALHEIROS R N,RANJAN R,BELOGLAZOV A,et al. Cloudsim:a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms[J].Software Practice & Experience,2011,41(1):23-50.
[1] SHAO Zi-hao, YANG Shi-yu, MA Guo-jie. Foundation of Indoor Information Services:A Survey of Low-cost Localization Techniques [J]. Computer Science, 2022, 49(9): 228-235.
[2] LI Jing-tai, WANG Xiao-dan. XGBoost for Imbalanced Data Based on Cost-sensitive Activation Function [J]. Computer Science, 2022, 49(5): 135-143.
[3] YAN Lei, ZHANG Gong-xuan, WANG Tian, KOU Xiao-yong, WANG Guo-hong. Scheduling Algorithm for Bag-of-Tasks with Due Date Constraints on Hybrid Clouds [J]. Computer Science, 2022, 49(5): 244-249.
[4] ZUO Yuan-lin, GONG Yue-jiao, CHEN Wei-neng. Budget-aware Influence Maximization in Social Networks [J]. Computer Science, 2022, 49(4): 100-109.
[5] WANG Zi-yin, LI Lei-jun, MI Ju-sheng, LI Mei-zheng, XIE Bin. Attribute Reduction of Variable Precision Fuzzy Rough Set Based on Misclassification Cost [J]. Computer Science, 2022, 49(4): 161-167.
[6] HUANG Ying-qi, CHEN Hong-mei. Cost-sensitive Convolutional Neural Network Based Hybrid Method for Imbalanced Data Classification [J]. Computer Science, 2021, 48(9): 77-85.
[7] LUAN Ling, PAN Lian-wu, YAN Lei, WU Xiao-lin. Research on Intelligent Control Technology of Accurate Cost for Unit Confirmation in All Links of Power Transmission and Transformation Project Based on Edge Computing [J]. Computer Science, 2021, 48(11A): 688-692.
[8] LU Shu-xia, ZHANG Zhen-lian. Imbalanced Data Classification of AdaBoostv Algorithm Based on Optimum Margin [J]. Computer Science, 2021, 48(11): 184-191.
[9] LIU Jing, FANG Xian-wen. Mining Method of Business Process Change Based on Cost Alignment [J]. Computer Science, 2020, 47(7): 78-83.
[10] WU Chong-ming, WANG Xiao-dan, XUE Ai-Jun and LAI Jie. Multiclass Cost-sensitive Classification Based on Error Correcting Output Codes [J]. Computer Science, 2020, 47(6A): 89-94.
[11] XIANG Wei, WANG Xin-wei. Imbalance Data Classification Based on Model of Multi-class Neighbourhood Three-way Decision [J]. Computer Science, 2020, 47(5): 103-109.
[12] SUN Yong-yue, LI Hong-yan, ZHANG Jin-bo. RAISE:Efficient Influence Cost Minimizing Algorithm in Social Network [J]. Computer Science, 2019, 46(9): 59-65.
[13] FANG Xu-yuan, TIAN Hong-xin, SUN De-chun, DU Wen-cong, QI Ting. Utility Function Heterogeneous Network Access Algorithm Based on Green Energy Perception [J]. Computer Science, 2019, 46(8): 127-132.
[14] ZHANG Lei, HU Bo-wen, ZHANG Ning, WANG Mao-sen. Global Residual Recursive Network for Image Super-resolution [J]. Computer Science, 2019, 46(6A): 230-233.
[15] YE Fu-ming, LI Wen-ting, WANG Ying. MC2ETS:An Energy-efficient Tasks Scheduling Algorithm in Mobile Cloud Computing [J]. Computer Science, 2019, 46(6): 135-142.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!