Computer Science ›› 2026, Vol. 53 ›› Issue (6): 367-375.doi: 10.11896/jsjkx.260200107

• Computer Network • Previous Articles     Next Articles

Research on Hierarchical Fountain Codes for Multi-Availability-Zone Cloud Storage

SUN Jing, WANG Yi, CHEN Haiyan   

  1. Department of Intelligent Science and Information Law,East China University of Political Science and Law,Shanghai 201620,China
  • Received:2026-02-27 Revised:2026-04-16 Online:2026-06-15 Published:2026-06-09
  • About author:SUN Jing,born in 1985,Ph.D,assistant research fellow,is a member of CCF(No.30246M).Her main research interests include distributed storage systems and cloud storage systems.

Abstract: In modern multi-availability zone(Multi-AZ) cloud storage systems,failures exhibit significant hierarchical and correlated characteristics.Traditional erasure codes(e.g.,RS codes) or flat Fountain code schemes often overlook the physical topology,leading to high cross-AZ repair bandwidth for frequent local failures and poor flexibility in heterogeneous networks.To address this,this paper proposes a hierarchical AZ-Fountain coding model tailored for Multi-AZ environments.This model explicitly couples the rateless property of Fountain codes with the Multi-AZ failure domain structure,introducing a two-layer coding architecture:utilizing an AZ-optimized Bi-modal distribution(AZ-OBMD) locally to achieve zero cross-AZ traffic for local repairs,and an AZ-Raptor-link distribution(AZ-RLD) globally to ensure efficient recovery during AZ-level disasters.Experimental results show that,comparing to RS codes,LRC,and the recent AZ-Code,the proposed scheme maintains zero cross-AZ traffic for local block failures similar to LRC,while significantly reducing the number of participating blocks and local repair latency.Furthermore,it significantly lowers repair latency and transmission overhead during AZ-level disaster recovery,achieving a systematic trade-off between reliability and repair cost.

Key words: Multi-AZ, Distributed storage, Fountain codes, Hierarchical coding, Cross-AZ repair bandwidth

CLC Number: 

  • TP391.4
[1]REED I S,SOLOMON G.Polynomial codes over certain finite fields[J].Journal of the Society for Industrial and Applied Mathematics,1960,6(8):300-304.
[2]LUBY M.LT codes[C]//Proceedings of The 43rd Annual IEEE Symposium on Foundations of Computer Science.2002:271-282.
[3]SHOKROLLAHI A.Raptor codes[J].IEEE Transactions onInformation Theory,2006,52(6):2551-2567.
[4]CAO Y,WANG X,LI S.Hierarchical Distribution for Fountain Codes in Geo-Distributed Storage Systems[J].IEEE Transactions on Cloud Computing,2022,10(1):214-226.
[5]HUANG C,SIMITCI H,XU Y,et al.Erasure coding in Win-dows Azure Storage [C]//Proceedings of the 2012 USENIX Annual Technical Conference(ATC).USENIX Association,2012:15-26.
[6]HUANGC,CHEN M,LI J.Pyramid Codes:Flexible Schemes to Trade Space for Access Efficiency in Reliable Data Storage Systems[J].ACM Transactions on Storage,2013,9(1):1-28.
[7]RASHMI K V,SHAH N B.A hitchhiker's guide to fast and efficient data reconstruction in erasure-coded data centers[C]//Proceedings of the ACM SIGCOMM Conference.2014:331-342.
[8]SHEN Z,SHU J,LEE P P C.Revisiting Locality in Erasure-Co-ded Storage Systems:An Asymptotic Perspective[C]//Procee-dings of IEEE International Symposium on Information Theory(ISIT).2016.
[9]LI J,LI B.Erasure Coding for Cloud Storage Systems:A Survey[J].Tsinghua Science and Technology,2013,18(3):259-272.
[10]CHEN Z,ZHANG Y,LEE P P C.Ursa:Hybrid Block-Level and Parity-Level Erasure Coding to Mitigate Long-Tail Latency in Geo-Distributed Storage[C]//Proceedings of the 51st International Conference on Parallel Processing(ICPP).2022:1-11.
[11]DIMAKIS A G,GODFREY P B,WU Y,et al.Network Coding for Distributed Storage Systems[J].IEEE Transactions on Information Theory,2010,56(9):4539-4551.
[12]XIE X,WU C,GU J,et al.AZ-Code:An Efficient AvailabilityZone Level Erasure Code to Provide High Fault Tolerance in Cloud Storage Systems[C]//Proceedings of the 35th Sympo-sium on Mass Storage Systems and Technologies(MSST).2019.
[13]HU S,ZHOU Y,CHEN X,Hierarchical Regenerating Codeswith Optimal Repair Bandwidth for Multi-Rack Storage Systems[J].IEEE Transactions on Communications,2024,72(1):120-133.
[14]XU X,ZHANG Y.ACH-Code:Adaptive Cross-tier Hierarchical Erasure Coding for Heterogeneous Data Centers[C]//Procee-dings of IEEE INFOCOM.2024.
[15]LIU Y,LI J,LIEW S C.Bandwidth-Adaptive Erasure Coding forHeterogeneous Data Centers[J].IEEE Transactions on Mobile Computing,2023,22(10):5821-5835.
[16]ZHOU J,LUI K C S,ZHAO J.Topology-Aware Rateless Co-ding for Dynamic Edge Storage Environments[C]//Proceedings of IEEE INFOCOM.2022:1459-1468.
[17]LIANG J,LUI J C S.Geo-Distributed Storage Systems withRateless Codes[C]//Proceedings of IEEE International Confe-rence on Distributed Computing Systems(ICDCS).2023.
[18]KARZAND M,LEITH D J,WANG J,et al.On the Decoding Probability of Finite-Length Fountain Codes[J].IEEE Transactions on Information Theory,2020,66(1):190-204.
[19]HAYAJNEH K F,SCAGLIONE S V.Analysis of Ripple Size in Luby Transform Codes[J].IEEE Access,2020,8:12345-12356.
[20]DIMAKIS A G,KARZAND M.Finite-Length Analysis of Rateless Codes with Application to Distributed Learning[J].IEEE Journal on Selected Areas in Information Theory,2023,3(2):210-222.
[1] HAN Lei, LI Mouxing, WU Zheng, FAN Weibei, QIAN Xiaoyan. Barrier-based Network Storage Ordering Method [J]. Computer Science, 2026, 53(6): 350-357.
[2] YE Miao, WANG Jue, JIANG Qiuxiang, WANG Yong. SDN-based Integrated Communication and Storage Edge In-network Storage Node Selection Method [J]. Computer Science, 2025, 52(8): 343-353.
[3] SUN Shiquan, YE Miao, ZHU Cheng, WANG Yong, JIANG Qiuxiang. Performance Optimization of Wireless Edge Storage System Based on SDN and Drone Assistance in Disaster Scenarios [J]. Computer Science, 2025, 52(11): 306-319.
[4] DU Qingpeng, XU Yinlong, WU Si. Stripe Matching and Merging Algorithm-based Redundancy Transition for Locally Repairable Codes [J]. Computer Science, 2023, 50(12): 89-96.
[5] ZHANG Hang, TANG Dan, CAI Hong-liang. Study on Predictive Erasure Codes in Distributed Storage System [J]. Computer Science, 2021, 48(5): 130-139.
[6] ZHANG Xiao, ZHANG Si-meng, SHI Jia, DONG Cong, LI Zhan-huai. Review on Performance Optimization of Ceph Distributed Storage System [J]. Computer Science, 2021, 48(2): 1-12.
[7] ZHONG Feng-yan, WANG Yan, LI Nian-shuang. Node Selection Scheme for Data Repair in Heterogeneous Distributed Storage Systems [J]. Computer Science, 2019, 46(8): 35-41.
[8] LI Peng-yuan,ZHANG Zhi-yong. Design of Storage Platform for Large Scale Data Based on SWIFT System [J]. Computer Science, 2018, 45(6A): 601-605.
[9] WANG Jun-sheng, LI Li-li, YAN Yong, ZHAO Wei, XU Yu. Security Incidents and Solutions of Blockchain Technology Application [J]. Computer Science, 2018, 45(6A): 352-355.
[10] WANG Qing-yun and CHENG Chun-ling. Mobile SNS Data Dynamic Partitioning and Replication Algorithm Based on Location Information [J]. Computer Science, 2017, 44(3): 220-225.
[11] ZHU Kang-lin. Application of Distributed Virtualized Storage in Public Security College [J]. Computer Science, 2016, 43(Z6): 571-576.
[12] WANG Jing, LUO Wei, OUYANG Ming-sheng, JIANG Can and WANG Xin-mei. Segmentation Coding Scheme Based on Simple Regenerating Codes [J]. Computer Science, 2016, 43(8): 148-153.
[13] DONG Shu-jian, WANG Jing-bin and CHEN Yuan. HMSST+:HMSST Algorithm Optimization Based on Distributed Memory Database [J]. Computer Science, 2016, 43(3): 220-224.
[14] ZHANG Xiao-yuan,LIU Li-ren and HAN Hai-wen. Supervision Collaboration Platform Based on SOA and Cloud Computing Technologies [J]. Computer Science, 2014, 41(Z11): 473-477.
[15] . Fine-grained Parallel Multi-pattern Matching for Backbone Network NIDS [J]. Computer Science, 2013, 40(3): 74-76.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!