Computer Science ›› 2025, Vol. 52 ›› Issue (9): 232-240.doi: 10.11896/jsjkx.240700116

• Database & Big Data & Data Science • Previous Articles     Next Articles

Personalized Federated Learning Framework for Long-tailed Heterogeneous Data

WU Jiagao, YI Jing, ZHOU Zehui, LIU Linfeng   

  1. School of Computer,Nanjing University of Posts and Telecommunications,Nanjing 210023,China
    Jiangsu Key Laboratory of Big Data Security & Intelligent Processing,Nanjing 210023,China
  • Received:2024-07-18 Revised:2024-10-17 Online:2025-09-15 Published:2025-09-11
  • About author:WU Jiagao,born in 1969,Ph.D,associate professor,is a member of CCF(No.12107M).His main research interests include computer network,edge computing and artificial intelligence.
  • Supported by:
    National Natural Science Foundation of China(62272237,61872191).

Abstract: Aiming at the problem of model performance degradation in federated learning caused by long tail distribution and he-terogeneity of data,a novel personalized federated learning framework called Balanced Personalized Federated Learning(BPFed) is proposed,where the whole federated learning process is divided into two stages:representation learning based on personalized fe-derated learning and personalized classifiers retraining based on global feature augmentation.In the first stage,the Mixup strategy is adopted firstly for data augmentation,and then a feature extractor training method is proposed based on the personalized fede-rated learning with parameter decoupling to optimize the performance of the feature extractor while reducing the communication cost.In the second stage,a new class-level feature augmentation method based on global covariance matrix is proposed firstly,and then the classifiers of clientsare retaines individually in balance with a proposed label-aware smoothing loss function based on sample weight to correct the overconfidence for head classes and boost the generalization ability for tail classes.Extensive experimental results show that the model accuracy of BPFed is significantly improved compared with other representative related algorithms under different settings of data long-tailed distributions and heterogeneities.Moreover,the effectiveness of the proposed methods and optimization strategies is further verified by the experiments on ablation and hyperparameter influence.

Key words: Personalized federated learning, Long-tailed distribution, Data heterogeneity, Parameter decoupling, Feature augmentation, Optimization strategy

CLC Number: 

  • TP393
[1]MCENROE P,WANG S,LIYANAGE M.A survey on the convergence of edge computing and AI for UAVs:Opportunities and challenges[J].IEEE Internet of Things Journal,2022,9(17):15435-15459.
[2]MCMAHAN B,MOORE E,RAMAGE D,et al.Communica-tion-efficient learning of deep networks from decentralized data[C]//Proceedings of the20th International Conference on Artificial Intelligence and Statistics(AISTATS).New York:PMLR,2017:1273-1282.
[3]ZHANG Y,KANG B,HOOI B,et al.Deep long-tailed learning:A survey[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2023,45(9):10795-10816.
[4]KANG B,XIE S,ROHRBACH M,et al.Decoupling Representation and Classifier for Long-Tailed Recognition[C]//Procee-dings of International Conference on Learning Representations.Washington DC:ICLR,2020:1-16.
[5]LIN T,GOYAL P,GIRSHICK R,et al.Focalloss for dense object detection[C]//Proceedings of International Conference on Computer Vision(ICCV).Piscataway,NJ:IEEE,2017:2980-2988.
[6]ZHANG H,CISSE M,DAUPHIN Y N,et al.Mixup:Beyond empirical risk minimization[C]//Proceedings of International Conference on Learning Representations.Washington DC:ICLR,2018:1-13.
[7]YIN X,YU X,SOHN K,et al.Featuretransfer learning for face recognition with under-represented data[C]//Proceedings of International Conference on Computer Vision and Pattern Re-cognition(CVPR).Piscataway,NJ:IEEE,2019:5704-5713.
[8]SHANG X,LU Y,HUANG G,et al.Federatedlearning on he-terogeneous and long-tailed data via classifier re-training with federated features[C]//Proceedings of International Joint Conference on Artificial Intelligence.Freiburg:IJCAI,2022:2218-2224.
[9]SHUAI X,SHEN Y,JIANG S,et al.BalanceFL:Addressingclass imbalance in long-tail federated learning[C]//Proceedings of the 21st ACM/IEEE International Conference on Information Processing in Sensor Networks(IPSN).Piscataway,NJ:IEEE,2022:271-284.
[10]ARIVAZHAGAN M G,AGGARWAL V,SINGH A K,et al.Federated lear-ning with personalization layers[J].arXiv:1912.00818,2019.
[11]LU Y,QIAN P,HUANG G,et al.Personalized federated lear-ning on long-tailed data via adversarial feature augmentation[C]//Proceedings of 2023 IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP).Piscataway,NJ:IEEE,2023:1-5.
[12]ZHONG Z,CUI J,LIU S,et al.Improving Calibration for Long-Tailed Recognition[C]//Proceedings of International Confe-rence on Computer Vision and Pattern Recognition(CVPR).Piscataway,NJ:IEEE,2021:16489-16498.
[13]WANG X M,FANG R.Device selection in federated learningunder class imbalance[J].Application Research of Computers,2021,38(10):2968-2973.
[14]WU F,SONG Y B,JI Y M,et al.Contribution-based federated learning approach for global imbalanced problem[J].Computer Science,2023,50(12):343-348.
[15]LI T,SAHU A K,ZAHEER M,et al.Federated Optimization inHeterogeneous Networks[C]//Proceedings of International Conference on Machine Learning Systems(ICML).New York:IMLS,2020:429-450.
[16]KARIMIREDDY S P,KALE S,MOHRI M,et al.Scaffold:Stochastic controlled averaging for federated learning[C]//Proceedings of International Conference on Machine Learning(ICML).New York:IMLS,2020:5132-5143.
[17]LI Q,HE B,SONG D,et al.Model-Contrastive FederatedLearning[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Piscataway,NJ:IEEE,2021:10713-10722.
[18]ACAR D A E,ZHAO Y,NAVARRO R M,et al.FederatedLearning Based on Dynamic Regularization[C]//Proceedings of International Conference on Learning Representations.Wa-shington DC:ICLR,2021:1-14.
[19]CHAI Z,ALI A,ZAWAD S,et al.Tifl:A tier-based federated learning system[C]//Proceedings of the 29th International Symposium on High-Performance Parallel and Distributed Computing(HPDC).New York:ACM,2020:125-136.
[20]ZHAO Y,LI M,LAI L,et al.Federated learning with non-IID data[J].arXiv:1806.00582,2018.
[21]LUO M,CHEN F,HU D,et al.No fear of heterogeneity:Classifier calibration for federated learning with non-IID data[J].Advances in Neural Information Processing Systems,2021,34:5972-5984.
[22]LIU Y,WU X,LIU C K.Research onpersonalized federatedlearning algorithm in industrial Internet of things[J].Journal of Chinese Computer Systems,2025,46(1):209-216.
[23]KRIZHEVSKY A.Learning multiple layers of features from tiny images[R].University of Toronto,2009.
[24]CAO K,WEI C,GAIDON A,et al.Learningimbalanced datasets with label-distribution-aware margin loss[C]//Proceedings of the 33rd Conference on Neural Information Processing Systems(NeurIPS).New York:PMLR,2019:1-12.
[25]HSU T M H,QI H,BROWN M.Measuring the effects of non-identical data distribution for federated visual classification[J].arXiv:1909.06335,2019.
[26]HE K,ZHANG X,REN S,et al.Deepresidual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Piscataway,NJ:IEEE,2016:770-778.
[1] REN Qingxin, FENG Feng. Hippo Optimization Algorithm Improved by Multi-strategy and Multi-dimensional Fusion [J]. Computer Science, 2025, 52(6A): 240400145-8.
[2] WANG Chundong, ZHANG Qinghua, FU Haoran. Federated Learning Privacy Protection Method Combining Dataset Distillation [J]. Computer Science, 2025, 52(6A): 240500132-7.
[3] XU Yicheng, DAI Chaofan, MA Wubin, WU Yahui, ZHOU Haohao, LU Chenyang. Particle Swarm Optimization-based Federated Learning Method for Heterogeneous Data [J]. Computer Science, 2024, 51(6): 391-398.
[4] WANG Zihang, YANG Min, WEI Zichong. Application of Parameter Decoupling in Differentially Privacy Protection Federated Learning [J]. Computer Science, 2024, 51(11): 379-388.
[5] LI Yang, HAN Ping. Human Parsing Model Combined with Regional Sampling and Inter-class Loss [J]. Computer Science, 2023, 50(4): 103-109.
[6] WANG Bing, WU Hong-liang, NIU Xin-zheng. Robot Path Planning Based on Improved Potential Field Method [J]. Computer Science, 2022, 49(7): 196-203.
[7] ZHOU Hai-yu, ZHANG Dao-qiang. Multi-site Hyper-graph Convolutional Neural Networks and Application [J]. Computer Science, 2022, 49(3): 129-133.
[8] WU Lan, WANG Han, LI Bin-quan. Unsupervised Domain Adaptive Method Based on Optimal Selection of Self-supervised Tasks [J]. Computer Science, 2021, 48(6A): 357-363.
[9] ZHANG Li and YU Lei. Who Can Collaborate New Users in Recommendation System? [J]. Computer Science, 2015, 42(Z11): 80-82.
[10] GE Yu,LIANG Jing and WANG Xue-ping. Improved Artificial Bee Colony Algorithms Based on Extremal Optimization Strategy [J]. Computer Science, 2013, 40(6): 247-251.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!