Computer Science ›› 2025, Vol. 52 ›› Issue (11A): 241200119-9.doi: 10.11896/jsjkx.241200119

• Big Data & Data Science • Previous Articles     Next Articles

Fairness-enhancing Decision Tree Algorithm

JIANG Wenhui, YE Jianhong, GAO Lingting, HUANG Yifan   

  1. College of Computer Science and Technology,Huaqiao University,Xiamen,Fujian 361021,China
  • Online:2025-11-15 Published:2025-11-10
  • Supported by:
    Science and Technology Planning Project of Fujian Province,China(2024H0014(2024H01010100)).

Abstract: In the field of machine learning,the problem of intrinsic biases in models has received increasing attention,and these biases often originate from imbalances in the training data or flaws in the algorithm design,which lead to unfair treatment of certain groups in the prediction results.To address this problem,this paper proposes a fairness-enhanced decision tree algorithm,which effectively reduces the imbalance in the data by introducing a fairness preprocessing method,and changes the traditional decision tree splitting criterion by integrating classification accuracy and fairness in the splitting criterion of the decision tree.The proposed method aims to achieve the fair distribution of prediction results among different groups,reduce the bias in model decision-making,and ensure that all individuals are treated fairly.Experimental results show that the proposed method demonstrates good performance under multiple fairness metrics,significantly reduces the prediction bias among different groups,and exhibits stronger fairness bias-correction performance than the existing traditional algorithms.

Key words: Machine learning, Classification, Decision tree, Fairness, Preprocessing

CLC Number: 

  • TP391
[1]PESSACH D,SHMUELI E.A review on fairness in machinelearning[J].ACM Computing Surveys,2022,55(3):1-44.
[2]KANAKIS M E,KHALILI R,WANG L.Machine learning for computer systems and networking:A survey[J].ACM Computing Surveys,2022,55(4):1-36.
[3]CATON S,HAAS C.Fairness in machine learning:A survey[J].ACM Computing Surveys,2024,56(7):1-38.
[4]CALDERS T,VERWER S.Three naive bayes approac-hes fordiscrimination-free classification[J].Data Mining and Know-ledge Discovery,2010,21:277-292.
[5]KAMIRAN F,CALDERS T.Data preprocessing techn-iques for classification without discrimination[J].Knowledge and Information Systems,2012,33(1):1-33.
[6]CALDERS T,KAMIRAN F,PECHENIZKIY M.Building classifiers with independency constraints[C]//2009 IEEE International Conference on Data Mining Workshops.IEEE,2009:13-18.
[7]ZHANG W,TANG X,WANG J.On fairness-aware learning for non-discriminative decision-making [C]//2019 International Conference on Data Mining Workshops(ICDMW).IEEE,2019:1072-1079.
[8]CHAWLA N V,BOWYER K W,HALL L O,et al.SMOTE:synthetic minority over-sampling technique[J].Journal of Artificial Intelligence Research,2002,16:321-357.
[9]FRIEDLER S A,SCHEIDEGGER C,VENKATASUBRAMA-NI-AN S,et al.A comparative study of fairness-enhancing interventions in machine learning[C]//Proceedings of the Confe-rence on Fairness,Accounttability,and Transparency.2019:329-338.
[10]TAE K H,ROH Y,OH Y H,et al.Data cleaning for accurate,fair,and robust models:A big data-AI integration approach[C]//Proceedings of the 3rd International Workshop on Data Management for End-to-end Machine Learning.2019:1-4.
[11]GONZÁLEZ-ZELAYA V,SALAS J,MEGÍAS D,et al.Fair and private data preprocessing through microa-ggregation[J].ACM Transactions on Knowledge Discovery from Data,2023,18(3):1-24.
[12]ZAFAR M B,VALERA I,ROGRIGUEZM G,et al.Fairness constraints:Mechanisms for fair classification[C]//Artificial Intelligence and Statistics.PMLR,2017:962-970.
[13]LE QUY T,ROY A,IOSIFIDIS V,et al.A survey on datasets for fairness-aware machine learning[J].Wiley Interdisciplinary Reviews:Data Mining and Knowledge Discovery,2022,12(3):e1452.
[14]HARDT M,PRICE E,SREBRO N.Equality of opportunity in supervised learning[J].Advances in Neural Information Proces-sing Systems,2016,29:3323-3331.
[15]BERK R,HEIDARI H,JABBARI S,et al.Fairness in criminal justice risk assessments:The state of the art[J].Sociological Methods & Research,2021,50(1):3-44.
[16]ZAFAR M B,VALERA I,GOMEZ RODRIGUEZ M,et al.Fairness beyond disparate treatment & disparate impact:Lear-ning classification without disparate mistreatment[C]//Procee-dings of the 26th International Conference on World Wide Web.2017:1171-1180.
[17]LI Y,SUN H,WANGW H.Towards fair truth discovery from biased crowdsourced answers[C]//Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Disco-very & Data Mining.2020:599-607.
[18]KAMIRAN F,CALDERS T,PECHENIZKIY M.Discrimi-nation aware decision tree learning[C]//2010 IEEE International Conference on Data Mining.IEEE,2010:869-874.
[19]SPINELLI I,SCARDAPANE S,HUSSAIN A,et al.Fairdrop:Biased edge dropout for enhancing fairness in graph representation learning[J].IEEE Transactions on Artificial Intelligence,2021,3(3):344-354.
[20]TANGIRALA S.Evaluating the impact of GINI index and information gain on classification using decision tree classifier algorithm[J].International Journal of Advanced Computer Science and Applications,2020,11(2):612-619.
[1] WANG Yongquan, SU Mengqi, SHI Qinglei, MA Yining, SUN Yangfan, WANG Changmiao, WANG Guoyou, XI Xiaoming, YIN Yilong, WAN Xiang. Research Progress of Machine Learning in Diagnosis and Treatment of Esophageal Cancer [J]. Computer Science, 2025, 52(9): 4-15.
[2] LI Fang, WANG Jie. DACSNet:Dual Attention Mechanism and Classification Supervision Network for Breast Lesion Detection in Ultrasound Images [J]. Computer Science, 2025, 52(9): 54-61.
[3] LIU Leyuan, CHEN Gege, WU Wei, WANG Yong, ZHOU Fan. Survey of Data Classification and Grading Studies [J]. Computer Science, 2025, 52(9): 195-211.
[4] JIANG Rui, FAN Shuwen, WANG Xiaoming, XU Youyun. Clustering Algorithm Based on Improved SOM Model [J]. Computer Science, 2025, 52(8): 162-170.
[5] WANG Jia, XIA Ying, FENG Jiangfan. Few-shot Video Action Recognition Based on Two-stage Spatio-Temporal Alignment [J]. Computer Science, 2025, 52(8): 251-258.
[6] ZHANG Yuan, ZHANG Shengjie, LIU Lilong, QIAN Shengsheng. Research on Continual Social Event Classification Based on Continual Event Knowledge Network [J]. Computer Science, 2025, 52(8): 268-276.
[7] LONG Tie, XIAO Fu, FAN Weibei, HE Xin, WANG Junchang. Cubic+:Enhanced Cubic Congestion Control for Cross-datacenter Networks [J]. Computer Science, 2025, 52(8): 335-342.
[8] LI Maolin, LIN Jiajie, YANG Zhenguo. Confidence-guided Prompt Learning for Multimodal Aspect-level Sentiment Analysis [J]. Computer Science, 2025, 52(7): 241-247.
[9] DING Bowen, LU Tianliang, PENG Shufan, GENG Haoqi, YANG Gang. Face Forgery Algorithm Recognition Model Based on Multi-classification Dataset [J]. Computer Science, 2025, 52(7): 353-362.
[10] ZHANG Yuekang, SHE Yanhong. Hierarchical Classification with Multi-path Selection Based on Calculation of Correlation Degree of Granularity Categories in the Same Level [J]. Computer Science, 2025, 52(7): 119-126.
[11] LI Mengxi, GAO Xindan, LI Xue. Two-way Feature Augmentation Graph Convolution Networks Algorithm [J]. Computer Science, 2025, 52(7): 127-134.
[12] LONG Xiao, HUANG Wei, HU Kai. Bi-MI ViT:Bi-directional Multi-level Interaction Vision Transformer for Lung CT ImageClassification [J]. Computer Science, 2025, 52(6A): 240700183-6.
[13] RAN Qin, RUAN Xiaoli, XU Jing, LI Shaobo, HU Bingqi. Function Prediction of Therapeutic Peptides with Multi-coded Neural Networks Based on Projected Gradient Descent [J]. Computer Science, 2025, 52(6A): 240800024-6.
[14] WANG Chundong, ZHANG Qinghua, FU Haoran. Federated Learning Privacy Protection Method Combining Dataset Distillation [J]. Computer Science, 2025, 52(6A): 240500132-7.
[15] YANG Jixiang, JIANG Huiping, WANG Sen, MA Xuan. Research Progress and Challenges in Forest Fire Risk Prediction [J]. Computer Science, 2025, 52(6A): 240400177-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!