Computer Science ›› 2025, Vol. 52 ›› Issue (11A): 241200119-9.doi: 10.11896/jsjkx.241200119

• Big Data & Data Science • Previous Articles     Next Articles

Fairness-enhancing Decision Tree Algorithm

JIANG Wenhui, YE Jianhong, GAO Lingting, HUANG Yifan   

  1. College of Computer Science and Technology,Huaqiao University,Xiamen,Fujian 361021,China
  • Online:2025-11-15 Published:2025-11-10
  • About author:JIANG Wenhui,born in 2000,postgra-duate.Her main research interests include machine learning and so on.
    YE Jianhong,born in 1976,Ph.D,associate professor,master supervisor.His main research interests include process mining and so on.
  • Supported by:
    Science and Technology Planning Project of Fujian Province,China(2024H0014(2024H01010100)).

Abstract: In the field of machine learning,the problem of intrinsic biases in models has received increasing attention,and these biases often originate from imbalances in the training data or flaws in the algorithm design,which lead to unfair treatment of certain groups in the prediction results.To address this problem,this paper proposes a fairness-enhanced decision tree algorithm,which effectively reduces the imbalance in the data by introducing a fairness preprocessing method,and changes the traditional decision tree splitting criterion by integrating classification accuracy and fairness in the splitting criterion of the decision tree.The proposed method aims to achieve the fair distribution of prediction results among different groups,reduce the bias in model decision-making,and ensure that all individuals are treated fairly.Experimental results show that the proposed method demonstrates good performance under multiple fairness metrics,significantly reduces the prediction bias among different groups,and exhibits stronger fairness bias-correction performance than the existing traditional algorithms.

Key words: Machine learning, Classification, Decision tree, Fairness, Preprocessing

CLC Number: 

  • TP391
[1]PESSACH D,SHMUELI E.A review on fairness in machinelearning[J].ACM Computing Surveys,2022,55(3):1-44.
[2]KANAKIS M E,KHALILI R,WANG L.Machine learning for computer systems and networking:A survey[J].ACM Computing Surveys,2022,55(4):1-36.
[3]CATON S,HAAS C.Fairness in machine learning:A survey[J].ACM Computing Surveys,2024,56(7):1-38.
[4]CALDERS T,VERWER S.Three naive bayes approac-hes fordiscrimination-free classification[J].Data Mining and Know-ledge Discovery,2010,21:277-292.
[5]KAMIRAN F,CALDERS T.Data preprocessing techn-iques for classification without discrimination[J].Knowledge and Information Systems,2012,33(1):1-33.
[6]CALDERS T,KAMIRAN F,PECHENIZKIY M.Building classifiers with independency constraints[C]//2009 IEEE International Conference on Data Mining Workshops.IEEE,2009:13-18.
[7]ZHANG W,TANG X,WANG J.On fairness-aware learning for non-discriminative decision-making [C]//2019 International Conference on Data Mining Workshops(ICDMW).IEEE,2019:1072-1079.
[8]CHAWLA N V,BOWYER K W,HALL L O,et al.SMOTE:synthetic minority over-sampling technique[J].Journal of Artificial Intelligence Research,2002,16:321-357.
[9]FRIEDLER S A,SCHEIDEGGER C,VENKATASUBRAMA-NI-AN S,et al.A comparative study of fairness-enhancing interventions in machine learning[C]//Proceedings of the Confe-rence on Fairness,Accounttability,and Transparency.2019:329-338.
[10]TAE K H,ROH Y,OH Y H,et al.Data cleaning for accurate,fair,and robust models:A big data-AI integration approach[C]//Proceedings of the 3rd International Workshop on Data Management for End-to-end Machine Learning.2019:1-4.
[11]GONZÁLEZ-ZELAYA V,SALAS J,MEGÍAS D,et al.Fair and private data preprocessing through microa-ggregation[J].ACM Transactions on Knowledge Discovery from Data,2023,18(3):1-24.
[12]ZAFAR M B,VALERA I,ROGRIGUEZM G,et al.Fairness constraints:Mechanisms for fair classification[C]//Artificial Intelligence and Statistics.PMLR,2017:962-970.
[13]LE QUY T,ROY A,IOSIFIDIS V,et al.A survey on datasets for fairness-aware machine learning[J].Wiley Interdisciplinary Reviews:Data Mining and Knowledge Discovery,2022,12(3):e1452.
[14]HARDT M,PRICE E,SREBRO N.Equality of opportunity in supervised learning[J].Advances in Neural Information Proces-sing Systems,2016,29:3323-3331.
[15]BERK R,HEIDARI H,JABBARI S,et al.Fairness in criminal justice risk assessments:The state of the art[J].Sociological Methods & Research,2021,50(1):3-44.
[16]ZAFAR M B,VALERA I,GOMEZ RODRIGUEZ M,et al.Fairness beyond disparate treatment & disparate impact:Lear-ning classification without disparate mistreatment[C]//Procee-dings of the 26th International Conference on World Wide Web.2017:1171-1180.
[17]LI Y,SUN H,WANGW H.Towards fair truth discovery from biased crowdsourced answers[C]//Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Disco-very & Data Mining.2020:599-607.
[18]KAMIRAN F,CALDERS T,PECHENIZKIY M.Discrimi-nation aware decision tree learning[C]//2010 IEEE International Conference on Data Mining.IEEE,2010:869-874.
[19]SPINELLI I,SCARDAPANE S,HUSSAIN A,et al.Fairdrop:Biased edge dropout for enhancing fairness in graph representation learning[J].IEEE Transactions on Artificial Intelligence,2021,3(3):344-354.
[20]TANGIRALA S.Evaluating the impact of GINI index and information gain on classification using decision tree classifier algorithm[J].International Journal of Advanced Computer Science and Applications,2020,11(2):612-619.
[1] WANG Jinghong, LI Pengchao, WANG Xizhao, ZHANG Zili. Dual-channel Graph Neural Network Based on KAN [J]. Computer Science, 2026, 53(3): 188-196.
[2] QIN Jing, LI Guanfeng, CHEN Yuyin, XIAO Yuhang. Embedding Model of Knowledge Graph via Jointly Modeling Ontology and Instances [J]. Computer Science, 2026, 53(3): 331-340.
[3] CHEN Han, XU Zefeng, JIANG Jiu, FAN Fan, ZHANG Junjian, HE Chu, WANG Wenwei. Large Language Model and Deep Network Based Cognitive Assessment Automatic Diagnosis [J]. Computer Science, 2026, 53(3): 41-51.
[4] GE Zeqing, HUANG Shengjun. Semi-supervised Learning Method for Multi-label Tabular Data [J]. Computer Science, 2026, 53(3): 151-157.
[5] CHEN Lin, MA Longxuan, ZHANG Yongbing, HUANG Yuxin, GAO Shengxiang, YU Zhengtao. Industrial Text Classification for Chinese and Vietnamese Based on Prompt Learning and AdaptiveLoss Weighting [J]. Computer Science, 2026, 53(2): 312-321.
[6] JIANG Lei, WANG Zi, YANG Rong, HAN Wanglin. Human Motion Recognition Algorithm Based on Wearable Sensors [J]. Computer Science, 2026, 53(2): 342-348.
[7] WANG Xinyu, SONG Xiaomin, ZHENG Huiming, PENG Dezhong, CHEN Jie. Contrastive Learning-based Masked Graph Autoencoder [J]. Computer Science, 2026, 53(2): 145-151.
[8] JIA Jingdong, HOU Xin, WANG Zhe, HUANG Jian. Research on User Data-driven App Fading Functions [J]. Computer Science, 2026, 53(1): 262-270.
[9] XUE Jingyan, XIA Jianan, HUO Ruili, LIU Jie, ZHOU Xuezhong. Review of Retinal Image Analysis Methods for OCT/OCTA Based on Deep Learning [J]. Computer Science, 2026, 53(1): 128-140.
[10] WANG Yongquan, SU Mengqi, SHI Qinglei, MA Yining, SUN Yangfan, WANG Changmiao, WANG Guoyou, XI Xiaoming, YIN Yilong, WAN Xiang. Research Progress of Machine Learning in Diagnosis and Treatment of Esophageal Cancer [J]. Computer Science, 2025, 52(9): 4-15.
[11] LI Fang, WANG Jie. DACSNet:Dual Attention Mechanism and Classification Supervision Network for Breast Lesion Detection in Ultrasound Images [J]. Computer Science, 2025, 52(9): 54-61.
[12] LIU Leyuan, CHEN Gege, WU Wei, WANG Yong, ZHOU Fan. Survey of Data Classification and Grading Studies [J]. Computer Science, 2025, 52(9): 195-211.
[13] JIANG Rui, FAN Shuwen, WANG Xiaoming, XU Youyun. Clustering Algorithm Based on Improved SOM Model [J]. Computer Science, 2025, 52(8): 162-170.
[14] WANG Jia, XIA Ying, FENG Jiangfan. Few-shot Video Action Recognition Based on Two-stage Spatio-Temporal Alignment [J]. Computer Science, 2025, 52(8): 251-258.
[15] ZHANG Yuan, ZHANG Shengjie, LIU Lilong, QIAN Shengsheng. Research on Continual Social Event Classification Based on Continual Event Knowledge Network [J]. Computer Science, 2025, 52(8): 268-276.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!