NeuronSup: Deep Model Debiasing Based on Bias Neuron Suppression

Computer Science, 2023, Vol. 50, Issue (11): 122-131. doi: 10.11896/jsjkx.220900169

Section: Database & Big Data & Data Science

NI Hongjie¹, LIU Jiawei¹, ZHENG Haibin¹,², CHEN Yipeng¹, CHEN Jinyin¹,²

1 College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
2 Institute of Cyberspace Security, Zhejiang University of Technology, Hangzhou 310023, China

Received: 2022-09-18 Revised: 2023-01-06 Online: 2023-11-15 Published: 2023-11-06
About author: NI Hongjie, born in 1978, Ph.D, Ph.D supervisor. His main research interests include artificial intelligence security and data mining. CHEN Jinyin, born in 1982, Ph.D, professor. Her main research interests include artificial intelligence security, graph data mining and evolutionary computing.
Supported by: National Natural Science Foundation of China (62072406), Natural Science Foundation of Zhejiang Province, China (LDQ23F020001), Chinese National Key Laboratory of Science and Technology on Information System Security (61421110502), Key R&D Projects in Zhejiang Province (2021C01117), 2020 Industrial Internet Innovation Development Project (TC200H01V) and "Ten Thousand Talents Program" in Zhejiang Province (2020R52011).

Abstract

With the wide application of deep learning, researchers must attend not only to a model's classification performance but also to whether its decisions are fair and credible. A deep learning model with decision bias can cause serious harm, so improving decision fairness while maintaining classification accuracy is important. Many methods have been proposed to improve the individual fairness of models, but they still fall short in debiasing effect, usability of the debiased model, and debiasing efficiency. To this end, this paper analyzes the abnormal activation of neurons that occurs when a deep model exhibits individual bias, and proposes NeuronSup, a model debiasing method based on the suppression of bias neurons. NeuronSup significantly reduces individual bias, has little impact on main-task performance, and has low time complexity. Specifically, the concept of a bias neuron is first proposed, based on the phenomenon that some neurons in a deep model are abnormally activated due to individual bias. Bias neurons are then located using discrimination samples, and the individual bias of the deep model is greatly reduced by suppressing their abnormal activation. Main-task performance neurons are identified according to the maximum-weight edge of each neuron; keeping the parameters of these neurons unchanged reduces the influence of the debiasing operation on the classification performance of the deep model. Because NeuronSup debiases only specific neurons in the deep model, its time complexity is lower and its efficiency is higher. Finally, in debiasing experiments on three real datasets with six sensitive attributes, with comparisons against five baseline algorithms, NeuronSup reduces the individual fairness metric THEMIS by more than 50% while keeping the impact of the debiasing operation on the classification accuracy of the deep model below 3%, which verifies the effectiveness of NeuronSup in reducing individual bias while preserving the classification ability of the deep model.
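
The idea of locating and suppressing bias neurons can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a one-hidden-layer ReLU network with hand-picked weights, a binary sensitive attribute in column 0, and a simple ranking rule (mean activation gap between each sample and its sensitive-attribute-flipped copy). All function names are hypothetical. The flip-based rate is only similar in spirit to an individual discrimination measure such as THEMIS, and the protection of main-task neurons via maximum-weight edges is omitted.

```python
import numpy as np

def hidden_activations(x, w1, b1):
    """ReLU activations of the single hidden layer."""
    return np.maximum(0.0, x @ w1 + b1)

def predict(x, w1, b1, w2, b2):
    """Binary prediction of the toy one-hidden-layer network."""
    return (hidden_activations(x, w1, b1) @ w2 + b2 > 0).astype(int)

def flip_sensitive(x, idx):
    """Copy of x with the binary sensitive attribute in column idx flipped."""
    x_flipped = x.copy()
    x_flipped[:, idx] = 1.0 - x_flipped[:, idx]
    return x_flipped

def discrimination_rate(x, idx, params):
    """Share of samples whose prediction changes when only the sensitive attribute flips."""
    changed = predict(x, *params) != predict(flip_sensitive(x, idx), *params)
    return float(np.mean(changed))

def find_bias_neurons(x, idx, w1, b1, k):
    """Assumed ranking rule: the k neurons with the largest mean activation gap under the flip."""
    gap = np.abs(hidden_activations(x, w1, b1)
                 - hidden_activations(flip_sensitive(x, idx), w1, b1))
    return np.argsort(gap.mean(axis=0))[::-1][:k]

def suppress(w2, neurons, factor=0.0):
    """Scale the outgoing weights of the selected neurons (factor 0.0 silences them)."""
    w2_new = w2.copy()
    w2_new[neurons] *= factor
    return w2_new

# Toy data: column 0 is the sensitive attribute, columns 1 and 2 are ordinary features.
x = np.array([[0.0, 0.2, 0.4],
              [1.0, 0.2, 0.4],
              [0.0, 0.9, 0.1],
              [1.0, 0.1, 0.8]])

# Hand-picked weights: hidden neuron 0 reads only the sensitive attribute.
w1 = np.array([[2.0, 0.0, 0.0],
               [0.0, 1.0, 0.0],
               [0.0, 0.0, 1.0]])
b1 = np.zeros(3)
w2 = np.array([1.0, 1.0, -1.0])
b2 = -0.5

rate_before = discrimination_rate(x, 0, (w1, b1, w2, b2))
bias_neurons = find_bias_neurons(x, 0, w1, b1, k=1)
w2_debiased = suppress(w2, bias_neurons)
rate_after = discrimination_rate(x, 0, (w1, b1, w2_debiased, b2))

print("bias neurons:", bias_neurons.tolist())
print("discrimination rate before:", rate_before)
print("discrimination rate after:", rate_after)
```

In this toy setup only the outgoing weight of one neuron changes, which mirrors the abstract's point that editing a few specific neurons is cheaper than retraining the whole model. A real network would also need a check that the suppressed neurons are not important to the main task.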

Key words: Individual fairness, Deep learning, Bias neurons, Model debiasing

CLC Number: TP391

Cite this article: NI Hongjie, LIU Jiawei, ZHENG Haibin, CHEN Yipeng, CHEN Jinyin. NeuronSup: Deep Model Debiasing Based on Bias Neuron Suppression [J]. Computer Science, 2023, 50(11): 122-131.

