Computer Science ›› 2023, Vol. 50 ›› Issue (7): 46-52.doi: 10.11896/jsjkx.230200216

• Database & Big Data & Data Science • Previous Articles     Next Articles

Disease Diagnosis Prediction Algorithm Based on Contrastive Learning

WANG Mingxia, XIONG Yun   

  1. School of Computer Science,Fudan University,Shanghai 200433,ChinaShanghai Key Laboratory of Data Science,Shanghai 200433,China
  • Received:2023-02-28 Revised:2023-04-17 Online:2023-07-15 Published:2023-07-05
  • About author:WANG Mingxia,born in 1999,postgraduate.Her main research interests include big data and medical data mi-ning.XIONG Yun,born in 1980,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.Her main research interests include data science and data mining.

Abstract: Disease diagnosis prediction aims to use electronic health data to model disease progression patterns and predict the future health status of patients,and is widely used in assisting clinical decision-making,healthcare services and other fields.In order to further explore the valuable information in the medical records,a disease diagnosis prediction algorithm based on contrastive learning is proposed.Contrastive learning provides self-supervised training signals for the model by measuring the similarity between samples,which can improve the information capture ability of the model.The proposed algorithm excavates the common knowledge between similar patients through contrastive training,and enhances the ability of the model to learn patient representations.In order to capture more comprehensive common information,the information of similar groups of the target patient is further explored as auxiliary information to characterize the health status of the target patient.Experimental results on the public dataset show that compared with the Retain,Dipole,LSAN and GRASP algorithms,the proposed algorithm improves AUROC and AUPRC of the readmission prediction task by more than 2.9% and 8.1% respectively,and Recall@10 and MAP@10 of the diagnosis prediction task by 2.1% and 1.8%,respectively.

Key words: Diagnosis prediction, Deep learning, Contrastive learning, Clustering, Similar patients

CLC Number: 

  • TP311
[1]LI Y J,ZHENG R L,YANG X M.Diagnosis and predictionmodel of coronary heart disease based on data mining technology[J].Medical Information,2020,33(24):14-17.
[2]ZHU X T,PANG C Y,ZHU H.Cardiovascular disease prediction model based on deep learning [J].Journal of Computer Applications,2021,41(S2):346-350.
[3]LI M,MA L Y,YAO Z.Study on an intelligent diagnosis prediction model based on deep neural network[J].Medical Information,2022,43(8):52-55,75.
[4]CHOI E,BAHADORI M T,KULAS J A,et al.Retain:An interpretable predictive model for healthcare using reverse time attention mechanism[C]//Proceedings of the 30th International Conference on Neural Information Processing Systems.2016:3512-3520.
[5]MA F,CHITTA R,ZHOU J,et al.Dipole:Diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks[C]//Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2017:1903-1911.
[6]XIAO C,MA T,DIENG A B,et al.Readmission prediction via deep contextual embedding of clinical concepts[J].PLOS ONE,2018,13(4):1-15.
[7]CHOI E,BAHADORI M T,SCHUETZ A,et al.Doctor AI:Predicting clinical events via recurrent neural networks[C]//Proceedings of the 1st Machine Learning for Healthcare Confe-rence.2016:301-318.
[8]BAYTAS I M,XIAO C,ZHANG X,et al.Patient subtyping via time-aware LSTM networks[C]//Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2017:65-74.
[9]KWON B C,CHOI M J,KIM J T,et al.RetainVis:Visual analytics with interpretable and interactive recurrent neural networks on electronic medical records [J].IEEE Transactions on Visualization and Computer Graphics,2019,25(1):299-309.
[10]BAI T,ZHANG S,EGLESTON B L,et al.Interpretable representation learning for healthcare via capturing disease progression through time[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2018:43-51.
[11]LUO J,YE M,XIAO C,et al.HiTANet:Hierarchical time-aware attention networks for risk prediction on electronic health records [C]//Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2020:647-656.
[12]MEN L,ILK N,TANG X,et al.Multi-disease predictionusing LSTM recurrent neural networks[J].Expert Systems with Applications,2021,177:114905.
[13]SUO Q,MA F,YUAN Y,et al.Personalized disease prediction using a CNN based similarity learning method[C]//2017 IEEE International Conference on Bioinformatics and Biomedicine(BIBM).2017:811-816.
[14]SUO Q,MA F,YUAN Y,et al.Deep patient similarity learning for personalized health care[J].IEEE Transactions on NanoBioscience,2018,17(3):219-227.
[15]ZHANG C,GAO X,MA L,et al.GRASP:Generic framework for health status representation learning based on incorporating knowledge from similar patients [C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:715-723.
[16]OEI R W,HSU W,LEE M L,et al.Using similar patients to predict complication in patients with diabetes,hypertension,and lipid disorder:a domain knowledge infused convolutional neural network approach[J].Journal of the American Medical Informatics Association,2022,30(2):273-281.
[17]LI Y,YANG D,GONG X.Patient similarity via medical attributed heterogeneous graph convolutional network[J].IAENG International Journal of Computer Science,2022,49(4):1152-1161.
[18]AN Y,LI R,CHEN X.MERGE:A multi-graph attentive representation learning framework integrating group information from similar patients[J].Computers in Biology and Medicine,2022,151:106245.
[19]ZHANG C,CHU X,MA L,et al.M3Care:Learning with mis-sing modalities in multimodal healthcare data[C]//Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining.KDD,2022:2418-2428.
[20]VAN DEN OORD A,LI Y,VINYALS O.Representation lear-ning with contrastive predictive coding[J].arXiv:,1807.03748,2018.
[21]LI J,ZHOU P,XIONG C,et al.Prototypical contrastive learning of unsupervised representations [J].arXiv:2005.04966,2020.
[22]PENG X,LONG G,SHEN T,et al.Self-attention enhanced patient journey understanding in healthcare system[C]//Joint European Conference on Machine Learning and Knowledge Disco-very in Databases.2020:719-735.
[23]YE M,LUO J,XIAO C,et al.LSAN:Modeling long-term dependencies and short-term correlations with hierarchical attention for risk prediction[C]//Proceedings of the 29th ACM International Conference on Information and Knowledge Management.2020:1753-1762.
[1] LI Kun, GUO Wei, ZHANG Fan, DU Jiayu, YANG Meiyue. Adversarial Malware Generation Method Based on Genetic Algorithm [J]. Computer Science, 2023, 50(7): 325-331.
[2] CUI Yunsong, WU Ye, XU Xiaoke. Decoupling Analysis of Network Structure Affecting Propagation Effect [J]. Computer Science, 2023, 50(7): 368-375.
[3] SHEN Zhehui, WANG Kailai, KONG Xiangjie. Exploring Station Spatio-Temporal Mobility Pattern:A Short and Long-term Traffic Prediction Framework [J]. Computer Science, 2023, 50(7): 98-106.
[4] HUO Weile, JING Tao, REN Shuang. Review of 3D Object Detection for Autonomous Driving [J]. Computer Science, 2023, 50(7): 107-118.
[5] ZHOU Bo, JIANG Peifeng, DUAN Chang, LUO Yuetong. Study on Single Background Object Detection Oriented Improved-RetinaNet Model and Its Application [J]. Computer Science, 2023, 50(7): 137-142.
[6] MAO Huihui, ZHAO Xiaole, DU Shengdong, TENG Fei, LI Tianrui. Short-term Subway Passenger Flow Forecasting Based on Graphical Embedding of Temporal Knowledge [J]. Computer Science, 2023, 50(7): 213-220.
[7] LI Yuqiang, LI Linfeng, ZHU Hao, HOU Mengshu. Deep Learning-based Algorithm for Active IPv6 Address Prediction [J]. Computer Science, 2023, 50(7): 261-269.
[8] GAO Xiang, TANG Jiqiang, ZHU Junwu, LIANG Mingxuan, LI Yang. Study on Named Entity Recognition Method Based on Knowledge Graph Enhancement [J]. Computer Science, 2023, 50(6A): 220700153-6.
[9] LUO Ruiqi, YAN Jinlin, HU Xinrong, DING Lei. EEG Emotion Recognition Based on Multiple Directed Weighted Graph and ConvolutionalNeural Network [J]. Computer Science, 2023, 50(6A): 220600128-8.
[10] ZENG Wu, MAO Guojun. Few-shot Learning Method Based on Multi-graph Feature Aggregation [J]. Computer Science, 2023, 50(6A): 220400029-10.
[11] CHEN Jie. Study on Long Text Topic Clustering Based on Doc2Vec Enhanced Features [J]. Computer Science, 2023, 50(6A): 220800192-6.
[12] HOU Yanrong, LIU Ruixia, SHU Minglei, CHEN Changfang, SHAN Ke. Review of Research on Denoising Algorithms of ECG Signal [J]. Computer Science, 2023, 50(6A): 220300094-11.
[13] GU Yuhang, HAO Jie, CHEN Bing. Semi-supervised Semantic Segmentation for High-resolution Remote Sensing Images Based on DataFusion [J]. Computer Science, 2023, 50(6A): 220500001-6.
[14] LIANG Mingxuan, WANG Shi, ZHU Junwu, LI Yang, GAO Xiang, JIAO Zhixiang. Survey of Knowledge-enhanced Natural Language Generation Research [J]. Computer Science, 2023, 50(6A): 220200120-8.
[15] WANG Dongli, YANG Shan, OUYANG Wanli, LI Baopu, ZHOU Yan. Explainability of Artificial Intelligence:Development and Application [J]. Computer Science, 2023, 50(6A): 220600212-7.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!