Computer Science ›› 2020, Vol. 47 ›› Issue (9): 185-189.doi: 10.11896/jsjkx.190900001

• Artificial Intelligence • Previous Articles     Next Articles

MTHAM:Multitask Disease Progression Modeling Based on Hierarchical Attention Mechanism

PAN Zu-jiang1, LIU Ning1, ZHANG Wei2, WANG Jian-yong1   

  1. 1 Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China
    2 School of Computer Science and Technology,East China Normal University,Shanghai 200333,China
  • Received:2019-08-30 Published:2020-09-10
  • About author:PAN Zu-jiang,born in 1994,postgra-duate.His research interests include data mining and machine learnig.
    ZHANG Wei,born in 1988,Ph.D,associate researcher.His research interests include user data modeling and so on.
  • Supported by:
    Key Program of National Natural Science Foundation of China (61532010).

Abstract: Alzheimer’s disease (AD) is an irreversible neurodegenerative disease.The degeneration of brain tissue causes serious cognitive problems and eventually leads to death.There are many clinical trials and research projects to study AD pathology and produce some data for analysis.This paper focuses on the diagnosis of AD and the prediction of potential prognosis in combination with a variety of clinical features.In this paper,a multi-task disease progression model based on hierarchical attention mechanism is proposed.The task of disease automatic diagnosis is regarded as the main task,and the task of disease prognosis is regarded as the auxiliary task to improve the generalization ability of the model,and then improve the performance of disease automatic diagnosis task.In this paper,two layers of attention mechanism are applied in the feature layer and the medical record layer respectively,so that the model can pay different attention to different features and different medical records.The validation experiment is carried out on ADNI (Alzheimer’s Disease Neuroimaging Initiative) dataset.Compared with several benchmark models,the experimental results show that the proposed method has better performance and provides better robustness for clinical application.

Key words: Attention mechanism, Multi-task learning, Automatic diagnosis, Prognosis prediction, Alzheimer’s disease

CLC Number: 

  • TP391.4
[1] PATTERSON C.World Alzheimer Report 2018—The state of the art of dementia research:New frontiers[R].Alzheimer’s Disease International (ADI):London,UK,2018.
[2] WANG Q,SUN M,ZHAN L,et al.Multi-Modality DiseaseModeling via Collective Deep Matrix Factorization[C]//Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.ACM,2017:1155-1164.
[3] DAI P,GWADRY-SRIDHAR F,BAUER M,et al.Healthy cognitive aging:A hybrid random vector functional-link model for the analysis of alzheimer’s disease[C]//Thirty-First AAAI Conference on Artificial Intelligence.2017.
[4] HUANG G B,ZHU Q Y,SIEW C K.Extreme learning ma-chine:theory and applications[J].Neurocomputing,2006,70(1/2/3):489-501.
[5] HINTON G,DENG L,YU D,et al.Deep neural networks for acoustic modeling in speech recognition[J].IEEE Signal proces-sing magazine,2012,29(6):82-97.
[6] LECUN Y,BENGIO Y,HINTON G.Deep learning[J].Nature,2015,521(7553):436.
[7] TUFAIL A B,ABIDI A,SIDDIQUI A M,et al.Automatic classification of initial categories of Alzheimer’s disease from structural MRI phase images:a comparison of PSVM,KNN and ANN methods[J].Age,2012,2012:1731.
[8] LEBEDEV A V,WESTMAN E,VAN WESTEN G J P,et al.Random Forest ensembles for detection and prediction of Alzheimer's disease with a good between-cohort robustness[J].NeuroImage:Clinical,2014,6:115-125.
[9] LÓPEZ M,RAMÍREZ J,GÓRRIZ J M,et al.Principal component analysis-based techniques and supervised classification schemes for the early detection of Alzheimer's disease[J].Neurocomputing,2011,74(8):1260-1271.
[10] SHI B,CHEN Y,HOBBS K,et al.Nonlinear Metric Learning for Alzheimer’s Disease Diagnosis with Integration of Longitudinal Neuroimaging Features[C]//BMVC.2015.
[11] DAI P,GWADRY-SRIDHAR F,BAUER M,et al.Bagging ensembles for the diagnosis and prognostication of alzheimer’s di-sease[C]//Thirtieth AAAI Conference on Artificial Intelligence.2016.
[12] XING E P,JORDAN M I,RUSSELL S J,et al.Distance metric learning with application to clustering with side-information[C]//Advances in neural information processing systems.2003:521-528.
[13] LIN T,ZHA H.Riemannian manifold learning[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2008,30(5):796-809.
[14] 邱锡鹏.神经网络与深度学习[OL].[2017-04-21].
[15] BAHDANAU D,CHO K,BENGIO Y.Neural machine translation by jointly learning to align and translate[J].arXiv:1409.0473,2014.
[16] YANG Z,YANG D,DYER C,et al.Hierarchical attention networks for document classification[C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2016:1480-1489.
[17] MNIH V,HEESS N,GRAVES A.Recurrent models of visualattention[C]//Advances in Neural Information Processing Systems.2014:2204-2212.
[18] YANG Y,YANG L,ZOU Y B,et al.Humor Recognition Based on Linguistic Features and Hierarchical Attention Mechanism[J].Computer Engineering,2020,46(8):64-71.
[19] CHOI E,BAHADORI M T,SUN J,et al.Retain:An interpretable predictive model for healthcare using reverse time attention mechanism[C]//Advances in Neural Information Processing Systems.2016:3504-3512.
[20] COLLOBERT R,WESTON J.A unified architecture for natural language processing:Deep neural networks with multitask lear-ning[C]//Proceedings of the 25th International Conference on Machine Learning.ACM,2008:160-167.
[21] ZHANG W J.An Online Multi-Task Learning Algorithm Based on Weight Matrix Decomposition[J].Computer Engineering,2019,45(8):190-197.
[22] DENG L,HINTON G,KINGSBURY B.New types of deep neural network learning for speech recognition and related applications:An overview[C]//2013 IEEE International Conference on Acoustics,Speech and Signal Processing.IEEE,2013:8599-8603.
[23] GIRSHICK R.Fast r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision.2015:1440-1448.
[24] RAMSUNDAR B,KEARNES S,RILEY P,et al.Massivelymultitask networks for drug discovery[J].arXiv:1502.02072,2015.
[25] RUDER S.An overview of multi-task learning in deep neuralnetworks[J].arXiv:1706.05098,2017.
[26] CARUANA R.Multitask learning[J].Machine Learning,1997,28(1):41-75.
[27] BAXTER J.A Bayesian/information theoretic model of learning to learn via multiple task sampling[J].Machine Learning,1997,28(1):7-39.
[28] HOCHREITER S,SCHMIDHUBER J.Long short-term memo-ry[J].Neural Computation,1997,9(8):1735-1780.
[29] KINGMA D P,BA J.Adam:A method for stochastic optimization[J].arXiv:1412.6980,2014.
[30] WEINER M W,VEITCH D P,AISEN P S,et al.The Alzhemer’sDisease Neuroimaging Initiative:a review of papers published since its inception[J].Alzheimer’s & Dementia,2013,9(5):e111-e194.
[1] ZHAO Wei, LIN Yu-ming, WANG Chao-qiang, CAI Guo-yong. Opinion Word-pairs Collaborative Extraction Based on Dependency Relation Analysis [J]. Computer Science, 2020, 47(8): 164-170.
[2] YUAN Ye, HE Xiao-ge, ZHU Ding-kun, WANG Fu-lee, XIE Hao-ran, WANG Jun, WEI Ming-qiang, GUO Yan-wen. Survey of Visual Image Saliency Detection [J]. Computer Science, 2020, 47(7): 84-91.
[3] LIU Yan, WEN Jing. Complex Scene Text Detection Based on Attention Mechanism [J]. Computer Science, 2020, 47(7): 135-140.
[4] YU Yi-lin, TIAN Hong-tao, GAO Jian-wei and WAN Huai-yu. Relation Extraction Method Combining Encyclopedia Knowledge and Sentence Semantic Features [J]. Computer Science, 2020, 47(6A): 40-44.
[5] NI Hai-qing, LIU Dan, SHI Meng-yu. Chinese Short Text Summarization Generation Model Based on Semantic-aware [J]. Computer Science, 2020, 47(6): 74-78.
[6] HUANG Yong-tao, YAN Hua. Scene Graph Generation Model Combining Attention Mechanism and Feature Fusion [J]. Computer Science, 2020, 47(6): 133-137.
[7] ZHANG Zhi-yang, ZHANG Feng-li, CHEN Xue-qin, WANG Rui-jin. Information Cascade Prediction Model Based on Hierarchical Attention [J]. Computer Science, 2020, 47(6): 201-209.
[8] DENG Yi-jiao, ZHANG Feng-li, CHEN Xue-qin, AI Qing, YU Su-zhe. Collaborative Attention Network Model for Cross-modal Retrieval [J]. Computer Science, 2020, 47(4): 54-59.
[9] ZHOU Zi-qin, YAN Hua. 3D Shape Recognition Based on Multi-task Learning with Limited Multi-view Data [J]. Computer Science, 2020, 47(4): 125-130.
[10] ZHANG Peng-fei, LI Guan-yu, JIA Cai-yan. Truncated Gaussian Distance-based Self-attention Mechanism for Natural Language Inference [J]. Computer Science, 2020, 47(4): 178-183.
[11] ZHANG Yi-fei,WANG Zhong-qing,WANG Hong-ling. Product Review Summarization Using Discourse Hierarchical Structure [J]. Computer Science, 2020, 47(2): 195-200.
[12] GAO Li-jian,MAO Qi-rong. Environment-assisted Multi-task Learning for Polyphonic Acoustic Event Detection [J]. Computer Science, 2020, 47(1): 159-164.
[13] LI Yuan,LI Zhi-xing,TENG Lei,WANG Hua-ming,WANG Guo-yin. Comment Sentiment Analysis and Sentiment Words Detection Based on Attention Mechanism [J]. Computer Science, 2020, 47(1): 186-192.
[14] YANG Dan-hao,WU Yue-xin,FAN Chun-xiao. Chinese Short Text Keyphrase Extraction Model Based on Attention [J]. Computer Science, 2020, 47(1): 193-198.
[15] SUN Zhong-feng, WANG Jing. RCNN-BGRU-HN Network Model for Aspect-based Sentiment Analysis [J]. Computer Science, 2019, 46(9): 223-228.
Full text



[1] SUN Qi, JIN Yan, HE Kun and XU Ling-xuan. Hybrid Evolutionary Algorithm for Solving Mixed Capacitated General Routing Problem[J]. Computer Science, 2018, 45(4): 76 -82 .
[2] WEN Jun-hao, SUN Guang-hui and LI Shun. Study on Matrix Factorization Recommendation Algorithm Based on User Clustering and Mobile Context[J]. Computer Science, 2018, 45(4): 215 -219, 251 .
[3] JIA Wei, HUA Qing-yi, ZHANG Min-jun, CHEN Rui, JI Xiang and WANG Bo. Mobile Interface Pattern Clustering Algorithm Based on Improved Particle Swarm Optimization[J]. Computer Science, 2018, 45(4): 220 -226 .
[4] DING Shu-yang, LI Bing and SHI Hong-bo. Study on Flexible Job-shop Scheduling Problem Based on Improved Discrete Particle Swarm Optimization Algorithm[J]. Computer Science, 2018, 45(4): 233 -239, 256 .
[5] ZHANG Wen-bo and HOU Xiao-rong. Estimation Algorithm of Atmospheric Light Based on Gaussian Distribution[J]. Computer Science, 2018, 45(4): 301 -305 .
[6] ZHENG Xiang-ping, YU Zhi-yong, WEN Guang-bin. Community Discovery in Location Network[J]. Computer Science, 2018, 45(6): 46 -50 .
[7] FENG Yan-hong, YU Hong, SUN Geng, PENG Song. Diversity Measures Method in High-dimensional Semantic Vector Based on Asymmetric Multi-valued Feature Jaccard Coefficient[J]. Computer Science, 2018, 45(6): 57 -66 .
[8] HUANG Dong-mei, DU Yan-ling, HE Qi, SUI Hong-yun, LI Yao. Marine Monitoring Data Replica Layout Strategy Based on Multiple Attribute Optimization[J]. Computer Science, 2018, 45(6): 72 -75,104 .
[9] WU Jian-xia, YANG Yong-li. Algorithm for Reducing PAPR of FBMC-OQAM System[J]. Computer Science, 2018, 45(6): 89 -95 .
[10] WANG Qian, YU Lai-hang, CAO Yan, ZHANG Lei, QIN Jie, YE Hai-qin. Blind Watermarking Algorithm for Digital Image Based on Fibonacci Scrambling in Wavelet Domain[J]. Computer Science, 2018, 45(6): 135 -140 .