Computer Science ›› 2020, Vol. 47 ›› Issue (9): 185-189.doi: 10.11896/jsjkx.190900001

• Artificial Intelligence • Previous Articles     Next Articles

MTHAM:Multitask Disease Progression Modeling Based on Hierarchical Attention Mechanism

PAN Zu-jiang1, LIU Ning1, ZHANG Wei2, WANG Jian-yong1   

  1. 1 Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China
    2 School of Computer Science and Technology,East China Normal University,Shanghai 200333,China
  • Received:2019-08-30 Published:2020-09-10
  • About author:PAN Zu-jiang,born in 1994,postgra-duate.His research interests include data mining and machine learnig.
    ZHANG Wei,born in 1988,Ph.D,associate researcher.His research interests include user data modeling and so on.
  • Supported by:
    Key Program of National Natural Science Foundation of China (61532010).

Abstract: Alzheimer’s disease (AD) is an irreversible neurodegenerative disease.The degeneration of brain tissue causes serious cognitive problems and eventually leads to death.There are many clinical trials and research projects to study AD pathology and produce some data for analysis.This paper focuses on the diagnosis of AD and the prediction of potential prognosis in combination with a variety of clinical features.In this paper,a multi-task disease progression model based on hierarchical attention mechanism is proposed.The task of disease automatic diagnosis is regarded as the main task,and the task of disease prognosis is regarded as the auxiliary task to improve the generalization ability of the model,and then improve the performance of disease automatic diagnosis task.In this paper,two layers of attention mechanism are applied in the feature layer and the medical record layer respectively,so that the model can pay different attention to different features and different medical records.The validation experiment is carried out on ADNI (Alzheimer’s Disease Neuroimaging Initiative) dataset.Compared with several benchmark models,the experimental results show that the proposed method has better performance and provides better robustness for clinical application.

Key words: Attention mechanism, Multi-task learning, Automatic diagnosis, Prognosis prediction, Alzheimer’s disease

CLC Number: 

  • TP391.4
[1] PATTERSON C.World Alzheimer Report 2018—The state of the art of dementia research:New frontiers[R].Alzheimer’s Disease International (ADI):London,UK,2018.
[2] WANG Q,SUN M,ZHAN L,et al.Multi-Modality DiseaseModeling via Collective Deep Matrix Factorization[C]//Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.ACM,2017:1155-1164.
[3] DAI P,GWADRY-SRIDHAR F,BAUER M,et al.Healthy cognitive aging:A hybrid random vector functional-link model for the analysis of alzheimer’s disease[C]//Thirty-First AAAI Conference on Artificial Intelligence.2017.
[4] HUANG G B,ZHU Q Y,SIEW C K.Extreme learning ma-chine:theory and applications[J].Neurocomputing,2006,70(1/2/3):489-501.
[5] HINTON G,DENG L,YU D,et al.Deep neural networks for acoustic modeling in speech recognition[J].IEEE Signal proces-sing magazine,2012,29(6):82-97.
[6] LECUN Y,BENGIO Y,HINTON G.Deep learning[J].Nature,2015,521(7553):436.
[7] TUFAIL A B,ABIDI A,SIDDIQUI A M,et al.Automatic classification of initial categories of Alzheimer’s disease from structural MRI phase images:a comparison of PSVM,KNN and ANN methods[J].Age,2012,2012:1731.
[8] LEBEDEV A V,WESTMAN E,VAN WESTEN G J P,et al.Random Forest ensembles for detection and prediction of Alzheimer's disease with a good between-cohort robustness[J].NeuroImage:Clinical,2014,6:115-125.
[9] LÓPEZ M,RAMÍREZ J,GÓRRIZ J M,et al.Principal component analysis-based techniques and supervised classification schemes for the early detection of Alzheimer's disease[J].Neurocomputing,2011,74(8):1260-1271.
[10] SHI B,CHEN Y,HOBBS K,et al.Nonlinear Metric Learning for Alzheimer’s Disease Diagnosis with Integration of Longitudinal Neuroimaging Features[C]//BMVC.2015.
[11] DAI P,GWADRY-SRIDHAR F,BAUER M,et al.Bagging ensembles for the diagnosis and prognostication of alzheimer’s di-sease[C]//Thirtieth AAAI Conference on Artificial Intelligence.2016.
[12] XING E P,JORDAN M I,RUSSELL S J,et al.Distance metric learning with application to clustering with side-information[C]//Advances in neural information processing systems.2003:521-528.
[13] LIN T,ZHA H.Riemannian manifold learning[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2008,30(5):796-809.
[14] 邱锡鹏.神经网络与深度学习[OL].[2017-04-21].
[15] BAHDANAU D,CHO K,BENGIO Y.Neural machine translation by jointly learning to align and translate[J].arXiv:1409.0473,2014.
[16] YANG Z,YANG D,DYER C,et al.Hierarchical attention networks for document classification[C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2016:1480-1489.
[17] MNIH V,HEESS N,GRAVES A.Recurrent models of visualattention[C]//Advances in Neural Information Processing Systems.2014:2204-2212.
[18] YANG Y,YANG L,ZOU Y B,et al.Humor Recognition Based on Linguistic Features and Hierarchical Attention Mechanism[J].Computer Engineering,2020,46(8):64-71.
[19] CHOI E,BAHADORI M T,SUN J,et al.Retain:An interpretable predictive model for healthcare using reverse time attention mechanism[C]//Advances in Neural Information Processing Systems.2016:3504-3512.
[20] COLLOBERT R,WESTON J.A unified architecture for natural language processing:Deep neural networks with multitask lear-ning[C]//Proceedings of the 25th International Conference on Machine Learning.ACM,2008:160-167.
[21] ZHANG W J.An Online Multi-Task Learning Algorithm Based on Weight Matrix Decomposition[J].Computer Engineering,2019,45(8):190-197.
[22] DENG L,HINTON G,KINGSBURY B.New types of deep neural network learning for speech recognition and related applications:An overview[C]//2013 IEEE International Conference on Acoustics,Speech and Signal Processing.IEEE,2013:8599-8603.
[23] GIRSHICK R.Fast r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision.2015:1440-1448.
[24] RAMSUNDAR B,KEARNES S,RILEY P,et al.Massivelymultitask networks for drug discovery[J].arXiv:1502.02072,2015.
[25] RUDER S.An overview of multi-task learning in deep neuralnetworks[J].arXiv:1706.05098,2017.
[26] CARUANA R.Multitask learning[J].Machine Learning,1997,28(1):41-75.
[27] BAXTER J.A Bayesian/information theoretic model of learning to learn via multiple task sampling[J].Machine Learning,1997,28(1):7-39.
[28] HOCHREITER S,SCHMIDHUBER J.Long short-term memo-ry[J].Neural Computation,1997,9(8):1735-1780.
[29] KINGMA D P,BA J.Adam:A method for stochastic optimization[J].arXiv:1412.6980,2014.
[30] WEINER M W,VEITCH D P,AISEN P S,et al.The Alzhemer’sDisease Neuroimaging Initiative:a review of papers published since its inception[J].Alzheimer’s & Dementia,2013,9(5):e111-e194.
[1] ZHAO Jia-qi, WANG Han-zheng, ZHOU Yong, ZHANG Di, ZHOU Zi-yuan. Remote Sensing Image Description Generation Method Based on Attention and Multi-scale Feature Enhancement [J]. Computer Science, 2021, 48(1): 190-196.
[2] LIU Yang, JIN Zhong. Fine-grained Image Recognition Method Combining with Non-local and Multi-region Attention Mechanism [J]. Computer Science, 2021, 48(1): 197-203.
[3] WANG Rui-ping, JIA Zhen, LIU Chang, CHEN Ze-wei, LI Tian-rui. Deep Interest Factorization Machine Network Based on DeepFM [J]. Computer Science, 2021, 48(1): 226-232.
[4] WANG Run-zheng, GAO Jian, HUANG Shu-hua, TONG Xin. Malicious Code Family Detection Method Based on Knowledge Distillation [J]. Computer Science, 2021, 48(1): 280-286.
[5] ZHAO Wei, LIN Yu-ming, WANG Chao-qiang, CAI Guo-yong. Opinion Word-pairs Collaborative Extraction Based on Dependency Relation Analysis [J]. Computer Science, 2020, 47(8): 164-170.
[6] YUAN Ye, HE Xiao-ge, ZHU Ding-kun, WANG Fu-lee, XIE Hao-ran, WANG Jun, WEI Ming-qiang, GUO Yan-wen. Survey of Visual Image Saliency Detection [J]. Computer Science, 2020, 47(7): 84-91.
[7] LIU Yan, WEN Jing. Complex Scene Text Detection Based on Attention Mechanism [J]. Computer Science, 2020, 47(7): 135-140.
[8] YU Yi-lin, TIAN Hong-tao, GAO Jian-wei and WAN Huai-yu. Relation Extraction Method Combining Encyclopedia Knowledge and Sentence Semantic Features [J]. Computer Science, 2020, 47(6A): 40-44.
[9] NI Hai-qing, LIU Dan, SHI Meng-yu. Chinese Short Text Summarization Generation Model Based on Semantic-aware [J]. Computer Science, 2020, 47(6): 74-78.
[10] HUANG Yong-tao, YAN Hua. Scene Graph Generation Model Combining Attention Mechanism and Feature Fusion [J]. Computer Science, 2020, 47(6): 133-137.
[11] ZHANG Zhi-yang, ZHANG Feng-li, CHEN Xue-qin, WANG Rui-jin. Information Cascade Prediction Model Based on Hierarchical Attention [J]. Computer Science, 2020, 47(6): 201-209.
[12] DENG Yi-jiao, ZHANG Feng-li, CHEN Xue-qin, AI Qing, YU Su-zhe. Collaborative Attention Network Model for Cross-modal Retrieval [J]. Computer Science, 2020, 47(4): 54-59.
[13] ZHOU Zi-qin, YAN Hua. 3D Shape Recognition Based on Multi-task Learning with Limited Multi-view Data [J]. Computer Science, 2020, 47(4): 125-130.
[14] ZHANG Peng-fei, LI Guan-yu, JIA Cai-yan. Truncated Gaussian Distance-based Self-attention Mechanism for Natural Language Inference [J]. Computer Science, 2020, 47(4): 178-183.
[15] ZHANG Yi-fei,WANG Zhong-qing,WANG Hong-ling. Product Review Summarization Using Discourse Hierarchical Structure [J]. Computer Science, 2020, 47(2): 195-200.
Full text



[1] LEI Li-hui and WANG Jing. Parallelization of LTL Model Checking Based on Possibility Measure[J]. Computer Science, 2018, 45(4): 71 -75 .
[2] SUN Qi, JIN Yan, HE Kun and XU Ling-xuan. Hybrid Evolutionary Algorithm for Solving Mixed Capacitated General Routing Problem[J]. Computer Science, 2018, 45(4): 76 -82 .
[3] WU Jian-hui, HUANG Zhong-xiang, LI Wu, WU Jian-hui, PENG Xin and ZHANG Sheng. Robustness Optimization of Sequence Decision in Urban Road Construction[J]. Computer Science, 2018, 45(4): 89 -93 .
[4] SHI Wen-jun, WU Ji-gang and LUO Yu-chun. Fast and Efficient Scheduling Algorithms for Mobile Cloud Offloading[J]. Computer Science, 2018, 45(4): 94 -99 .
[5] GENG Hai-jun, SHI Xin-gang, WANG Zhi-liang, YIN Xia and YIN Shao-ping. Energy-efficient Intra-domain Routing Algorithm Based on Directed Acyclic Graph[J]. Computer Science, 2018, 45(4): 112 -116 .
[6] ZHENG Xiu-lin, SONG Hai-yan and FU Yi-peng. Distinguishing Attack of MORUS-1280-128[J]. Computer Science, 2018, 45(4): 152 -156 .
[7] ZHU Shu-qin, WANG Wen-hong and LI Jun-qing. Chosen Plaintext Attack on Chaotic Image Encryption Algorithm Based on Perceptron Model[J]. Computer Science, 2018, 45(4): 178 -181 .
[8] GUO Shuai, LIU Liang and QIN Xiao-lin. Spatial Keyword Range Query with User Preferences Constraint[J]. Computer Science, 2018, 45(4): 182 -189 .
[9] WEN Jun-hao, SUN Guang-hui and LI Shun. Study on Matrix Factorization Recommendation Algorithm Based on User Clustering and Mobile Context[J]. Computer Science, 2018, 45(4): 215 -219 .
[10] JIA Wei, HUA Qing-yi, ZHANG Min-jun, CHEN Rui, JI Xiang and WANG Bo. Mobile Interface Pattern Clustering Algorithm Based on Improved Particle Swarm Optimization[J]. Computer Science, 2018, 45(4): 220 -226 .