计算机科学 ›› 2022, Vol. 49 ›› Issue (6A): 60-65.doi: 10.11896/jsjkx.201200072

• 智慧医疗 • 上一篇    下一篇

基于注意力机制和多任务学习的阿尔茨海默症分类

杜丽君1, 唐玺璐2, 周娇2, 陈玉兰2, 程建2   

  1. 1 乐山师范学院人工智能学院 四川 乐山 614000
    2 电子科技大学信息与通信工程学院 成都 611731
  • 出版日期:2022-06-10 发布日期:2022-06-08
  • 通讯作者: 程建(chengjian@uestc.edu.cn)
  • 作者简介:(dulijun@lsnu.edu.cn)
  • 基金资助:
    厅市共建智能终端四川省重点实验室开放课题基金(SCITLAB-0017);四川省科技计划项目(2020YFG0085,2021YFG0328);乐山市科技计划项目(19GZD044)

Alzheimer's Disease Classification Method Based on Attention Mechanism and Multi-task Learning

DU Li-jun1, TANG Xi-lu2, ZHOU Jiao2, CHEN Yu-lan2, CHENG Jian2   

  1. 1 School of Artificial Intelligence,Leshan Normal University,Leshan,Sichuan 614000,China
    2 School of Information and Communication Engineering,University of Electronic Science and Technology of China,Chengdu 611731,China
  • Online:2022-06-10 Published:2022-06-08
  • About author:DU Li-jun,born in 1976,master,lectu-rer.Her main research interests include graphics and images processing,computer vision,and medical image analysis,etc.
    CHENG Jian,born in 1978,professor,Ph.D supervisor.His main research interests include machine learning,computer vision,remote sensing image analysis,medical image analysis,and artificial intelligence,etc.
  • Supported by:
    Open Project Fund of Intelligent Terminal Key Laboratory of Sichuan Province(SCITLAB-0017),Sichuan Science and Technology Program(2020YFG0085,2021YFG0328) and Leshan Science and Technology Program(19GZD044).

摘要: 利用深度学习实现阿尔茨海默症分类已成为近年来医学图像研究热点之一。为了解决现有模型难以有效提取医学图像特征及疾病分类辅助信息资源浪费等问题,基于深度三维卷积神经网络提出一种引入注意力机制和多任务学习的阿尔茨海默症分类方法。首先,利用改进的基础C3D网络,生成较粗糙的低级特征图;然后,将其分别输入至引入注意力机制的卷积块与普通卷积块中,前者关注MRI图像的结构特性,能获取特征图中不同像素位置特有的注意力权重,与后者输出的特征图对应相乘;最后,利用不同的全连接层来实现多任务学习,获得包含主分类任务在内的3种输出,另2种输出在训练过程中通过反向传播优化主分类任务,得到优化后的阿尔茨海默症分类结果。实验结果表明,与目前已有阿尔茨海默症分类方法相比,所提方法在ADNI数据集上的分类准确率等指标均有所提升,有助于推进后续疾病分类研究。

关键词: 阿尔茨海默症, 多任务学习, 三维卷积, 医学图像分类, 注意力机制

Abstract: In recent years,using deep learning to classify Alzheimer's disease has become one of the hotspots in medical image research.But the existing models are difficult to extract medical image features effectively,what's more,the auxiliary information resources of disease classification are wasted.To solve these problems,a classification method of Alzheimer's disease with attention mechanism and multi-task learning based on the deep 3D convolution neural network is proposed.Firstly,using the improved traditional C3D network,a rough low-level feature map is generated.Secondly,this feature map is input into a convolution block with attention mechanism and a common convolution block respectively.The former focuses on the structural characteristics of MRI images,and can obtain the attention weight of different pixel in the feature map,which is multiplied by the output feature map of the latter.Finally,multi-task learning is used to obtain three kinds of outputs by adding different full connected layer.The other two outputs optimize the main classification task through back propagation in the training process.Experimental results show that,compared with the existing classification methods of Alzheimer's disease,the classification accuracy and other indicators of the proposed method on ADNI data set have been improved,which is helpful to promote the follow-up disease classification research.

Key words: 3D convolution, Alzheimer's disease, Attention mechanism, Medical image classification, Multi-task learning

中图分类号: 

  • TP183
[1] ALZHEIMER'S A.2018 Alzheimer's disease facts and figures[J].Alzheimer's & Dementia,2018,14(3):367-429.
[2] FAN Y,BATMANGHELICH N,CLARK C M,et al.Spatialpatterns of brain atrophy in MCI patients,identified via high-dimensional pattern classification,predict subsequent cognitive decline[J].Neuroimage,2008,39(4):1731-1743.
[3] KRUTHIKA K R,RAJESWARI,MAHESHAPPA H D.Multistage classifier-based approach for Alzheimer's disease prediction and retrieval[J].Informatics in Medicine Unlocked,2018,14:34-42.
[4] KRIZHEVSKY A,SUTSKEVER I,HINTON G.ImageNetClassification with Deep Convolutional Neural Networks[C]//Advances in Neural Information Processing Systems.2012:1106-1114.
[5] SIMONYAN K,ZISSERMAN A.Very deep convolutional net-works for large-scale image recognition[EB/OL].(2014-09-04).https://arxiv.org/abs/1409.1556.
[6] SZEGEDY C,LIU W,JIA Y,et al.Going deeper with convolutions[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:1-9.
[7] HE K,ZHANG X,REN S,et al.Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[8] SHEN D,WU G,SUK H I.Deep Learning in Medical Image Analysis[J].Annual Review of Biomedical Engineering,2017,19(1):221-248.
[9] BILLONES C D,DEMETRIA O J L D,HOSTALLERO D E D,et al.DemNet:A convolutional neural network for the detection of Alzheimer's Disease and Mild Cognitive Impairment[C]//2016 IEEE Region 10 Conference(TENCON).2016:3724-3727.
[10] KOROLEV S,SAFIULLIN A,BELYAEV M,et al.Residual and plain convolutional neural networks for 3D brain MRI classification[C]//2017 IEEE 14th International Symposium on Biomedical Imaging.2017:835-838.
[11] SAHUMBAIEV I,POPOV A,RAMÍREZ J,et al.3D-CNNHadNet classification of MRI for Alzheimer's Disease diagnosis[C]//2018 IEEE Nuclear Science Symposium and Medical Imaging Conference Proceedings.2018:1-4.
[12] JI S,XU W,YANG M,et al.3D Convolutional Neural Networks for Human Action Recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2013,35(1):221-231.
[13] TRAN D,BOURDEV L,FERGUSR,et al.Learning spatiotemporal features with 3d convolutional networks[C]//Proceedings of the IEEE International Conference on Computer Vision.2015:4489-4497.
[14] KAUL C,MANANDHAR S,PEARS N.FocusNet:An attention-based Fully Convolutional Network for Medical Image Segmentation [EB/OL].(2019-02-08).https://arxiv.org/abs/1902.03091.
[15] VASWANI A,SHAZEER N,PARMARN,et al.Attention is all you need[C]//Advances in Neural Information Processing Systems.2017:5998-6008.
[16] ZHANG Y,ZHANG P Y,YAN Y H.Long short-term memory with attention and multitask learning for distant speech recognition[J].Journal of Tsinghua University(Science and Technology),2018,58(3):249-253.
[17] ZHAO B,WU X,FENG J,et al.Diversified visual attention networks for fine-grained object classification[J].IEEE Transactions on Multimedia,2017,19(6):1245-1256.
[18] RUDER S.An overview of multi-task learning in deep neuralnetworks[EB/OL].(2017-06-15).https://arxiv.org/abs/1706.05098.
[19] ABADI M,BARHAM P,CHEN J,et al.Tensorflow:A system for large-scale machine learning[C]//12th {USENIX} Sympo-sium on Operating Systems Design and Implementation(OSDI 16).2016:265-283.
[20] CHENG D,LIU M.Combining convolutional and recurrent neural networks for Alzheimer's disease diagnosis using PET images[C]//2017 IEEE International Conference on Imaging Systems and Techniques(IST).2017:1-5.
[21] ZENG A,JIA L F,PAN D.Early prognosis of Alzheimer's di-sease based on convolutional neural networks and ensemble learning[J].Journal of Biomedical Engineering,2019,36(5):711-719.
[22] RUOXUAN C,MANHUA L.Hippocampus Analysis by Combination of 3-D DenseNet and Shapes for Alzheimer's Disease Diagnosis[J].IEEE Journal of Biomedical and Health Informa-tics,2019,23(5):2099-2107.
[1] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[2] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[3] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[4] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[5] 熊丽琴, 曹雷, 赖俊, 陈希亮.
基于值分解的多智能体深度强化学习综述
Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization
计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[6] 汪鸣, 彭舰, 黄飞虎.
基于多时间尺度时空图网络的交通流量预测模型
Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction
计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[7] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[8] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[9] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[10] 闫佳丹, 贾彩燕.
基于双图神经网络信息融合的文本分类方法
Text Classification Method Based on Information Fusion of Dual-graph Neural Network
计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042
[11] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[12] 曾志贤, 曹建军, 翁年凤, 蒋国权, 徐滨.
基于注意力机制的细粒度语义关联视频-文本跨模态实体分辨
Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism
计算机科学, 2022, 49(7): 106-112. https://doi.org/10.11896/jsjkx.210500224
[13] 徐鸣珂, 张帆.
Head Fusion:一种提高语音情绪识别的准确性和鲁棒性的方法
Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition
计算机科学, 2022, 49(7): 132-141. https://doi.org/10.11896/jsjkx.210100085
[14] 孟月波, 穆思蓉, 刘光辉, 徐胜军, 韩九强.
基于向量注意力机制GoogLeNet-GMP的行人重识别方法
Person Re-identification Method Based on GoogLeNet-GMP Based on Vector Attention Mechanism
计算机科学, 2022, 49(7): 142-147. https://doi.org/10.11896/jsjkx.210600198
[15] 金方焱, 王秀利.
融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取
Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM
计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!