Computer Science ›› 2024, Vol. 51 ›› Issue (11A): 240200041-7.doi: 10.11896/jsjkx.240200041

• Intelligent Computing • Previous Articles     Next Articles

Multi-task Learning Model for Text Feature Enhancement in Medical Field

GUO Ruiqiang1,2,3, JIA Xiaowen1, YANG Shilong1, WEI Qianqiang1   

  1. 1 School of Computer and Cyber Security,Hebei Normal University,Shijiazhuang 050024,China
    2 Hebei Provincial Engineering Research Center for Supply Chain Big Data Analytics & Data Security,Hebei Normal University,Shijiazhuang 050024,China
    3 Hebei Provincial Key Laboratory of Network and Information Security,Shijiazhuang 050024,China
  • Online:2024-11-16 Published:2024-11-13
  • About author:GUO Ruiqiang,born in 1974,Ph.D,professor,master supervisor,is a member of CCF(No.17546M).His main research interests include database system design,data mining,big data proces-sing.
  • Supported by:
    2023 Hebei Province Talent Introduction and Intelligence Innovation Platform(606080123003).

Abstract: The recognition and standardization of medical named entities are the foundation for constructing high-quality medical knowledge graphs.This paper proposes a multi-task learning model based on text feature enhancement,aiming to address the issue of inadequate utilization of text features in existing models for medical entity recognition and standardization.The model incorporates word-level,character-level features,and contextual semantic information to enhance text representation.Through four hierarchical sub-tasks,it jointly models medical entity recognition and standardization tasks.Experiments indicate that the proposed model can learn common features for both entity recognition and entity standardization tasks,effectively improving the accuracy of learning.Satisfactory results are achieved on two datasets,NCBI and BC5CDR,with F1 scores for NER and NEN tasks 1.09%,91.02%;92.05%,92%,respectively.

Key words: Medical named entity recognition, Entity normalization, Multitask, Feature enhancement, Joint modeling

CLC Number: 

  • TP391
[1]ZHOU B Z,CAI X R,ZHANG Y,et al.MTAAL:Multi-Task Adversarial Active Learning for Medical Named Entity Recognition and Normalization[C]//Proceedings of the 35th AAAI,California.Palo Alto,AAAI Press,2021:14586-14593.
[2]LEE J,YOON W,KIM S,et al.BioBERT:a pre-trained biomedical language representation model for biomedical text mining[J].Bioinformatics,2020,36(4):1234-1240.
[3]ZHAO P,DOU Q S,TANG H L,et al.Attention AdaptiveModel with Word Information Embeding for Named Entity Re-cognition[J].Computer Engineering and Applications,2023,59(8):167-174.
[4]YANG R Y,HE Q,DU N S.Chinese Named Entity Recognition Based on Gated Multi-Feature Extractors[J].Computer Engineering and Applications,2022,58(8):117-124.
[5]ROBERT L,REZARTA L,ZHIYONG L.Dnorm:disease name normalization with pairwise learning to ran.[J].Bioinform,2013,29(22):2909-2917.
[6]LOWE D M,O'BOYLE N M,ASAYLE R.Leadmine:Diseaseidentification and concept mapping using wikipedia[C]//Proceedings of the Fifth BioCreative Challenge Evaluation Workshop.Seville,CEUR Workshop Proceedings,2015:240-246.
[7]ROBERT L,ZHIYONG L.Taggerone:joint named entity recognition and normalization with semi-markov models.[J].Bioinform,2016,32(18):2839-2846.
[8]SAHU S,ANAND A.Recurrent neural network models for disease name recognition using domain invariant features[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics,Berlin,Stroudsburg,Association for Computational Linguistics,2016:2216-2225.
[9]DEVLIN J,MINGWEI C,LEE K,TOUTANOVA K.BERT:pre-training of deep bidirectional transformers for language understanding[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics.Florence,Minneapolis,Association for Computational Linguistics,2019:4171-4186.
[10]XIONG Y,HUANG Y H,CHEN Q C,et al.TANG B Z.A joint model for medical named entity recognition and normalization[C]//Proceedings of the Iberian Languages Evaluation Forum colocated with 36th Conference of the Spanish Society for Natural Language Processing.Málaga,IberLEF@SEPLN,2020:499-504.
[11]ZHOU H,NING S,LIU Z,et al.Knowledge-enhanced biomedi-cal named entity recognition and normalization:application to proteins and genes.[J].Bioinform,2020,21(1):35-50.
[12]LOU Y X,ZHANG Y,QIAN T,et al.A transition-based joint model for disease named entity recognition and normalization.[J].Bioinform,2017,33(15):2363-2371.
[13]EMMA S,PATRICK V,DAVID B,ANDREW M.Fast and accurate entity recognition with iterated dilated convolutions[C]//Proceedings of the 25th Conference on Empirical Methods in Natural Language Processing.Copenhagen,Copenhagen,Asso-ciation for Computational Linguistics,2017:2670-2680.
[14]ZHAO Z H,YANG Z H,LUO L,et al.Disease named entity recognition from biomedical literature using a novel convolu-tional neural network.[J].BMC Medical Genomics,2017,10(5):73-82.
[15]WONJIN Y,CHAN H S,JINHYUK L,et al.Collabonet:colla-boration of deep neural networks for biomedical named entity re-cognition[J].BMC Bioinform,2019,20(10):55-65.
[16]ZHANG S D,LIU T,ZHAO S C,et al.A neural multi-task learning framework to jointly model medical named entity recognition and normalization[C]//Proceedings of the 33th AAAI.Hawaii,Honolulu,AAAI Press,2019:817-824.
[17]ZHOU B H,CAI X R,ZHANG Y,et al.An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and Normalization[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing,Online,Aug 1-8,Florence.Association for Computational Linguistics,2021:6214-6224.
[18]CHEN M X,CHEN Y P,HU Y,et al.Biomedical Named Entity Recognition Method Based on Word Meaning Enhancement[J].Computer Engineering,2023,49(10):305-312.
[19]YU X Q,WANG X,LI Z Q,et al.Biomedical Named Entity Re-cognition Based on Character Level Feature Adaptation[J].Journal of Chinese Computer Systems,2023,44(9):1876-1883.
[20]ZHOU B,CAI X,ZHANG Y,et al.MTAAL:multi-task adversarial active learning for medical named entity recognition and normalization[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:14586-14593.
[21]ZHOU P,SHI W,TIAN J,et al.Attention-based bidirectional long short-term memory networks for relation classification[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics,Berlin,Aug 7-12,Stroudsburg.Association for Computational Linguistics,2016:207-212.
[22]REZARTA I D,ROBERT L,LU Z Y.NCBI disease corpus:A resource for disease name recognition and concept normalization.[J].Journal of Biomedical Informatics,2014,47(1):1-10.
[23]JIAO L,SUN Y P,ROBIN J J,et al.Biocreative V CDR task corpus:a resource for chemical disease relation extraction.[J].Database(Oxford),2016,2016(2016):68-78.
[24]DIEDERIK P.KINGMA,JIMMY L B.Adam:A method for stochastic optimization[C]//Proceedings of the 3rd International Conference on Learning Representations,San Diego,May 7-9,Ithaca.Conference Track Proceedings,2015:602-616.
[1] YIN Baosheng, ZHOU Peng. Chinese Medical Named Entity Recognition with Label Knowledge [J]. Computer Science, 2024, 51(6A): 230500203-7.
[2] XU Hao, LI Fengrun, LU Lu. Metal Surface Defect Detection Method Based on Dual-stream YOLOv4 [J]. Computer Science, 2024, 51(4): 209-216.
[3] ZHOU Yanlin, WU Kaijun, MEI Yuan, TIAN Bin, YU Tianxiu. Dunhuang Mural Element Detection Algorithm Based on Improved Yolov8 [J]. Computer Science, 2024, 51(11A): 231000034-6.
[4] ZHANG Shunyao, LI Huawang, ZHANG Yonghe, WANG Xinyu, DING Guopeng. Image Retrieval Based on Independent Attention Mechanism [J]. Computer Science, 2023, 50(6A): 220300092-6.
[5] ZHANG Jingyuan, WANG Hongxia, HE Peisong. Multitask Transformer-based Network for Image Splicing Manipulation Detection [J]. Computer Science, 2023, 50(1): 114-122.
[6] HUANG Xiao-ling, ZHANG De-ping. Fault Diagnosis Based on Channel Splitting CLAHE and Adaptive Threshold Residual NetworkUnder Variable Operating Conditions [J]. Computer Science, 2022, 49(11A): 211100122-7.
[7] ZHAO Jia-qi, WANG Han-zheng, ZHOU Yong, ZHANG Di, ZHOU Zi-yuan. Remote Sensing Image Description Generation Method Based on Attention and Multi-scale Feature Enhancement [J]. Computer Science, 2021, 48(1): 190-196.
[8] ZHU Wei, WANG Tu-qiang, CHEN Yue-feng, HE De-feng. Object-level Edge Detection Algorithm Based on Multi-scale Residual Network [J]. Computer Science, 2020, 47(6): 144-150.
[9] HE Xia, TANG Yi-ping, WANG Li-ran, CHEN Peng, YUAN Gong-ping. Multitask Hierarchical Image Retrieval Technology Based on Faster RCNNH [J]. Computer Science, 2019, 46(3): 303-313.
[10] HE Xia, TANG Yi-ping, YUAN Gong-ping, CHEN Peng, WANG Li-ran. Study on Bayonet Recognition Engine Based on Cascade Multitask Deep Learning [J]. Computer Science, 2019, 46(1): 303-308.
[11] HUAN Ruo-hong and CHEN Yue. Method of Human Activity Recognition Based on Feature Enhancement and Decision Fusion [J]. Computer Science, 2016, 43(Z11): 151-155.
[12] . Xen Virtual Machine Scheduling Enhancement by Improving Cache Efficiency [J]. Computer Science, 2012, 39(7): 297-201.
[13] YANG Zhi-yi,HUI Yong-li,LI Shi-ning,LI Zhi-gang. Implement Method of Desktop Environment Supporting Multitask Based on Embedded System [J]. Computer Science, 2011, 38(7): 261-264.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!