融合机器阅读理解的中文医学命名实体识别方法

doi:10.11896/jsjkx.220900226

Computer Science ›› 2023, Vol. 50 ›› Issue (9): 287-294.doi: 10.11896/jsjkx.220900226

• Artificial Intelligence • Previous Articles Next Articles

Chinese Medical Named Entity Recognition Method Incorporating Machine ReadingComprehension

LUO Yuanyuan¹, YANG Chunming^1,3, LI Bo¹, ZHANG Hui², ZHAO Xujian^1,3

1 School of Computer Science and Technology,Southwest University of Science and Technology,Mianyang,Sichuan 621000,China
2 School of Mathematics and Physics,Southwest University of Science and Technology,Mianyang,Sichuan 621000,China
3 Sichuan Big Data and Intelligent System Engineering Technology Research Center,Mianyang,Sichuan 621010,China

Received:2022-09-23 Revised:2022-12-02 Online:2023-09-15 Published:2023-09-01
About author:LUO Yuanyuan,born in 1998,postgra-duate,is a member of China Computer Federation.Her main research interests include knowledge graphs and natural language processing.
YANG Chunming,born in 1980,asso-ciate professor,is a member of China Computer Federation.His main research interests include nature language processing and machine learning.
Supported by:
Key R & D Project of Science & Technology Department of Sichuan Province(2021YFG0031) and Scientific and Technological Achievements Transformation Project of Sichuan Provincial Scientific Research Institute(22YSZH0021).

Abstract

Abstract: Medical named entity recognition is the key to automatically build a large-scale medical knowledge base.However,medical entities are often nested,and it can not be recognized by the sequence labeling method.This paper proposes a Chinese medical named entity recognition method based on reading comprehension framework.It models the nested named entity recognition problem as a machine reading problem,uses BERT to establish the connection between the reading comprehension problem and medical text,and introduces a multi-head attention mechanism to strengthen the semantic connection between the problem and nested named entity,and finally uses two classifiers to predict the beginning and end positions of entities.This method achieves the best results with an F1-score of 67.65% when compared with the current five mainstream methods.Compared with the most classical BiLSTM-CRF,the F1-score improves by 7.17%,and the nested “symptom” entities increase by 16.81%.

Key words: Named entity recognition, Chinese medical, Nested entities, Machine reading comprehension, Multi-head attention mechanism

CLC Number:

TP391.1

LUO Yuanyuan, YANG Chunming, LI Bo, ZHANG Hui, ZHAO Xujian. Chinese Medical Named Entity Recognition Method Incorporating Machine ReadingComprehension[J].Computer Science, 2023, 50(9): 287-294.

References

[1]CUI Y,CHE W,LIU T,et al.Revisiting Pre-Trained Models for Chinese Natural Language Processing[C]//Empirical Methods in Natural Language Processing.2020.657-668.
[2]VASWANI A,SHAZEER N,PARMAR N,et al.Attention is allyou need[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems.2017:6000-6010.
[3]MORWAL S,JAHAN N,CHOPRA D.Named entity recogni-tion using hidden Markov model [J].International Journal on Natural Language Computing,2012,4(1):15-23.
[4]JU Z,WANG J,ZHU F.Named entity recognition from biome-dical text using SVM[C]//2011 5th International Conference on Bioinformatics and Biomedical Engineering.IEEE,2011:1-4.
[5]SONG S,ZHANG N,HUANG H.Named entity recognitionbased on conditional random fields[J].Cluster Computing,2019,22(3):5195-5206.
[6]GUI T,MA R,ZHANG Q,et al.CNN-Based Chinese NER with Lexicon Rethinking[C]//International Joint Conference on Artificial Intelligence.Macao,China,2019:4982-4988.
[7]CHOWDHURY S,DONG X,QIAN L,et al.A multitask bi-directional RNN model for named entity recognition on Chinese electronic medical records[J].BMC Bioinformatics,2018,19(17):75-84.
[8]OUYANG E,LI Y,JIN L,et al.Exploring n-gram characterpresentation in bidirectional RNN-CRF for Chinese clinical named entity recognition[C]//CEUR Workshop Proceedings.2017:37-42.
[9]XU K,ZHOU Z,HAO T,et al.A bidirectional LSTM and conditional random fields approach to medical named entity recognition[C]//International Conference on Advanced Intelligent Systems and Informatics.Cham:Springer,2017:355-365.
[10]HUANG Z,XU W,YU K.Bidirectional LSTM-CRF models for sequence tagging[J].arXiv:1508.01991,2015.
[11]TANG B,WANG X,YAN J,et al.Entity recognition in Chinese clinical text using attention-based CNN-LSTM-CRF[J].BMC Medical Informatics and Decision Making,2019,19(3):89-97.
[12]MIKOLOV T,CHEN K,CORRADO G,et al.Efficient estimation of word representations in vector space[J].arXiv:1301.3781,2013.
[13]DEVLIN J,CHANG M W,LEE K,et al.Bert:Pre-training of deep bidirectional transformers for language understanding[J].arXiv:1810.04805,2018.
[14]DAI Z,WANG X,NI P,et al.Named entity recognition usingBERT BiLSTM CRF for Chinese electronic health records[C]//2019 12th International Congresson Image and Signal Proces-sing,Biomedical Engineering and Informatics(CISP-BMEI).2019:1-5.
[15]LI X,ZHANG H,ZHOU X H.Chinese clinical named entityrecognition with variant neural structures based on BERT methods[J].Journal of Biomedical Informatics,2020,107:103422.
[16]JU M,MIWA M,ANANIADOU S.A neural layered model for nested named entity recognition[C]//Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.New Orleans,Louisiana,2018:1446-1459.
[17]XU H,LIU H,JIA Q,et al.A nested named entity recognitionmethod for traditional Chinese medicine records[C]//International Conference on Artificial Intelligence and Security.Cham:Springer,2021:488-497.
[18]ZHENG C,CAI Y,XU J,et al.A Boundary-aware Neural Model for Nested Named Entity Recognition[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing(EMNLP-IJCNLP).2019:357-366.
[19]SUN L,SUN Y,JI F,et al.Joint Learning of Token Context and Span Feature for Span-Based Nested NER[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2020,28:2720-2730.
[20]MARINHO Z,MENDES A,MIRANDA S,et al.Hierarchicalnested named entity recognition[C]//Proceedings of the 2nd Clinical Natural Language Processing Workshop.2019:28-34.
[21]WANG B,LU W,WANG Y,et al.A Neural Transition-basedModel for Nested Mention Recognition[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.2018:1011-1017.
[22]LU W,ROTH D.Joint mention extraction and classificationwith mention hypergraphs[C]//Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing.Lisbon,Portugal,2015:857-867.
[23]KATIYAR A,CARDIE C.Nested Named Entity RecognitionRevisited[C]//Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies,Volume 1(Long Papers).2018:861-871.
[24]LEVY O,SEO M,CHOI E,et al.Zero-shot relation extraction via reading comprehension[J].arXiv:1706.04115,2017.
[25]LIU J,CHEN Y,LIU K,et al.Event extraction as machinereading comprehension[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing.2020:1641-1651.
[26]LIU S,ZHANG X,ZHANG S,et al.Neural machine reading comprehension:Methods and trends[J].Applied Sciences,2019,9(18):3698.
[27]CAO J,ZHOU X,XIONG W,et al.Electronic Medical Record Entity Recognition via Machine Reading Comprehension and Biaffine[J].Discrete Dynamics in Nature and Society,2021,2021(9):16408371-1-16408371-8.
[28]CHIANG Y L,LIN C H,SUNG C L,et al.Nested Named Entity Recognition for Chinese Electronic Health Records with QA-based Sequence Labeling[C]//Proceedings of the 33rd Confe-rence on Computational Linguistics and Speech Processing.2021:18-25.
[29]LI X,FENG J,MENG Y,et al.A unified MRC framework fornamed entity recognition[J].arXiv:1910.11476,2019.
[30]YANG P,CONG X,SUN Z,et al.Enhanced Language Representation with Label Knowledge for Span Extraction[J].arXiv:2111.00884,2021.
[31]MNIH V,HEESS N,GRAVES A.Recurrent models of visual attention[J].Advances in Neural Information Processing Systems,2014,27:2204-2212.

Related Articles 15

[1]	YANG Zhizhuo, XU Lingling, Zhang Hu, LI Ru. Answer Extraction Method for Reading Comprehension Based on Frame Semantics and GraphStructure [J]. Computer Science, 2023, 50(8): 170-176.
[2]	GAO Xiang, TANG Jiqiang, ZHU Junwu, LIANG Mingxuan, LI Yang. Study on Named Entity Recognition Method Based on Knowledge Graph Enhancement [J]. Computer Science, 2023, 50(6A): 220700153-6.
[3]	CUI Lin, CUI Chenlu, LIU Zhengwei, XUE Kai. Speech Emotion Recognition Based on Improved MFCC and Parallel Hybrid Model [J]. Computer Science, 2023, 50(6A): 220800211-7.
[4]	GAO Xiang, WANG Shi, ZHU Junwu, LIANG Mingxuan, LI Yang, JIAO Zhixiang. Overview of Named Entity Recognition Tasks [J]. Computer Science, 2023, 50(6A): 220200119-8.
[5]	HUANG Jiange, JIA Zhen, ZHANG Fan, LI Tianrui. Chinese Medical Named Entity Recognition Based on Multi-feature Embedding [J]. Computer Science, 2023, 50(6): 243-250.
[6]	LIU Pan, GUO Yanming, LEI Jun, LAO Mingrui, LI Guohui. Study on Chinese Named Entity Extraction Rules Based on Boundary Location and Correction [J]. Computer Science, 2023, 50(3): 276-281.
[7]	LIU Luping, ZHOU Xin, CHEN Junjun, He Xiaohai, QING Linbo, WANG Meiling. Event Extraction Method Based on Conversational Machine Reading Comprehension Model [J]. Computer Science, 2023, 50(2): 275-284.
[8]	ZHANG Rujia, DAI Lu, GUO Peng, WANG Bang. Chinese Nested Named Entity Recognition Algorithm Based on Segmentation Attention andBoundary-aware [J]. Computer Science, 2023, 50(1): 213-220.
[9]	DU Xiao-ming, YUAN Qing-bo, YANG Fan, YAO Yi, JIANG Xiang. Construction of Named Entity Recognition Corpus in Field of Military Command and Control Support [J]. Computer Science, 2022, 49(6A): 133-139.
[10]	WEI Ru-ming, CHEN Ruo-yu, LI Han, LIU Xu-hong. Analysis of Technology Trends Based on Deep Learning and Text Measurement [J]. Computer Science, 2022, 49(11A): 211100119-6.
[11]	LIU Kai, ZHANG Hong-jun, CHEN Fei-qiong. Name Entity Recognition for Military Based on Domain Adaptive Embedding [J]. Computer Science, 2022, 49(1): 292-297.
[12]	XIAO Ding, ZHANG Yu-fan, JI Hou-ye. Electricity Theft Detection Based on Multi-head Attention Mechanism [J]. Computer Science, 2022, 49(1): 140-145.
[13]	QIU Jia-zuo, XIONG De-yi. Frontiers in Neural Question Generation:A Literature Review [J]. Computer Science, 2021, 48(6): 159-167.
[14]	DONG Zhe, SHAO Ruo-qi, CHEN Yu-liang, ZHAI Wei-feng. Named Entity Recognition in Food Field Based on BERT and Adversarial Training [J]. Computer Science, 2021, 48(5): 247-253.
[15]	ZHOU Xiao-jin, XU Chen-ming, RUAN Tong. Multi-granularity Medical Entity Recognition for Chinese Electronic Medical Records [J]. Computer Science, 2021, 48(4): 237-242.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Chinese Medical Named Entity Recognition Method Incorporating Machine ReadingComprehension

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0