基于预训练模型和双向二维卷积的命名实体识别算法

doi:10.11896/jsjkx.240700143

Abstract

Abstract: A named entity recognition algorithm BAM-TDNN based on bidirectional two-dimensional convolution and pre-training model is proposed to address the problem of semantic information weakening layer by layer when processing nested structures in named entity recognition.This algorithm first uses four word embedding strategies,namely BERT,distance,locality,and attention embedding,to extract semantic features at different levels within a sentence,and converts semantic features at multiple levels into two-dimensional semantic representations,better capturing semantic information between nested structures.Secondly,the Bi TDNN model is used to learn the long-range semantic dependencies of entities in sentences,expand the receptive field of span representation,provide more accurate semantic information between nested entities,and better understand the semantic associations between nested entities.Through evaluation on four public datasets,experimental results show that the proposed named entity recognition algorithm has achieved good performance on multiple entity recognition datasets.The accuracy,recall,and F1 value of BAM-TDNN on the ACE2005 dataset is 86.83%,87.93%,and 86.83%,respectively.The accuracy,recall,and F1 value on the GENIA dataset is 86.52%,82.37%,and 84.36%,respectively.The accuracy,recall,and F1 value on the CoNLL2003 dataset is 92.24%,93.72%,and 91.97%,respectively.

Key words: Named entity recognition, Four-word embedding strategy, BERT, Bi-TDNN

CLC Number:

TP391

LIN Nan, LIU Zhihui, YANG Cong. Named Entity Recognition Algorithm Based on Pre-training Model and Bidirectional TwoDimensional Convolution[J].Computer Science, 2025, 52(6A): 240700143-6.

References

[1]OZCELIK O,TORAMAN C.Named entity recognition in Turkish:A comparative study with detailed error analysis [J].Information Processing & Management,2022,59(6):103065.
[2]LIU X F,TAN K W,DONG S B.Multi-granularity sequential neural network for document-level biomedical relation extraction.Information Processing & Management [J].Information Processing & Management,2021,58(6):102718.
[3]XIA Y,LAN M J,LOU J Y,et al.Iterative rule-guided reasoning over sparse knowledge graphs with deep reinforcement learning [J].Information Processing & Management,2022,59(5):103040.
[4]DINARELLI M,ROSSET S.Models cascade for tree-structurednamed entity detection [C]//Proceedings of 5th International Joint Conference on Natural Language Processing.2011:1269-1278.
[5]JU M Z,MIWA M,ANANIADOU S.A neural layered model for nested named entity recognition [C]//Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2018:1446-1459.
[6]FISHER J,VLACHOS A.Merge and label:A novel neural network architecture for nested NER [C]//Proceedings of the57th Annual Meeting of the Association for Computational Linguistics.2019:5840-5850.
[7]STRAKOVA J,STRAKA M,HAJIC J.Neural architectures for nested NER through linearization[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.2019:5326-5331.
[8]SOHRAB M G,MIWA M.Deep exhaustive model for nestednamed entity recognition [C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.2018:2843-2849.
[9]KATIYAR A,CARDIE C.Nested named entity recognition revisited[C]//Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2018:861-871.
[10]YU J T,BOHNET B,POESIO M.Named entity recognition asdependency parsing [C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:6470-6476.
[11]LI J Y,FEI H,LIU J,et al.Unified named entity recognition as word-word relation classification [C]//Proceedings of the AAAI Conference on Artificial Intelligence.2022:10965-10973.
[12]WANG J,LI D,CHEN G,et al.Pyramid:A layered model for nested named entity recognition [C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:5918-5928.
[13]YANG S L,TU K W.Bottom-up constituency parsing and nested named entity recognition with pointer networks [C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2022:2403-2416.
[14]OUCHI H,SUZUKI J,KOBAYASHI S,et al.Instance-basedlearning of span representations:A case study through named entity recognition [EB/OL].(2020-04-29)[2020-07-10].arXiv:2004.14514.
[15]SUN L,SUN Y X,JI F,et al.Joint learning of token context and span feature for span-based nested NER[C]//IEEE/ACM Transactions on Audio,Speech,and Language Processing.2020:2720-2730.
[16]LU W,ROTH D.Joint mention extraction and classificationwith mention hypergraphs [C]//Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing.2015:857-867.
[17]MUIS A O,LU W.Labeling gaps between words:Recognizing overlapping mentions with mention separators [C]//Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing.2017:2608-2618.
[18]ZHANG Y,ZHOU H Q,LI Z H.Fast and accurate neural CRF constituency parsing [C]//Proceedings of the Twenty-ninth International Conference on International Joint Conferences on Artificial Intelligence.2021:4046-4053.
[19]FU Y,TAN C Q,CHEN M S,et al.Nested named entity recognition with partially-observed treecrfs [C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:12839-12847.
[20]LOU C,YWANG S L,TU K W.Nested named entity recognition as latent lexicalized constituency parsing [C]//Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics.2022:6183-6198.
[21]GENG R,CHEN Y,HUANG R,et al.Planarized sentence representation for nested named entity recognition[C]//Information Processing and Management.2023,60:103352.
[22]LIU R B,WEI J,JIA C Y,et al.Modulating language modelswith emotions [C]//Findings of the Association for Computational Linguistics:ACL(IJCNLP 2021).2021:4332-4339.
[23]WALKER C,STRASSEL S,MEDERO J,et al.ACE 2005 multilingual training corpus[J].Linguistic Data Consortium,2006,57:45.
[24]KIM J,OHTA T,TATEISI Y,et al.GENIA corpus－a semantically annotated corpus for bio-textmining[J].Bioinformatics,2003,19(1):180-182.
[25]SANG E,DE M.Introduction to the CoNLL-2003 shared task:Language-independent named entity recognition [C]//Proceedings of the Seventh Conference on Natural Language Learning at HLT(NAACL 2003).2003:142-147.
[26]ZHANG Y,YANG J.Chinese NER using lattice LSTM [C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics.2018:1554-1564.
[27]SHEN Y L,MA X Y,TAN Z Q,et al.Locate and label:A two-stage identifier for nested named entity recognition [EB/OL].(2021-01-01)[2024-03-01].arXiv:2105.06804.

Related Articles 15

[1]	HU Caishun. Study on Named Entity Recognition Algorithms in Audit Domain Based on Large LanguageModels [J]. Computer Science, 2025, 52(6A): 240700190-4.
[2]	ZHENG Xinxin, CHEN Fan, SUN Baodan, GONG Jianguang, JIANG Junhui. Question Answering System for Soybean Planting Management Based on Knowledge Graph [J]. Computer Science, 2025, 52(6A): 240500025-8.
[3]	LI Daicheng, LI Han, LIU Zheyu, GONG Shiheng. MacBERT Based Chinese Named Entity Recognition Fusion with Dependent Syntactic Information and Multi-view Lexical Information [J]. Computer Science, 2025, 52(6A): 240600121-8.
[4]	PAN Jian, WU Zhiwei, LI Yanjun. CGR-BERT-ZESHEL:Zero-shot Entity Linking Model with Chinese Features [J]. Computer Science, 2025, 52(4): 262-270.
[5]	HUANG Wei, SHEN Yaodi, CHEN Songling, FU Xiangling. CFGT:A Lexicon-based Chinese Address Element Parsing Model [J]. Computer Science, 2024, 51(9): 233-241.
[6]	TANG Ruiqi, XIAO Ting, CHI Ziqiu, WANG Zhe. Few-shot Image Classification Based on Pseudo-label Dependence Enhancement and NoiseInterferenceReduction [J]. Computer Science, 2024, 51(8): 152-159.
[7]	GUO Zhiqiang, GUAN Donghai, YUAN Weiwei. Word-Character Model with Low Lexical Information Loss for Chinese NER [J]. Computer Science, 2024, 51(8): 272-280.
[8]	YANG Binxia, LUO Xudong, SUN Kaili. Recent Progress on Machine Translation Based on Pre-trained Language Models [J]. Computer Science, 2024, 51(6A): 230700112-8.
[9]	LI Minzhe, YIN Jibin. TCM Named Entity Recognition Model Combining BERT Model and Lexical Enhancement [J]. Computer Science, 2024, 51(6A): 230900030-6.
[10]	YIN Baosheng, ZHOU Peng. Chinese Medical Named Entity Recognition with Label Knowledge [J]. Computer Science, 2024, 51(6A): 230500203-7.
[11]	JIANG Haoda, ZHAO Chunlei, CHEN Han, WANG Chundong. Construction Method of Domain Sentiment Lexicon Based on Improved TF-IDF and BERT [J]. Computer Science, 2024, 51(6A): 230800011-9.
[12]	YANG Junzhe, SONG Ying, CHEN Yifei. Text Emotional Analysis Model Fusing Theme Characteristics [J]. Computer Science, 2024, 51(6A): 230600111-8.
[13]	LAI Xin, LI Sining, LIANG Changsheng, ZHANG Hengyan. Ontology-driven Study on Information Structuring of Aeronautical Information Tables [J]. Computer Science, 2024, 51(6A): 230800150-7.
[14]	CHEN Bingting, ZOU Weiqin, CAI Biyu, LIU Wenjie. Bug Report Severity Prediction Based on Fine-tuned Embedding Model with Domain Knowledge [J]. Computer Science, 2024, 51(6A): 230400068-7.
[15]	MENG Xiangfu, REN Quanying, YANG Dongshen, LI Keqian, YAO Keyu, ZHU Yan. Literature Classification of Individual Reports of Adverse Drug Reactions Based on BERT and CNN [J]. Computer Science, 2024, 51(6A): 230400049-6.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Named Entity Recognition Algorithm Based on Pre-training Model and Bidirectional TwoDimensional Convolution

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0