Computer Science ›› 2023, Vol. 50 ›› Issue (6A): 220200119-8.doi: 10.11896/jsjkx.220200119

• Artificial Intelligence • Previous Articles     Next Articles

Overview of Named Entity Recognition Tasks

GAO Xiang1,2, WANG Shi2, ZHU Junwu1, LIANG Mingxuan1,2, LI Yang1,2, JIAO Zhixiang1,2   

  1. 1 College of Information Engineering,Yangzhou University,Yangzhou,Jiangsu 225000,China;
    2 Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China
  • Online:2023-06-10 Published:2023-06-12
  • About author:GAO Xiang,born in 1996,postgraduate.His main research interests is natural language processing named entity re-cognition. WANG Shi,born in 1981,Ph.D,asso-ciate researcher,is a member of China Computer Federation.His main research interests include natural language processing semantic analysis and knowledge graph.
  • Supported by:
    National Natural Science Foundation of China(61702234),National 242 Information Security Program(2021A008),Beijing NOVA Program(Z191100001119014),National Key Research and Development Program of China(2017YFB1002300,2017YFC1700300).

Abstract: Named entity recognition,as a very basic task in natural language processing,lays the foundation for the efficient completion of many other downstream tasks.Its purpose is to identify the corresponding entity from a text described in natural language and label its type,so as to make preparations for data labeling for other related tasks.This paper first introduces the deve-lopment process of named entity recognition tasks and the key methods used in related research in the corresponding context,including the rule-based and dictionary-based methods used in the early days of the birth,and the statistics and deep learning derived from the later development.Secondly,it summarizes some of the more mainstream research directions in this field,including named entity recognition under low-resource conditions,nested named entity recognition,and cross-language named entity recognition.These directions are the hot research trends of this task recently,including the most popular research method of this task at present.Finally,the relevant experience in the research is summarized,and the future development direction and difficulties of the task are prospected.

Key words: Named entity recognition, Nested named entity recognition, Deep learning, Low-resource, Cross-language

CLC Number: 

  • TP391
[1]HAHNE A,FRIEDERICI A D.Electrophysiological evidencefor two steps in syntactic analysis:Early automatic and late controlled processes[J].Journal of Cognitive Neuroscience,1999,11(2):194-205.
[2]SHANG W,HUANG H,ZHU H,et al.A novel feature selection algorithm for text categorization[J].Expert Systems with Applications,2007,33(1):1-5.
[3]YANG Y,WU Y N,LI J.Text similarity calculation based on potential feature words[J].Computer Engineering and Design,2011,32(2):572-575.
[4]BABYCH B,HARTLEY A.Improving machine translationquality with automatic named entity recognition[C]//Procee-dings of the 7th International EAMT Workshop on MT and Other Language Technology Tools,Improving MT Through Other Language Technology Tools,Resource and Tools for Building MT at EACL 2003.2003.
[5]SORICUT R,BRILL E.Automatic question answering:Beyond the factoid[C]//Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics:HLT-NAACL 2004.2004:57-64.
[6]ZHANG J,XIE J,HOU W,et al.Mapping the knowledge structure of research on patient adherence:knowledge domain visuali-zation based co-word analysis and social network analysis[J].PloS One,2012,7(4):e34497.
[7]RAU L F.Extracting company names from text[C]//Procee-dings The Seventh IEEE Conference on Artificial Intelligence Application.IEEE Computer Society,1991:29,30,31,32-29,30,31,32.
[8]OUYANG X,CHEN S,ZHAO H,et al.A multi-Cross Matching Network for Chinese Named Entity Linking in Short Text[C]//Journal of Physics:Conference Series.IOP Publishing,2019:012069.
[9]XIA C,ZHANG C,YANG T,et al.Multi-grained named entity recognition[C]//57th Annual Meeting of the Association for Computational Linguistics,ACL 2019.Association for Computational Linguistics(ACL).2020:1430-1440.
[10]NI J,FLORIAN R.Improving Multilingual Named Entity Re-cognition with Wikipedia Entity Type Mapping[C]//Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing.2016:1275-1284.
[11]ZHANG X Y,WANG T,CHEN H W.Research on Named Entity Recognition[J].Computer Science,2005(4):44-48.
[12]CHEN S D,OUYANG X Y.Overview of named entity recognition technology[J].Radio Communication Technology,2020,46(3):251-260.
[13]SHEN W,WANG J,LUO P,et al.Linden:linking named entities with knowledge base via semantic knowledge[C]//Procee-dings of the 21st International Cconference on World Wide Web.2012:449-458.
[14]HUNG J C,CHANG J W.Multi-level transfer learning for improving the performance of deep neural networks:Theory and practice from the tasks of facial emotion recognition and named entity recognition[J].Applied Soft Computing,2021,109:107491.
[15]PETASIS G,VICHOT F,WOLINSKI F,et al.Using machine learning to maintain rule-based named-entity recognition and classification systems[C]//Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics.2001:426-433.
[16]CHITICARIU L,KRISHNAMURTHY R,LI Y,et al.Domain adaptation of rule-based annotators for named-entity recognition tasks[C]//Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing.2010:1002-1012.
[17]LI W,MCCALLUM A.Rapid development of Hindi named entity recognition using conditional random fields and feature induction[J].ACM Transactions on Asian Language Information Processing(TALIP),2003,2(3):290-294.
[18]KONKOL M,KONOPÍK M.CRF-based Czech named entityrecognizer and consolidation of Czech NER research[C]//International Conference on Text,Speech and Dialogue.Berlin:Springer,2013:153-160.
[19]XU Z,QIAN X,ZHANG Y,et al.CRF-based hybrid model for word segmentation,NER and even POS tagging[C]//Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing.2008.
[20]LIN B Y,XU F F,LUO Z,et al.Multi-channel bilstm-crf model for emerging named entity recognition in social media[C]//Proceedings of the 3rd Workshop on Noisy User-generated Text.2017:160-165.
[21]MIKHEEV A,MOENS M,GROVER C.Named entity recognition without gazetteers[C]//Ninth Conference of the European Chapter of the Association for Computational Linguistics.1999:1-8.
[22]BORTHWICK A,STERLING J,AGICHTEIN E,et al.Exploiting diverse knowledge sources via maximum entropy in named entity recognition[C]//Sixth Workshop on Very Large Corpora.1998.
[23]TSAI R T H,WU S H,LEE C W,et al.Mencius:A Chinese named entity recognizer using the maximum entropy-based hybrid model[C]//International Journal of Computational Linguistics & Chinese Language Processing.2004:65-82.
[24]KROGH A,LARSSON B,VON HEIJNE G,et al.Predicting transmembrane protein topology with a hidden Markov model:application to complete genomes[J].Journal of Molecular Biology,2001,305(3):567-580.
[25]BIKEL,DANIEL M.Nymble:a High-Performance LearningName-finder[C]//Fifth Conference on Applied Natural Language Processing.1997:194-201.
[26]LI Y,SHETTY P,LIU L,et al.BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Re-cognition[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics.2021.
[27]LIU F,ZHAO J,LV B,et al.Product named entity recognition based on hierarchical hidden Markov model[C]//Proceedings of the Fourth SIGHAN Workshop on Chinese Language Proces-sing.2005.
[28]PENG N,DREDZE M.Improving named entity recognition for Chinese social media with word segmentation representation learning[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics.2016:149-155.
[29]LIU T,YAO J G,LINC Y.Towards improving neural named entity recognition with gazetteers[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.2019:5301-5307.
[30]LAMPLE G,BALLESTEROS M,SUBRAMANIAN S,et al.Neural Architectures for Named Entity Recognition[C]//HLT-NAACL.2016.
[31]DONG X,QIAN L,GUAN Y,et al.A multiclass classification method based on deep learning for named entity recognition in electronic medical records[C]//2016 New York Scientific Data Summit(NYSDS).IEEE,2016:1-10.
[32]SHAO Y,HARDMEIER C,NIVRE J.Multilingual named entity recognition using hybrid neural networks[C]//The Sixth Swedish Language Technology Conference(SLTC).2016.
[33]MNIH V,HEESS N,GRAVES A.Recurrent models of visual attention[C]//Advances in Neural Information Processing Systems.2014:2204-2212.
[34]BAHDANAU D,CHO K,BENGIO Y.Neural Machine Translation by Jointly Learning to Align and Translate[J].arXiv:1409.0473,2014.
[35]LIAO F,MA L,PEI J,et al.Combined Self-Attention Mechanism for Chinese Named Entity Recognition in Military[J].Future Internet,2019,11(8):180.
[36]WU C,LUO G,GUO C,et al.An attention-based multi-taskmodel for named entity recognition and intent analysis of Chinese online medical questions[J].Journal of Biomedical Informatics,2020,108:103511.
[37]EGAN T M,YANG B,BARTLETT K R.The effects of organizational learning culture and job satisfaction on motivation to transfer learning and turnover intention[J].Human Resource Development Quarterly,2004,15(3):279-301.
[38]LIN Y,YANG S,STOYANOV V,et al.A multi-lingual multi-task architecture for low-resource sequence labeling[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Long Papers).2018:799-809.
[39]QU L,FERRARO G,ZHOU L,et al.Named Entity Recognition for Novel Types by Transfer Learning[C]//EMNLP.2016.
[40]FRANCIS S,VAN LANDEGHEM J,MOENSM F.Transferlearning for named entity recognition in financial and biomedical documents[J].Information,2019,10(8):248.
[41]KRUENGKRAI C,NGUYEN T H,ALJUNIED S M,et al.Improving Low-Resource Named Entity Recognition using Joint Sentence and Token Labeling[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:5898-5905.
[42]RUDER S.An overview of multi-task learning in deep neuralnetworks[J].arXiv:1706.05098,2017.
[43]DEVLIN J,CHANG M W,LEE K,et al.Bert:Pre-training of deep bidirectional transformers for language understanding[J].arXiv:1810.04805,2018.
[44]YU H,MAO X L,CHI Z,et al.A Robust and Domain-Adaptive Approach for Low-Resource Named Entity Recognition[C]//2020 IEEE International Conference on Knowledge Graph(ICKG).IEEE,2020:297-304.
[45]YASEEN U,LANGER S.Data Augmentation for Low-Re-source Named Entity Recognition Using Backtranslation[J].arXiv:2108.11703,2021.
[46]ZHU W,LIU J,XU J,et al.Improving Low-Resource Named Entity Recognition via Label-Aware Data Augmentation and Curriculum Denoising[C]//China National Conference on Chinese Computational Linguistics.Cham:Springer,2021:355-370.
[47]WANG Y,LI Y,TONG H,et al.HIT:nested named entity re-cognition via head-tail pair and token interaction[C]//Procee-dings of the 2020 Conference on Empirical Methods in Natural Language Processing(EMNLP).2020:6027-6036.
[48]WANG J,SHOU L,CHEN K,et al.Pyramid:A layered model for nested named entity recognition[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:5918-5928.
[49]LONG X,NIU S,LI Y.Hierarchical Region Learning for Nested Named Entity Recognition[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing:Findings.2020:4788-4793.
[50]XU Y,HUANG H,FENG C,et al.A Supervised Multi-Head Self-Attention Network for Nested Named Entity Recognition[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:14185-14193.
[51]FU Y,TAN C,CHEN M,et al.Nested Named Entity Recognition with Partially-Observed TreeCRFs[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:12839-12847.
[52]MUELLER D,ANDREWS N,DREDZE M.Sources of Transfer in Multilingual Named Entity Recognition[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.2020:8093-8104.
[53]XIE J,YANG Z,NEUBIG G,et al.Neural Cross-Lingual Named Entity Recognition with Minimal Resources[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.2018:369-379.
[54]YU X,MAYHEW S,SAMMONS M,et al.On the Strength of Character Language Models for Multilingual Named Entity Re-cognition[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.2018:3073-3077.
[55]MAYHEW S,TSAI C T,ROTH D.Cheap translation for cross-lingual named entity recognition[C]//Proceedings of the 2017 Conference on Empirical Methods in Natural Language Proces-sing.2017:2536-2545.
[56]HAMDI A,LINHARES PONTES E,BOROS E,et al.A Multilingual Dataset for Named Entity Recognition,Entity Linking and Stance Detection in Historical Newspapers[C]//Procee-dings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval.2021:2328-2334.
[57]SANG E F T K,DE MEULDER F.Introduction to the CoNLL-2003 Shared Task:Language-Independent Named Entity Recognition[J].Development,1837,922:1341.
[58]PRADHAN S,MOSCHITTI A,XUE N,et al.Towards robust linguistic analysis using ontonotes[C]//Proceedings of the Se-venteenth Conference on Computational Natural Language Learning.2013:143-152.
[59]ZHANG Y,YANG J.Chinese NER Using Lattice LSTM[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Long Papers).2018:1554-1564.
[60]LEVOWG A.The third international Chinese language proces-sing bakeoff:Word segmentation and named entity recognition[C]//Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing.2006:108-117.
[1] WANG Yu, WANG Zuchao, PAN Rui. Survey of DGA Domain Name Detection Based on Character Feature [J]. Computer Science, 2023, 50(8): 251-259.
[2] ZHANG Yian, YANG Ying, REN Gang, WANG Gang. Study on Multimodal Online Reviews Helpfulness Prediction Based on Attention Mechanism [J]. Computer Science, 2023, 50(8): 37-44.
[3] SONG Xinyang, YAN Zhiyuan, SUN Muyi, DAI Linlin, LI Qi, SUN Zhenan. Review of Talking Face Generation [J]. Computer Science, 2023, 50(8): 68-78.
[4] WANG Xu, WU Yanxia, ZHANG Xue, HONG Ruize, LI Guangsheng. Survey of Rotating Object Detection Research in Computer Vision [J]. Computer Science, 2023, 50(8): 79-92.
[5] ZHOU Ziyi, XIONG Hailing. Image Captioning Optimization Strategy Based on Deep Learning [J]. Computer Science, 2023, 50(8): 99-110.
[6] ZHANG Xiao, DONG Hongbin. Lightweight Multi-view Stereo Integrating Coarse Cost Volume and Bilateral Grid [J]. Computer Science, 2023, 50(8): 125-132.
[7] LI Kun, GUO Wei, ZHANG Fan, DU Jiayu, YANG Meiyue. Adversarial Malware Generation Method Based on Genetic Algorithm [J]. Computer Science, 2023, 50(7): 325-331.
[8] WANG Mingxia, XIONG Yun. Disease Diagnosis Prediction Algorithm Based on Contrastive Learning [J]. Computer Science, 2023, 50(7): 46-52.
[9] SHEN Zhehui, WANG Kailai, KONG Xiangjie. Exploring Station Spatio-Temporal Mobility Pattern:A Short and Long-term Traffic Prediction Framework [J]. Computer Science, 2023, 50(7): 98-106.
[10] HUO Weile, JING Tao, REN Shuang. Review of 3D Object Detection for Autonomous Driving [J]. Computer Science, 2023, 50(7): 107-118.
[11] ZHOU Bo, JIANG Peifeng, DUAN Chang, LUO Yuetong. Study on Single Background Object Detection Oriented Improved-RetinaNet Model and Its Application [J]. Computer Science, 2023, 50(7): 137-142.
[12] MAO Huihui, ZHAO Xiaole, DU Shengdong, TENG Fei, LI Tianrui. Short-term Subway Passenger Flow Forecasting Based on Graphical Embedding of Temporal Knowledge [J]. Computer Science, 2023, 50(7): 213-220.
[13] LI Yuqiang, LI Linfeng, ZHU Hao, HOU Mengshu. Deep Learning-based Algorithm for Active IPv6 Address Prediction [J]. Computer Science, 2023, 50(7): 261-269.
[14] LIANG Mingxuan, WANG Shi, ZHU Junwu, LI Yang, GAO Xiang, JIAO Zhixiang. Survey of Knowledge-enhanced Natural Language Generation Research [J]. Computer Science, 2023, 50(6A): 220200120-8.
[15] WANG Dongli, YANG Shan, OUYANG Wanli, LI Baopu, ZHOU Yan. Explainability of Artificial Intelligence:Development and Application [J]. Computer Science, 2023, 50(6A): 220600212-7.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!