计算机科学 ›› 2022, Vol. 49 ›› Issue (1): 292-297.doi: 10.11896/jsjkx.201100007

• 人工智能 • 上一篇    下一篇

基于领域适应嵌入的军事命名实体识别

刘凯1, 张宏军2, 陈飞琼1   

  1. 1 陆军工程大学研究生院 南京 210000
    2 陆军工程大学指挥控制工程学院 南京 210000
  • 收稿日期:2020-11-02 修回日期:2021-03-17 出版日期:2022-01-15 发布日期:2022-01-18
  • 通讯作者: 张宏军(jsnjzhj_lgdx@163.com)
  • 作者简介:lkljgc@163.com

Name Entity Recognition for Military Based on Domain Adaptive Embedding

LIU Kai1, ZHANG Hong-jun2, CHEN Fei-qiong1   

  1. 1 School of Graduate,Army Engineering University of PLA,Nanjing 210000,China
    2 College of Command and Control Engineering,Army Engineering University of PLA,Nanjing 210000,China
  • Received:2020-11-02 Revised:2021-03-17 Online:2022-01-15 Published:2022-01-18
  • About author:LIU Kai,born in 1996,postgraduate,His main research interests include na-tural language processing and so on.
    ZHANG Hong-jun,born in 1963,professor,Ph.D supervisor.His main research interests include military modeling & simulation and data engineering.

摘要: 为了解决单一军事领域语料不足导致的领域嵌入空间质量欠佳,使得深度学习神经网络模型识别军事命名实体精度较低的问题,文中从字词分布式表示入手,通过领域自适应方法由额外的领域引入更多有用信息帮助学习军事领域的嵌入。首先建立领域词典,将其与CRF算法结合,对收集到的通用领域语料和军事领域语料进行领域自适应分词,作为嵌入训练语料,并将词向量作为特征与字向量拼接,以丰富嵌入信息并验证分词效果;然后对训练所得的通用领域和军事领域的异构嵌入空间进行领域自适应转换,生成领域自适应嵌入,并作为基础模型BiLSTM-CRF层的输入;最后通过CoNLL-2000进行识别评价。实验结果表明,在相同模型下,输入领域适应嵌入比输入一般分词后的语料训练所得的军事领域嵌入,其模型识别的精确率(P)、召回率(R)、综合F1值(F1)分别提高了2.17%,1.04%,1.59%。

关键词: 词向量, 领域自适应, 命名实体识别, 中文分词, 字向量

Abstract: In order to solve the poor quality problem of domain embedding space caused by inadequate military corpus which makes low accuracy of applying deep neural network model to military named entity recognition,this paper introduces a domain adaptive method to help learn the embedding of military fields from more useful information of additional fields through distributed representation of words.First,we establish the domain dictionary and combine CRF algorithm to perform domain adaptive word segment with the collected general domain and military areas corpus as training corpus for embedding,and word vectors are used as features and spliced with character vectors to enrich the embedding information and to validate the effect of word segmentation.Then the domain adaptive transformation is carried out to the heterogeneous embedded space of the general domain and the military domain,and the domain adaptive embedding is generated,as the input to BiLSTM-CRF layer of base model.At last,the recognition evaluation is carried out through CoNLL-2000.The experimental results show that,under the same model,the recognition precision rate (P),recall rate (R),and integrated F1 value (F1) of the proposed method are improved by 2.17%,1.04%,and 1.59%,respectively,compared with the military field embedding trained by a corpus which is obtained from general word segmentation.

Key words: Character embedding, Chinese word segmentation, Domain adaptation, Named entity recognition, Word embedding

中图分类号: 

  • TP391.1
[1]HUANG Z H,XU W,YU K.Bidirectional LSTM-CRF modelsfor sequence tagging[EB/OL].(2015-08-09)[2020-10-01].https://arxiv.org/pdf/1508.01991.
[2]LAMPLE G,BALLESTEROS M,SUBRAMANIAN S,et al.Neural architectures for named entity recognition[C]//Procee-dings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics.2016:260-270.
[3]REI M,CRICHTON G,PYYSALO S.Attending to characters in neural sequence labeling models[C]//Proceedings of the 26th International Conference on Computational Linguistics.2016:309-318.
[4]XU K,WANG Q,LI Z Z,et al.Biomedical named entity recognition based on BiGRU network with multi-head attention mechanism[J].Computer applications and software,2020,37(5):151-232.
[5]ZHANG D,CHEN W L.Chinese Named Entity RecognitionBased on Contextualized Char Embeddings[J].Computer Scien-ce,2021,48(3):233-238.
[6]DEVLIN J,CHANG M W,LEE K,et al.Bert:pre-training of deep bidirectional transformers for language understanding[C]//Proceedings of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2019:4171-4186.
[7]JIANG W Z,GU J J,HU W X,et al.Military named entity re-cognition based on multi-models[J].Ordnance Industry Automation,2011,30(10):90-93.
[8]QIN J,CAO L,PENG H,et al.A domain feature word vector description method for military texts[J].Computer Enginee-ring,2016,42(8):160-165.
[9]ZHANG X H,CAO X W,GAO Y.Named Entity Recognition for Combat Documents Based on Deep Learning[J].Command Control & Simulation,2019,41(4):22-16.
[10]SHAN Y D,WANG H J,HUANG H,et al.Study on Named Entity Recognition Model Based on Attention Mechanism-Ta-king Military Text as Example[J].Computer Science,2019,46(z1):111-114.
[11]PAN S J,QIANG Y.A Survey on Transfer Learning[J].IEEE Transactions on Knowledge and Data Engineering,2010,22(10):1345-1359.
[12]WEISS K,KHOSHGOFTAAR T M,WANG D D.A survey of transfer learning[J].Journal of Big Data,2016,3(1):9.
[13]GUO T K.A research on Chinese Word Segmentation based on Dictionary[D].Harbin:Harbin University of Science and Technology,2010.
[14]ZHANG J.A Chinese Word Segmentation Method Based onRules[J].Computer and Modernization,2005(4):18-20.
[15]ZHAO Y Z.A Chinese word segmentation method based onword frequency statistics[J].Science and Technology,2016,26(10):283.
[16]STENETORP P,SOYER H,PYYSALO P,et al.Size(and domain)matters:Evaluating semantic word space representations for biomedical text[C]//Proceedings of the 5th International Symposium on Semantic Mining in Biomedicine.2012.
[17]ZHANG M S,CHE W X,LIU T.Combining statistical model and dictionary for domain adaption of Chinese word segmentation[J].Journal of Chinese Information Processing,2012,26(2):8-12.
[18]XUE N.Chinese word segmentation as character tagging[J].International Journal of Computational Linguistics and Chinese Language Processing,2003,8(1):28-48.
[19]XIE Z N.Research on Chinese name entity recognition algorithm[D].Hangzhou:Zhejiang University,2017.
[20]LI W K,LI W,WU Y F.Combination methods of Chinese cha-racter and word embeddings in deep learning[J].Journal of Chinese Information Processing,2017,31(6):140-146.
[21]TAN L C,ZHANG H T,SMUCKER M,et al.Lexical comparison between wikipedia and twitter corpora by using word embeddings[C]//Proceedings of ACL.2015.
[22]LIN B Y,LU W.Netural adaptation layers for cross-dominnamed entity recognition[C]//Proceedings of the 2018 Confe-rence on EMNLP.2018:2012-2022.
[23]MIKOLOV T,SUTSKEVER I,CHEN K,et al.DistributedRepresentations of Words and Phrases and their Compositiona-lity[C]//Proceedings of Neural Information Procesing Systems Foundation.2013.
[24]MIKOLOV T,CORRADO G,CHEN K,et al.Efficient Estimation of Word Representations in Vector Space[C]//Proceedings of the ICLR.2013:1-12.
[25]PETERS M,NEUMANN M,IYYER M,et al.Deep contextua-lized word representations[C]//Proceedings of NAACL-HLT.2018:2227-2237.
[1] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[2] 姜胜腾, 张亦弛, 罗鹏, 刘月玲, 曹阔, 赵海涛, 魏急波.
语义通信系统的性能度量指标分析
Analysis of Performance Metrics of Semantic Communication Systems
计算机科学, 2022, 49(7): 236-241. https://doi.org/10.11896/jsjkx.211200071
[3] 杜晓明, 袁清波, 杨帆, 姚奕, 蒋祥.
军事指控保障领域命名实体识别语料库的构建
Construction of Named Entity Recognition Corpus in Field of Military Command and Control Support
计算机科学, 2022, 49(6A): 133-139. https://doi.org/10.11896/jsjkx.210400132
[4] 韩红旗, 冉亚鑫, 张运良, 桂婕, 高雄, 易梦琳.
基于共同子空间分类学习的跨媒体检索研究
Study on Cross-media Information Retrieval Based on Common Subspace Classification Learning
计算机科学, 2022, 49(5): 33-42. https://doi.org/10.11896/jsjkx.210200157
[5] 刘硕, 王庚润, 彭建华, 李柯.
基于混合字词特征的中文短文本分类算法
Chinese Short Text Classification Algorithm Based on Hybrid Features of Characters and Words
计算机科学, 2022, 49(4): 282-287. https://doi.org/10.11896/jsjkx.210200027
[6] 杨进才, 曹元, 胡泉, 沈显君.
基于Transformer模型与关系词特征的汉语因果类复句关系自动识别
Relation Classification of Chinese Causal Compound Sentences Based on Transformer Model and Relational Word Feature
计算机科学, 2021, 48(6A): 295-298. https://doi.org/10.11896/jsjkx.200500019
[7] 刘昱彤, 李鹏, 孙云云, 胡素君.
基于深度动态联合自适应网络的图像识别方法
Image Recognition with Deep Dynamic Joint Adaptation Networks
计算机科学, 2021, 48(6): 131-137. https://doi.org/10.11896/jsjkx.210100008
[8] 董哲, 邵若琦, 陈玉梁, 翟维枫.
基于BERT和对抗训练的食品领域命名实体识别
Named Entity Recognition in Food Field Based on BERT and Adversarial Training
计算机科学, 2021, 48(5): 247-253. https://doi.org/10.11896/jsjkx.200800181
[9] 张栋, 陈文亮.
基于上下文相关字向量的中文命名实体识别
Chinese Named Entity Recognition Based on Contextualized Char Embeddings
计算机科学, 2021, 48(3): 233-238. https://doi.org/10.11896/jsjkx.191200074
[10] 余诗媛, 郭淑明, 黄瑞阳, 张建朋, 苏珂.
嵌套命名实体识别研究进展
Overview of Nested Named Entity Recognition
计算机科学, 2021, 48(11A): 1-10. https://doi.org/10.11896/jsjkx.201100165
[11] 杨青, 张亚文, 朱丽, 吴涛.
基于注意力机制和BiGRU融合的文本情感分析
Text Sentiment Analysis Based on Fusion of Attention Mechanism and BiGRU
计算机科学, 2021, 48(11): 307-311. https://doi.org/10.11896/jsjkx.201000075
[12] 张玉帅, 赵欢, 李博.
基于BERT和BiLSTM的语义槽填充
Semantic Slot Filling Based on BERT and BiLSTM
计算机科学, 2021, 48(1): 247-252. https://doi.org/10.11896/jsjkx.191200088
[13] 程婧, 刘娜娜, 闵可锐, 康昱, 王新, 周扬帆.
一种低频词词向量优化方法及其在短文本分类中的应用
Word Embedding Optimization for Low-frequency Words with Applications in Short-text Classification
计算机科学, 2020, 47(8): 255-260. https://doi.org/10.11896/jsjkx.191000163
[14] 李舟军,范宇,吴贤杰.
面向自然语言处理的预训练技术研究综述
Survey of Natural Language Processing Pre-training Techniques
计算机科学, 2020, 47(3): 162-173. https://doi.org/10.11896/jsjkx.191000167
[15] 唐国强,高大启,阮彤,叶琪,王祺.
融入语言模型和注意力机制的临床电子病历命名实体识别
Clinical Electronic Medical Record Named Entity Recognition Incorporating Language Model and Attention Mechanism
计算机科学, 2020, 47(3): 211-216. https://doi.org/10.11896/jsjkx.190200259
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!