Computer Science ›› 2022, Vol. 49 ›› Issue (1): 292-297. doi: 10.11896/jsjkx.201100007
刘凯1, 张宏军2, 陈飞琼1
LIU Kai1, ZHANG Hong-jun2, CHEN Fei-qiong1
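Abstract: An insufficient corpus in the military domain alone yields a poor-quality domain embedding space, which in turn lowers the accuracy of deep neural network models in recognizing military named entities. To address this problem, this paper starts from distributed representations of characters and words and uses domain adaptation to bring in additional useful information from other domains to help learn military-domain embeddings. First, a domain dictionary is built and combined with the CRF algorithm to perform domain-adaptive word segmentation on the collected general-domain and military-domain corpora, which then serve as the embedding training corpus; word vectors are concatenated with character vectors as features to enrich the embedding information and to verify the segmentation quality. Next, the heterogeneous embedding spaces trained on the general and military domains are transformed through domain adaptation to generate domain-adaptive embeddings, which are used as the input to the BiLSTM-CRF layers of the base model. Finally, recognition is evaluated with CoNLL-2000. Experimental results show that, with the same model, feeding in the domain-adaptive embeddings rather than military-domain embeddings trained on a conventionally segmented corpus improves the precision (P), recall (R), and F1 score (F1) of recognition by 2.17%, 1.04%, and 1.59%, respectively.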
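The character-word concatenation step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `concat_char_word_features`, the toy vectors, the embedding dimensions, and the choice of a zero vector for out-of-vocabulary items are all assumptions made for illustration. Each character of a segmented sentence receives its own character vector followed by the vector of the word that contains it.

```python
from typing import Dict, List

Vector = List[float]


def concat_char_word_features(
    words: List[str],
    char_emb: Dict[str, Vector],
    word_emb: Dict[str, Vector],
    char_dim: int,
    word_dim: int,
) -> List[Vector]:
    """Build one feature vector per character of a segmented sentence.

    Each feature is the character vector followed by the vector of the
    word that contains the character. Unknown characters or words fall
    back to a zero vector (an assumption of this sketch).
    """
    features: List[Vector] = []
    for word in words:
        word_vec = word_emb.get(word, [0.0] * word_dim)
        for ch in word:
            char_vec = char_emb.get(ch, [0.0] * char_dim)
            # List concatenation yields a vector of size char_dim + word_dim.
            features.append(char_vec + word_vec)
    return features


# Toy, hand-written embeddings (hypothetical values, not trained vectors).
char_emb = {"导": [0.1, 0.2], "弹": [0.3, 0.4], "发": [0.5, 0.6], "射": [0.7, 0.8]}
word_emb = {"导弹": [1.0, 0.0, 0.0], "发射": [0.0, 1.0, 0.0]}

# Output of a word segmenter for the string "导弹发射".
segmented = ["导弹", "发射"]
features = concat_char_word_features(segmented, char_emb, word_emb, 2, 3)
```

In this sketch the quality of the segmentation directly determines which word vector each character receives, which is one way to read the abstract's remark that the concatenated features also help verify the segmentation.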