Military Domain Named Entity Recognition Based on Multi-label

Abstract

Abstract: To recognize military named entities in military text, this paper divides them into six categories for annotation according to their characteristics. On this basis, to address the difficulty of recognizing composite military named entities that are heavily nested or combined, it improves on the traditional annotation method and proposes a multi-label annotation method. First, a composite military named entity is segmented into words, so that it becomes a combination of minimal phrases. Then, each phrase receives a segment label according to its position in the named entity, and each character within a phrase receives, on top of the segment label, a word-position label according to its position in the phrase. Finally, the combined label is used as the annotation result for each character in the military named entity. Experimental results show that this annotation method improves the recognition of military named entities.

Key words: Multi-label, Composite military named entity, Military named entity
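
The labeling steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the tag inventory or the six category names, so the B/M/E/S position tags, the `TYPE-segment-position` label format, the type names `ORG` and `WEA`, the helper names, and the example entity are all assumptions made for illustration. Word segmentation is assumed to have been done beforehand, so the function takes a list of phrases as input.

```python
from typing import List, Tuple


def position_tag(index: int, length: int) -> str:
    """Return an assumed B/M/E/S tag for item `index` in a sequence of `length` items."""
    if length == 1:
        return "S"  # single item
    if index == 0:
        return "B"  # beginning
    if index == length - 1:
        return "E"  # end
    return "M"      # middle


def multi_label(phrases: List[str], entity_type: str) -> List[Tuple[str, str]]:
    """Give every character a combined label: entity type, segment tag, word-position tag.

    `phrases` is the composite entity already segmented into minimal phrases.
    The label format is hypothetical, chosen only to show the two-level idea.
    """
    labeled = []
    for p_idx, phrase in enumerate(phrases):
        # Segment label: position of the phrase inside the whole entity.
        seg = position_tag(p_idx, len(phrases))
        for c_idx, char in enumerate(phrase):
            # Word-position label: position of the character inside its phrase.
            pos = position_tag(c_idx, len(phrase))
            labeled.append((char, f"{entity_type}-{seg}-{pos}"))
    return labeled


# Hypothetical composite entity, pre-segmented into three minimal phrases.
example = multi_label(["第一", "装甲", "师"], "ORG")
for char, tag in example:
    print(char, tag)
```

In this sketch each character carries both levels of position information, so a sequence labeling model trained on such labels can in principle recover the phrase boundaries inside a composite entity as well as the boundaries of the entity itself.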

CLC number: TP391

SHAN Yi-dong, WANG Heng-jun, WANG Na. Military Domain Named Entity Recognition Based on Multi-label[J]. Computer Science, 2019, 46(11A): 9-12. https://doi.org/

