计算机科学 ›› 2020, Vol. 47 ›› Issue (11A): 18-23.doi: 10.11896/jsjkx.200500090
崔丹丹, 刘秀磊, 陈若愚, 刘旭红, 李臻, 齐林
CUI Dan-dan, LIU Xiu-lei, CHEN Ruo-yu, LIU Xu-hong, LI Zhen, QI Lin
摘要: 基于《四库全书》数据集,研究古汉语的命名实体识别技术。提出了基于Lattice LSTM模型的古汉语命名实体识别算法,该方法将字符序列信息和词序列信息共同作为模型的输入。采用甲言(jiayan)分词工具,利用word2vec训练古文字、词向量并作为Lattice LSTM模型的输入,提升了古汉语命名实体识别的效果。基于Lattice LSTM模型和预训练的古文字、词向量,提高了古汉语的实体识别效果,相比传统的BiLSTM-CRF模型,其F1分数提升3.95%左右。
中图分类号:
[1] CHEN S D,OUYANG X Y.Summary of Named Entity Recognition Technology[J].Radio Communication Techology,2020,46(3):251-260. [2] ZHOU K.Research on Rule-based Named Entity Recognition[D].Hefei:Hefei University of Technology,2010. [3] BAO M N,SI L,GE L.Research on Mongolian Named Entity Recognition Based on Dictionary Matching[J].Journal of Minzu University of China,2017,44(3):165-169. [4] CHEN J,CHANG Z Q,XU J.Recognition and Classification of Biomedical Named Entities Based on HMM[J].Computer Age,2006(10):40-42. [5] WANG H C,ZHAO T J.Recognition of Named Biomedical Entities Based on SVM[J].Journal of Harbin Engineering University,2006,27(z1):570-574. [6] SHI H F.Research on Chinese Named Entity Recognition Based on CRF[D].Suzhou:Soochow University,2010. [7] GU Y.Research on Recognition of Complex Chinese Named Entity Based on BiLSTM-CRF[D].Nanjing:Nanjing University,2019. [8] MAI M T,FU A Y,WU S E,et al.Uyghur Named Entity Recognition Based on BiLSTM-CNN-CRF Model[J].Computer Engineering,2018,44(8):230-236. [9] ZHANG X T.Research and Implementation of Medical TextChinese NamedEntity Recognition Based on Lattice LSTM[D].University of Electronic Science and Technology of China,2019. [10] LI Y Y.Research on Named Entity Recognition AlgorithmBased on Attention Mechanism[D].Beijing:Beijing University of Posts and Telecommunications,2019. [11] SHENG J.Application of transfer learning in named entity recognition[D].Harbin:Harbin Institute of Technology,2019. [12] KHAN W,DAUD A,ALOTAIBI F,et al.Deep recurrent neural networks with word embeddings for Urdu named entity recognition[J].ETRI Journal,2020,42(1). [13] KURU O,CAN O A,YURET D.Charner:Character-levelnamed entity recognition[C]//COLING.2016:911-921. [14] REI M.Semi-supervised multitask learning for sequence labeling[C]//ACL.2017:2121-2130. [15] TRAN Q,MACKINLAY A,YEPES A J.Named entity recognition with stack residual lstm and trainable bias decoding[C]//IJCNLP.2017:566-575. [16] YE Z X,LING Z H.Hybrid semi-markovcrf for neural sequence labeling[C]//ACL.2018:235-240. [17] LAMPLE G,BALLESTEROS M,SUBRAMANIAN S,et al.Neural architectures for named entity recognition[C]//NAACL.2016:260-270. [18] GREGORIC A Z,BACHRACH Y,COOPE S.Named entity recognition with parallel recurrent neural networks[C]//ACL.2018:69-74. [19] ZHANG C P,FANG T,LIU Y L.Research and Application of Named Entity Recognition Technology Based on LSTM-CRF[J].Computer Technology and Development,2019,29(2):106-108. [20] Biotechnology-Biomedical Informatics;Data on Biomedical Informatics Described by Researchers at National University of Defence Science and Technology (Adversarial training based lattice LSTM for Chinese clinical named entity recognition)[J/OL].https://schlr.cnki.net/Detail/index/SPQDLAST/SPQDC4D58041D94430FE02664BB977E1BBDB. [21] WANG B R,LIN X,ZHU X D,et al.Research on Named Entity Recognition Model of Chinese Medical Text by Lattice LSTM Neural Network[J].Chinese Journal of Health Informatics and Management,2019,16(1):84-88. [22] ZHAO S,CAI Z P,CHEN H W,et al.Adversarial trainingbased lattice LSTM for Chinese clinical named entity recognition[J].Journal of Biomedical Informatics,2019,99. [23] ZHANG W J,ZHANG H M,YANG L,et al.Multi-granular Chinese Word Segmentation Based on Lattice-LSTM[J].Journal of Chinese Information Processing,2019,33(1):18-24. |
[1] | 饶志双, 贾真, 张凡, 李天瑞. 基于Key-Value关联记忆网络的知识图谱问答方法 Key-Value Relational Memory Networks for Question Answering over Knowledge Graph 计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277 |
[2] | 汤凌韬, 王迪, 张鲁飞, 刘盛云. 基于安全多方计算和差分隐私的联邦学习方案 Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy 计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108 |
[3] | 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺. 时序知识图谱表示学习 Temporal Knowledge Graph Representation Learning 计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204 |
[4] | 王剑, 彭雨琦, 赵宇斐, 杨健. 基于深度学习的社交网络舆情信息抽取方法综述 Survey of Social Network Public Opinion Information Extraction Based on Deep Learning 计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099 |
[5] | 郝志荣, 陈龙, 黄嘉成. 面向文本分类的类别区分式通用对抗攻击方法 Class Discriminative Universal Adversarial Attack for Text Classification 计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077 |
[6] | 姜梦函, 李邵梅, 郑洪浩, 张建朋. 基于改进位置编码的谣言检测模型 Rumor Detection Model Based on Improved Position Embedding 计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046 |
[7] | 孙奇, 吉根林, 张杰. 基于非局部注意力生成对抗网络的视频异常事件检测方法 Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection 计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061 |
[8] | 胡艳羽, 赵龙, 董祥军. 一种用于癌症分类的两阶段深度特征选择提取算法 Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification 计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092 |
[9] | 程成, 降爱莲. 基于多路径特征提取的实时语义分割方法 Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction 计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157 |
[10] | 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木. 中文预训练模型研究进展 Advances in Chinese Pre-training Models 计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018 |
[11] | 周慧, 施皓晨, 屠要峰, 黄圣君. 基于主动采样的深度鲁棒神经网络学习 Robust Deep Neural Network Learning Based on Active Sampling 计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044 |
[12] | 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫. 小样本雷达辐射源识别的深度学习方法综述 Survey of Deep Learning for Radar Emitter Identification Based on Small Sample 计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138 |
[13] | 祝文韬, 兰先超, 罗唤霖, 岳彬, 汪洋. 改进Faster R-CNN的光学遥感飞机目标检测 Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN 计算机科学, 2022, 49(6A): 378-383. https://doi.org/10.11896/jsjkx.210300121 |
[14] | 王建明, 陈响育, 杨自忠, 史晨阳, 张宇航, 钱正坤. 不同数据增强方法对模型识别精度的影响 Influence of Different Data Augmentation Methods on Model Recognition Accuracy 计算机科学, 2022, 49(6A): 418-423. https://doi.org/10.11896/jsjkx.210700210 |
[15] | 毛典辉, 黄晖煜, 赵爽. 符合监管合规性的自动合成新闻检测方法研究 Study on Automatic Synthetic News Detection Method Complying with Regulatory Compliance 计算机科学, 2022, 49(6A): 523-530. https://doi.org/10.11896/jsjkx.210300083 |
|