计算机科学 ›› 2020, Vol. 47 ›› Issue (11A): 18-23.doi: 10.11896/jsjkx.200500090

• 人工智能 • 上一篇    下一篇

基于Lattice LSTM的古汉语命名实体识别

崔丹丹, 刘秀磊, 陈若愚, 刘旭红, 李臻, 齐林   

  1. 北京信息科技大学计算机学院 北京 100192
  • 出版日期:2020-11-15 发布日期:2020-11-17
  • 通讯作者: 陈若愚(ruoyu-chen@foxmail.com)
  • 作者简介:13718230670@163.com
  • 基金资助:
    国家重点研发计划课题(2017YFB1400402)

Named Entity Recognition in Field of Ancient Chinese Based on Lattice LSTM

CUI Dan-dan, LIU Xiu-lei, CHEN Ruo-yu, LIU Xu-hong, LI Zhen, QI Lin   

  1. Computer School,Beijing Information Science and Technology University,Beijing 100192,China
  • Online:2020-11-15 Published:2020-11-17
  • About author:CUI Dan-dan,born in 1997,postgra-duate.Her main research interests include natural language processing and knowledge graph.
    CHEN Ruo-yu,born in 1982,Ph.D,lecturer,is a member of China Computer Federation.His main research interests include natural language processing,data mining and semantic Web.
  • Supported by:
    This work was supported by the National Key R&D Program of China(2017YFB1400402).

摘要: 基于《四库全书》数据集,研究古汉语的命名实体识别技术。提出了基于Lattice LSTM模型的古汉语命名实体识别算法,该方法将字符序列信息和词序列信息共同作为模型的输入。采用甲言(jiayan)分词工具,利用word2vec训练古文字、词向量并作为Lattice LSTM模型的输入,提升了古汉语命名实体识别的效果。基于Lattice LSTM模型和预训练的古文字、词向量,提高了古汉语的实体识别效果,相比传统的BiLSTM-CRF模型,其F1分数提升3.95%左右。

关键词: BiLSTM-CRF, Lattice LSTM, 古汉语, 命名实体识别, 深度学习

Abstract: Investigated the named entity recognition problem of ancient Chinese literature based on the Complete Collection of Four Treasuries dataset.Proposed an algorithm for named entity recognition of ancient Chinese literature based on the Lattice LSTM model.This method combines both character sequence information and word sequence information as input to the model.Using jiayan word segmentation tool,word2vec is used to train character and word level embedding of ancient Chinese as input to the Lattice LSTM model,which improves the performance of named entity recognition based on ancient Chineseliterature.Based on the Lattice LSTM model and pre-trained character and word level embedding of ancient Chinese,the performance of named entity recognition based on ancient Chinese literature is improved.Compared with the traditional Bi-LSTM-CRF model,its F1 score is improved by about 3.95%.

Key words: Ancient Chinese literature, BiLSTM-CRF, Deep learning, Lattice LSTM, Named entity recognition

中图分类号: 

  • TP312
[1] CHEN S D,OUYANG X Y.Summary of Named Entity Recognition Technology[J].Radio Communication Techology,2020,46(3):251-260.
[2] ZHOU K.Research on Rule-based Named Entity Recognition[D].Hefei:Hefei University of Technology,2010.
[3] BAO M N,SI L,GE L.Research on Mongolian Named Entity Recognition Based on Dictionary Matching[J].Journal of Minzu University of China,2017,44(3):165-169.
[4] CHEN J,CHANG Z Q,XU J.Recognition and Classification of Biomedical Named Entities Based on HMM[J].Computer Age,2006(10):40-42.
[5] WANG H C,ZHAO T J.Recognition of Named Biomedical Entities Based on SVM[J].Journal of Harbin Engineering University,2006,27(z1):570-574.
[6] SHI H F.Research on Chinese Named Entity Recognition Based on CRF[D].Suzhou:Soochow University,2010.
[7] GU Y.Research on Recognition of Complex Chinese Named Entity Based on BiLSTM-CRF[D].Nanjing:Nanjing University,2019.
[8] MAI M T,FU A Y,WU S E,et al.Uyghur Named Entity Recognition Based on BiLSTM-CNN-CRF Model[J].Computer Engineering,2018,44(8):230-236.
[9] ZHANG X T.Research and Implementation of Medical TextChinese NamedEntity Recognition Based on Lattice LSTM[D].University of Electronic Science and Technology of China,2019.
[10] LI Y Y.Research on Named Entity Recognition AlgorithmBased on Attention Mechanism[D].Beijing:Beijing University of Posts and Telecommunications,2019.
[11] SHENG J.Application of transfer learning in named entity recognition[D].Harbin:Harbin Institute of Technology,2019.
[12] KHAN W,DAUD A,ALOTAIBI F,et al.Deep recurrent neural networks with word embeddings for Urdu named entity recognition[J].ETRI Journal,2020,42(1).
[13] KURU O,CAN O A,YURET D.Charner:Character-levelnamed entity recognition[C]//COLING.2016:911-921.
[14] REI M.Semi-supervised multitask learning for sequence labeling[C]//ACL.2017:2121-2130.
[15] TRAN Q,MACKINLAY A,YEPES A J.Named entity recognition with stack residual lstm and trainable bias decoding[C]//IJCNLP.2017:566-575.
[16] YE Z X,LING Z H.Hybrid semi-markovcrf for neural sequence labeling[C]//ACL.2018:235-240.
[17] LAMPLE G,BALLESTEROS M,SUBRAMANIAN S,et al.Neural architectures for named entity recognition[C]//NAACL.2016:260-270.
[18] GREGORIC A Z,BACHRACH Y,COOPE S.Named entity recognition with parallel recurrent neural networks[C]//ACL.2018:69-74.
[19] ZHANG C P,FANG T,LIU Y L.Research and Application of Named Entity Recognition Technology Based on LSTM-CRF[J].Computer Technology and Development,2019,29(2):106-108.
[20] Biotechnology-Biomedical Informatics;Data on Biomedical Informatics Described by Researchers at National University of Defence Science and Technology (Adversarial training based lattice LSTM for Chinese clinical named entity recognition)[J/OL].https://schlr.cnki.net/Detail/index/SPQDLAST/SPQDC4D58041D94430FE02664BB977E1BBDB.
[21] WANG B R,LIN X,ZHU X D,et al.Research on Named Entity Recognition Model of Chinese Medical Text by Lattice LSTM Neural Network[J].Chinese Journal of Health Informatics and Management,2019,16(1):84-88.
[22] ZHAO S,CAI Z P,CHEN H W,et al.Adversarial trainingbased lattice LSTM for Chinese clinical named entity recognition[J].Journal of Biomedical Informatics,2019,99.
[23] ZHANG W J,ZHANG H M,YANG L,et al.Multi-granular Chinese Word Segmentation Based on Lattice-LSTM[J].Journal of Chinese Information Processing,2019,33(1):18-24.
[1] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[2] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[3] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[4] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[5] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[6] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[7] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[8] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[9] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[10] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[11] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
[12] 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫.
小样本雷达辐射源识别的深度学习方法综述
Survey of Deep Learning for Radar Emitter Identification Based on Small Sample
计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138
[13] 祝文韬, 兰先超, 罗唤霖, 岳彬, 汪洋.
改进Faster R-CNN的光学遥感飞机目标检测
Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN
计算机科学, 2022, 49(6A): 378-383. https://doi.org/10.11896/jsjkx.210300121
[14] 王建明, 陈响育, 杨自忠, 史晨阳, 张宇航, 钱正坤.
不同数据增强方法对模型识别精度的影响
Influence of Different Data Augmentation Methods on Model Recognition Accuracy
计算机科学, 2022, 49(6A): 418-423. https://doi.org/10.11896/jsjkx.210700210
[15] 毛典辉, 黄晖煜, 赵爽.
符合监管合规性的自动合成新闻检测方法研究
Study on Automatic Synthetic News Detection Method Complying with Regulatory Compliance
计算机科学, 2022, 49(6A): 523-530. https://doi.org/10.11896/jsjkx.210300083
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!