计算机科学 ›› 2019, Vol. 46 ›› Issue (11A): 13-18.

• 智能计算 • 上一篇    下一篇

多义词语义拓扑及有监督的词义消歧研究

肖锐1, 蒋家琪2, 张云春2   

  1. (云南大学国际学院 昆明650091)1;
    (云南大学软件学院 昆明650091)2
  • 出版日期:2019-11-10 发布日期:2019-11-20
  • 通讯作者: 张云春(1981-),男,博士,讲师,主要研究方向为机器学习、自然语言处理和计算机网络,E-mail:yunchunzhang@hotmail.com。
  • 作者简介:肖锐(1981-),女,博士,讲师,主要研究方向为对外汉语、自然语言处理。
  • 基金资助:
    本文受国家自然科学基金(61762089),国家汉办汉考国际研究基金(CTI2018B06),云南大学孔子学院建设与汉语国际推广专项课题(2018-YNUCI-Y008),云南大学2019国家社科基金培育项目资助。

Study on Semantic Topology and Supervised Word Sense Disambiguation of Polysemous Words

XIAO Rui1, JIANG Jia-qi2, ZHANG Yun-chun2   

  1. (School of International Education,Yunnan University,Kunming 650091,China)1;
    (School of Software,Yunnan University,Kunming 650091,China)2
  • Online:2019-11-10 Published:2019-11-20

摘要: 多义词语义是汉语国际教育和HSK考试的重点和难点。词义消歧研究致力于确定多义词在给定上下文中的具体含义,在人机交互、机器翻译、作文自动评分等领域被广泛应用。然而,现有的词义消歧方法存在准确率较低、语料库匮乏、特征简单等弊端。针对汉语国际教育的相关语料库和评价系统,基于深度神经网络设计汉语多义词词义消歧的分类模型是当前的研究热点,同时也是实现HSK作文自动评分的重要技术保障。已有研究假定多个义项相互独立,缺乏对多义词义项演变关系的重视,对此文中首先对典型的汉语多义词进行语义研究,以区分基础义项和固定搭配义项来构建语义拓扑图,用于指导分类模型的训练。在建立多义词语义拓扑图的基础上,通过对汉语语料库的爬虫,获取典型多义词的语料样本,进而构建有监督的深度神经网络模型,包括RNN,LSTM和GRU。通过对爬虫所获样本的分析,选取了30字长和60字长,分别设计单向和双向6种神经网络,通过多次训练对模型参数进行优化,最终获得词义消歧分类模型。实验选取“意思”多义词作为代表,开展多义词在给定上下文的词义消歧实验。结果表明,基于RNN,LSTM网络和GRU的深度学习模型的平均准确率均超过75%,其中各模型的最大准确率均超过94%;各模型的ROC曲线下面积(Area Under Curve,AUC)均超过0.966,表明其对样本类不均衡性具有较好的处理效果;单向和双向RNN模型在不同字长条件下均取得最佳学习效果。

关键词: 长短期记忆网络, 词义消歧, 深度神经网络, 循环神经网络, 语义拓扑

Abstract: Polysemous words have been considered as learning emphasis and major obstacles for foreign students who learn Chinese as a foreign language.Word Sense Disambiguation (WSD),which is mainly used to determine the specific meaning of polysemous words in a given context,has important application in human-computer interaction,machine translation,automatic essay scoring and other emerging applications.It is also the difficulty in teaching Chinese as a foreign language and HSK examination.The existing word sense disambiguation methods are shown to be with low accuracy,lack of corpus,simple features etc.Considering teaching Chinese as foreign language and its evaluation corpus,it is a hot research topic on building Chinese polysemous words WSD based on deep neural networks.It provides necessary technical support for achieving automatic HSK essay scoring.Existing researches assumed that semantic items are mutually independent and thus paid little attention to their evolutional relationships among items.To solve this problem,this paper firstly made research on semantics of typical Chinese polysemous words.Followed by semantic topology construction the basic semantic items and set-phrases were discriminated for supervised classification model training.Based on the semantic topology construction of polysemous words,the corpus samples were collected by web crawling.The supervised deep neural networks,including RNN,LSTM and GRU,were constructed subsequently.By analyzing the scrawled samples,both uni-directional and bi-directional neural networks were designed by choosing 30 words and 60 words length respectively.The final WSD classification models were obtained by multiple rounds of training and optimization to model parameters.The “Yisi”was chosen for example and used for WSD experiments within their contexts.The experimental results shows that RNN,LSTM and GRU all achieve average classification accuracy more than 75% while maximum accuracy is more than 94%.AUC under each model is more than 0.966,which shows its good performance on class imbalance among samples.Both uni-directional and bi-directional RNN models achieve best classification performance under different words length.

Key words: Deep neural networks, Long short-term memory network, Recurrent neural network, Semantic topology, Word sense disambiguation

中图分类号: 

  • TP391
[1]CORREA E A J,LOPES A D A,AMANCIO D R.Word sense disambiguation:a complex network approach[J].Information Sciences,2018,442:103-113.
[2]万富强,吴云芳.基于中文维基百科的词语语义相关度计算[J].中文信息学报,2013,27(6):31-37.
[3]WU Y,ZHANG M.Overview of the NLPCC 2017 Shared Task:Chinese Word Semantic Relation Classification[C]∥ Natural Language Processing and Chinese Computing(NLPCC 2017).Cham:Springer,2017:919-925.
[4]鲁松,白硕,黄雄.基于向量空间模型中义项词语的无导词义消歧[J].软件学报,2002,13(6):1082-1089.
[5] 陈笑蓉,秦进.基于最大熵原理的汉语词义消歧[J].计算机科学,2005,32(5):174-176.
[6]全昌勤,何婷婷,姬东鸿,等.基于多分类器决策的词义消歧方法[J].计算机研究与发展,2006,43(5):933-939.
[7]曹鸿霞.统计与神经网络相结合的词义消歧模型[J].武汉理工大学学报,2006,28(8):131-134.
[8]张雄,陈福才,黄瑞阳.基于双词主题模型的半监督实体消歧方法研究[J].电子学报,2018,46(3):607-613.
[9]JIMENO Y A.Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedi-cal word sense disambiguation[J].Journal of biomedical informatics,2017,73:137-147.
[10]CORREA JR E A,LOPES A A,AMANCIO D R.Word sense disambiguation:A complex network approach[J].Information Science,2018,442:103-113.
[11]杨安,李素建,李芸.基于领域知识和词向量的词义消歧方法[J].北京大学学报(自然科学版),2017,53(2):204-210.
[12]杨陟卓,黄河燕.基于语言模型的有监督词义消歧模型优化研究[J].中文信息学报,2014,28(1):19-25.
[13]苏新春,洪桂治,唐师瑶.再论义类词典的分类原则与方法[J].世界汉语教学,2010(2):16-27.
[14]WEN L,LI J,JIN Y,et al.A method for Word Sense Disambiguation combining contextual semantic features[C]∥International Conference on Asian Language Processing.IEEE Press,2017:283-287.
[15]LEI X,CAI Y,LI Q,et al.Combining Local and Global Features in Supervised Word Sense Disambiguation[C]∥Web Information Systems Engineering-WISE 2017.Puschino,Russia:Springer,2017:117-131.
[16]PESARANGHADER A,PESARANGHADER A,MATWIN S,et al.One Single Deep Bidirectional LSTM Network for Word Sense Disambiguation of Text Data[C]∥Advances in Artificial Intelligence.Canadian AI 2018.Toronto,Canada:Springer,2018:96-107.
[17]LOPEZ-AREVALO I,SOSA-SOSA V J,ROJAS-LOPEZ F,et al.Improving selection of synsets from WordNet for domain-specific word sense disambiguation[J].Computer Speech & Language,2016,41(C):128-145.
[18]李安.多义词义项的语义关系及其对词义消歧的影响[J].语言文字应用,2014(1):29-37.
[19]廖光蓉.多义词意义关系模式研究[J].外语教学,2005,26(3):56-59.
[20]UNGERER F,SCHMID H J.An Introduction to Cognitive Linguistics[M].London:Addison Welsey,1996.
[1] 王馨彤, 王璇, 孙知信.
基于多尺度记忆残差网络的网络流量异常检测模型
Network Traffic Anomaly Detection Method Based on Multi-scale Memory Residual Network
计算机科学, 2022, 49(8): 314-322. https://doi.org/10.11896/jsjkx.220200011
[2] 彭双, 伍江江, 陈浩, 杜春, 李军.
基于注意力神经网络的对地观测卫星星上自主任务规划方法
Satellite Onboard Observation Task Planning Based on Attention Neural Network
计算机科学, 2022, 49(7): 242-247. https://doi.org/10.11896/jsjkx.210500093
[3] 赵冬梅, 吴亚星, 张红斌.
基于IPSO-BiLSTM的网络安全态势预测
Network Security Situation Prediction Based on IPSO-BiLSTM
计算机科学, 2022, 49(7): 357-362. https://doi.org/10.11896/jsjkx.210900103
[4] 康雁, 徐玉龙, 寇勇奇, 谢思宇, 杨学昆, 李浩.
基于Transformer和LSTM的药物相互作用预测
Drug-Drug Interaction Prediction Based on Transformer and LSTM
计算机科学, 2022, 49(6A): 17-21. https://doi.org/10.11896/jsjkx.210400150
[5] 王飞, 黄涛, 杨晔.
基于Stacking多模型融合的IGBT器件寿命的机器学习预测算法研究
Study on Machine Learning Algorithms for Life Prediction of IGBT Devices Based on Stacking Multi-model Fusion
计算机科学, 2022, 49(6A): 784-789. https://doi.org/10.11896/jsjkx.210400030
[6] 喻昕, 林植良.
解决一类非光滑伪凸优化问题的新型神经网络
Novel Neural Network for Dealing with a Kind of Non-smooth Pseudoconvex Optimization Problems
计算机科学, 2022, 49(5): 227-234. https://doi.org/10.11896/jsjkx.210400179
[7] 焦翔, 魏祥麟, 薛羽, 王超, 段强.
基于深度学习的自动调制识别研究
Automatic Modulation Recognition Based on Deep Learning
计算机科学, 2022, 49(5): 266-278. https://doi.org/10.11896/jsjkx.211000085
[8] 高捷, 刘沙, 黄则强, 郑天宇, 刘鑫, 漆锋滨.
基于国产众核处理器的深度神经网络算子加速库优化
Deep Neural Network Operator Acceleration Library Optimization Based on Domestic Many-core Processor
计算机科学, 2022, 49(5): 355-362. https://doi.org/10.11896/jsjkx.210500226
[9] 安鑫, 代子彪, 李阳, 孙晓, 任福继.
基于BERT的端到端语音合成方法
End-to-End Speech Synthesis Based on BERT
计算机科学, 2022, 49(4): 221-226. https://doi.org/10.11896/jsjkx.210300071
[10] 时雨涛, 孙晓.
一种会话理解模型的问题生成方法
Conversational Comprehension Model for Question Generation
计算机科学, 2022, 49(3): 232-238. https://doi.org/10.11896/jsjkx.210200153
[11] 高堰泸, 徐圆, 朱群雄.
基于A-DLSTM夹层网络结构的电能消耗预测方法
Predicting Electric Energy Consumption Using Sandwich Structure of Attention in Double -LSTM
计算机科学, 2022, 49(3): 269-275. https://doi.org/10.11896/jsjkx.210100006
[12] 李昊, 曹书瑜, 陈亚青, 张敏.
基于注意力机制的用户轨迹识别模型
User Trajectory Identification Model via Attention Mechanism
计算机科学, 2022, 49(3): 308-312. https://doi.org/10.11896/jsjkx.210300231
[13] 肖丁, 张玙璠, 纪厚业.
基于多头注意力机制的用户窃电行为检测
Electricity Theft Detection Based on Multi-head Attention Mechanism
计算机科学, 2022, 49(1): 140-145. https://doi.org/10.11896/jsjkx.210100177
[14] 范红杰, 李雪冬, 叶松涛.
面向电子病历语义解析的疾病辅助诊断方法
Aided Disease Diagnosis Method for EMR Semantic Analysis
计算机科学, 2022, 49(1): 153-158. https://doi.org/10.11896/jsjkx.201100125
[15] 周欣, 刘硕迪, 潘薇, 陈媛媛.
自然交通场景中的车辆颜色识别
Vehicle Color Recognition in Natural Traffic Scene
计算机科学, 2021, 48(6A): 15-20. https://doi.org/10.11896/jsjkx.200800078
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!