计算机科学 ›› 2019, Vol. 46 ›› Issue (11A): 103-107.

• 智能计算 • 上一篇    下一篇

基于神经网络的关系词非充盈态复句层次的自动识别

杨进才, 杨璐璐, 汪燕燕, 沈显君   

  1. (华中师范大学计算机学院 武汉430079)
  • 出版日期:2019-11-10 发布日期:2019-11-20
  • 通讯作者: 杨进才(1967-),男,博士,教授,博士生导师,主要研究方向为现代信息系统、中文信息处理,E-mail:jcyang@mail.ccnu.edu。
  • 基金资助:
    本文受国家社科基金项目(19BYY092)资助。

Hierarchy Division of Compound Sentence with Non-saturated Relation Word via Neural Network

YANG Jin-cai, YANG Lu-lu, WANG Yan-yan, SHEN Xian-jun   

  1. (School of Computer,Central China Normal University,Wuhan 430079,China)
  • Online:2019-11-10 Published:2019-11-20

摘要: 复句层次关系划分是复句句法结构分析以及语义甄别的基础,但关系词非充盈态复句由于关系标记的省略给层次划分带来了困难。文中利用依存关系句法树和word2vec词向量模型的方法来提取复句中分句的句法特征和语义特征,并利用神经网络进行训练,获得三句式关系词的非充盈态复句层次划分模型,对测试集中的复句进行层次划分测试,其准确率为74%。

关键词: word2vec, 复句层次划分, 关系词非充盈态, 神经网络, 依存句法

Abstract: Hierarchical division of a compound sentence is the basis of syntactic structure analysis and semantic discrimination.However,the ellipsis of relational markers bring difficulties to the hierarchical division of a compound sentence.This paper combined dependency syntactic trees and word2vec word vector model to extract the syntactic structure and semantic features of compound sentences,then used the neural network to train a hierarchy division model for compound sentences with non-saturated relation word,and the hierarchical division test was carried out on the complex sentences in the test set.The test accuracy of test set is 74%.

Key words: Compound sentence with non-saturated relation word, Depen-dency grammar, Hierarchical division of compound sentence, Neural network, word2vec

中图分类号: 

  • TP751
[1]邢福义.汉语复句研究[M].北京:商务印刷馆,2003[2]李晋霞,刘云.面向计算机的二重复句层次划分研究[C]∥第7届计算语言学联合学术会议论文.2003[3]胡金柱,陈江曼,杨进才,等.基于规则的连用关系标记的自动标识研究[J].计算机科学,2012,39(7):196-200.
[4]杜超华,胡金柱,沈威,等.基于复句语料库分词系统研究 [J].计算机数字与工程,2007,35(5):43-44.
[5]王飞,易绵竹,谭新.基于本体语义网络的语言理解模型[J].计算机科学,2018,45(S1):101-105.
[6]鲁松,白硕,李素建,等.汉语多重关系复句的关系层次分析[J].软件学报,2001,12(7):987-995.
[7]李幸,宗成庆.引入标点处理的层次化汉语长句句法分析方法[J].中文信息学报,2006,20(4):8-15.
[8]吴锋文.基于主谓语知识挖掘的分句语义关联研究[J].语言文字应用,2011,20(4):132-142.
[9]吴锋文.面向信息处理的“一标三句式”复句层次关系判定[J].北方论丛,2012,54(1):64-68.
[10]胡金柱,舒江波,罗进军.汉语复句中分句的语义关联特征[J].语言文字应用,2010,19(4):121-130.
[11]李源,等.基于语义与规则的有标复句层次体系研究[J].计算机工程与科学,2017,39(12):2306-2313.
[12]黄仁,张卫.基于word2vec的互联网商品评论情感倾向研究[J].计算机科学,2016,43(S1):387-389.
[13]MIKOLOV T,SUTSKEVER I,CHEN K,et al.Distributed representations of words and phrases and their compositionality[C]∥Advances in Neural Information Processing Systems.2013: 3111-3119.
[14]MIKOLOV T,CHEN K,CORRADO G,et al.Efficient estimation of word representations in vector space[EB/OL].http://arxiv.org/pdf/1301.3781v3.pdf.
[15]YU M,DREDZE M.Improving lexical embeddings with semantic knowledge[C]∥Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics.Baltimore,Stroudsburg,USA:ACL,2014:545-550.
[16]KINGMA D P,JIMMY B A.Adam:A Method for StochasticOptimization[C]∥3rd International Conference for Learning Representations.San Diego,2015.
[17]TANG D Y,QIN B,LIU T.Document modeling with gated recurrent neural network for sentiment classification[C]∥Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing.2015:1422-1432.
[18]LAI S,XU L,LIU K,et al.Recurrent convolutional neural networks for text classification[C]∥Proceedings of the National Conference on Artifical Intelligence.2015:2267-2273.
[1] 宁晗阳, 马苗, 杨波, 刘士昌.
密码学智能化研究进展与分析
Research Progress and Analysis on Intelligent Cryptology
计算机科学, 2022, 49(9): 288-296. https://doi.org/10.11896/jsjkx.220300053
[2] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[3] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[4] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
[5] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[6] 王润安, 邹兆年.
基于物理操作级模型的查询执行时间预测方法
Query Performance Prediction Based on Physical Operation-level Models
计算机科学, 2022, 49(8): 49-55. https://doi.org/10.11896/jsjkx.210700074
[7] 陈泳全, 姜瑛.
基于卷积神经网络的APP用户行为分析方法
Analysis Method of APP User Behavior Based on Convolutional Neural Network
计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121
[8] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[9] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[10] 闫佳丹, 贾彩燕.
基于双图神经网络信息融合的文本分类方法
Text Classification Method Based on Information Fusion of Dual-graph Neural Network
计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042
[11] 齐秀秀, 王佳昊, 李文雄, 周帆.
基于概率元学习的矩阵补全预测融合算法
Fusion Algorithm for Matrix Completion Prediction Based on Probabilistic Meta-learning
计算机科学, 2022, 49(7): 18-24. https://doi.org/10.11896/jsjkx.210600126
[12] 杨炳新, 郭艳蓉, 郝世杰, 洪日昌.
基于数据增广和模型集成策略的图神经网络在抑郁症识别上的应用
Application of Graph Neural Network Based on Data Augmentation and Model Ensemble in Depression Recognition
计算机科学, 2022, 49(7): 57-63. https://doi.org/10.11896/jsjkx.210800070
[13] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[14] 戴朝霞, 李锦欣, 张向东, 徐旭, 梅林, 张亮.
基于DNGAN的磁共振图像超分辨率重建算法
Super-resolution Reconstruction of MRI Based on DNGAN
计算机科学, 2022, 49(7): 113-119. https://doi.org/10.11896/jsjkx.210600105
[15] 刘月红, 牛少华, 神显豪.
基于卷积神经网络的虚拟现实视频帧内预测编码
Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network
计算机科学, 2022, 49(7): 127-131. https://doi.org/10.11896/jsjkx.211100179
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!