计算机科学 ›› 2019, Vol. 46 ›› Issue (11A): 33-37.

• 智能计算 • 上一篇    下一篇

基于文本信息和层次神经网络的产品评分方法

赵赟, 王中卿, 李寿山   

  1. (苏州大学计算机科学与技术学院 江苏 苏州215006)
  • 出版日期:2019-11-10 发布日期:2019-11-20
  • 通讯作者: 王中卿(1987-),男,博士,主要研究方向为自然语言处理,E-mail:wangzq.antony@gmail.com。
  • 作者简介:赵赟(1995-),男,硕士生,主要研究方向为自然语言处理,E-mail:yzhao666@stu.suda.edu.cn。
  • 基金资助:
    本文受国家自然科学基金青年项目(61806137)资助。

Product Rating with Text Information and Hierarchical Neural Network

ZHAO Yun, WANG Zhong-qing, LI Shou-shan   

  1. (School of Computer Science and Technology,Soochow University,Suzhou,Jiangsu 215006,China)
  • Online:2019-11-10 Published:2019-11-20

摘要: 通常点评网站对商品的打分都是通过对商品评论的评分求均值而获得,但是这种方式严重依赖于评论的评分,而且对于评论数较少的商品,这种方式显得不够精确。不同于传统的产品打分机制,文中提出了一种根据产品评论的文本信息对产品进行整体打分的层次神经网络模型,该模型可以从有限的评论中分析出产品较为公正的得分。在产品评论中,存在着[词-句子-评论-商品]的层级结构,因此采用了三层GRU的结构分别来对句子、评论、商品进行表示,从而预测商品最终的打分。除此之外,还对评论层进行了额外地输出,进一步提高了商品得分预测的准确率。在回归和分类两种预测任务上的实验结果表明,模型的层次结构对于预测商品得分具有至关重要的作用,同时输出评论的得分可以进一步提高预测的准确率。

关键词: 层次神经网络, 产品打分, 评分预测

Abstract: Usually,the rating of the product on the website is obtained by averaging the rating of the product review,but this method relies heavily on the rating of reviews,which is not accurate enough for products with fewer reviews.Different from the traditional product scoring mechanism,this paper proposed a hierarchical neural network model for the overall scoring of products based on the text information of them,which can analyze the fair scores of products from limited reviews.In the product review,there is a hierarchical structure of [word-sentence-review-product],so the structure of three-layer GRU is used to get the representations of the sentences,reviews and products separately,so as to predict the final score of the product.In addition,this paper also makes additional output to the review layer to further improve the accuracy of the prediction.Experiments on the two prediction tasks of regression and classification show that the hierarchical structure of the model plays a crucial role in predicting the score of the product,and the score of outputting comment can further improve the prediction accuracy.

Key words: Hierarchical neural network, Product rating, Rating prediction

中图分类号: 

  • TP391
[1]PANG B,LEE L.Seeing stars:exploiting class relationships for sentiment categorization with respect to rating scales[J/OL].http://arXiv.org/abs/cs/0506075.
[2]THET T T,NA J C,KHOO C S G.Aspect-based sentiment analysis of movie reviews on discussion boards[J].Journal of Information Science,2010,36(6):823-848.
[3]SEVERYN A,MOSCHITTI A.Twitter Sentiment Analysiswith Deep Convolutional Neural Networks[C]∥International ACM SIGIR Conference.ACM,2015:959-962.
[4]TANG D,QIN B,FENG X,et al.Target-Dependent Sentiment Classification with Long Short Term Memory[J/OL].http://arXiv.org/abs/1512.01100v1.
[5]SARWAR B,KARYPIS G,KONSTAN J,et al.Item-based collaborative filtering recommendation algorithms[C]∥International Conference on World Wide Web.ACM,2001:285-295.
[6]IFRIM G,WEIKUM G.The Bag-of-Opinions Method for Review Rating Prediction from Sparse Text Patterns[C]∥计算语言学国际学术会议.2010:913-921.
[7]FANG X,ZHAN J.Sentiment analysis using product review data[J].Journal of Big Data,2015,2(1):1-14.
[8]TANG D,QIN B,YANG Y,et al.User modeling with neuralnetwork for review rating prediction[C]∥International Conference on Artificial Intelligence.AAAI Press,2015:1340-1346.
[9]LI F,LIU N,JIN H,et al.Incorporating reviewer and product information for review rating prediction[C]∥International Joint Conference on Artificial Intelligence.AAAI Press,2011:1820-1825.
[10]FAN M,KHADEMI M.Predicting a Business Star in Yelp from Its Reviews Text Alone[J/OL].http://arXiv.org/abs/1401.0864.
[11]XIE L X,ZHOU M,SUN M S.Hierarchical Structure Based on Hybrid Approach to Sentiment Analysis of Chinese Micro Blog and Its Feature Extraction[J].Journal of Chinese Information Processing,2012,26(1):73-84.
[12]LI J,LUONG T,JURAFSKY D,et al.A Hierarchical Neural Autoencoder for Paragraphs and Documents[J].International Joint Conference on Natural Language Processing,2015:1106-1115.
[13]YANG Z,YANG D,DYER C,et al.Hierarchical Attention Networks for Document Classification[C]∥North American Chapter of the Association for Computational Linguistics,2016:1480-1489.
[14]RUDER S,GHAFFARI P,JOHN G.A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis[J/OL].http://arXiv.org/abs.1609.02745.
[15]KINGMA D P,BA J.Adam:A Method for Stochastic Optimization[J/OL].http://arXiv.org/abs/1412.6980.
[1] 陈志强, 韩萌, 李慕航, 武红鑫, 张喜龙.
数据流概念漂移处理方法研究综述
Survey of Concept Drift Handling Methods in Data Streams
计算机科学, 2022, 49(9): 14-32. https://doi.org/10.11896/jsjkx.210700112
[2] 王明, 武文芳, 王大玲, 冯时, 张一飞.
生成链接树:一种高数据真实性的反事实解释生成方法
Generative Link Tree:A Counterfactual Explanation Generation Approach with High Data Fidelity
计算机科学, 2022, 49(9): 33-40. https://doi.org/10.11896/jsjkx.220300158
[3] 张佳, 董守斌.
基于评论方面级用户偏好迁移的跨领域推荐算法
Cross-domain Recommendation Based on Review Aspect-level User Preference Transfer
计算机科学, 2022, 49(9): 41-47. https://doi.org/10.11896/jsjkx.220200131
[4] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[5] 宋杰, 梁美玉, 薛哲, 杜军平, 寇菲菲.
基于无监督集群级的科技论文异质图节点表示学习方法
Scientific Paper Heterogeneous Graph Node Representation Learning Method Based onUnsupervised Clustering Level
计算机科学, 2022, 49(9): 64-69. https://doi.org/10.11896/jsjkx.220500196
[6] 柴慧敏, 张勇, 方敏.
基于特征相似度聚类的空中目标分群方法
Aerial Target Grouping Method Based on Feature Similarity Clustering
计算机科学, 2022, 49(9): 70-75. https://doi.org/10.11896/jsjkx.210800203
[7] 郑文萍, 刘美麟, 杨贵.
一种基于节点稳定性和邻域相似性的社区发现算法
Community Detection Algorithm Based on Node Stability and Neighbor Similarity
计算机科学, 2022, 49(9): 83-91. https://doi.org/10.11896/jsjkx.220400146
[8] 吕晓锋, 赵书良, 高恒达, 武永亮, 张宝奇.
基于异质信息网的短文本特征扩充方法
Short Texts Feautre Enrichment Method Based on Heterogeneous Information Network
计算机科学, 2022, 49(9): 92-100. https://doi.org/10.11896/jsjkx.210700241
[9] 徐天慧, 郭强, 张彩明.
基于全变分比分隔距离的时序数据异常检测
Time Series Data Anomaly Detection Based on Total Variation Ratio Separation Distance
计算机科学, 2022, 49(9): 101-110. https://doi.org/10.11896/jsjkx.210600174
[10] 聂秀山, 潘嘉男, 谭智方, 刘新放, 郭杰, 尹义龙.
基于自然语言的视频片段定位综述
Overview of Natural Language Video Localization
计算机科学, 2022, 49(9): 111-122. https://doi.org/10.11896/jsjkx.220500130
[11] 曹晓雯, 梁美玉, 鲁康康.
基于细粒度语义推理的跨媒体双路对抗哈希学习模型
Fine-grained Semantic Reasoning Based Cross-media Dual-way Adversarial Hashing Learning Model
计算机科学, 2022, 49(9): 123-131. https://doi.org/10.11896/jsjkx.220600011
[12] 周旭, 钱胜胜, 李章明, 方全, 徐常胜.
基于对偶变分多模态注意力网络的不完备社会事件分类方法
Dual Variational Multi-modal Attention Network for Incomplete Social Event Classification
计算机科学, 2022, 49(9): 132-138. https://doi.org/10.11896/jsjkx.220600022
[13] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[14] 曲倩文, 车啸平, 曲晨鑫, 李瑾如.
基于信息感知的虚拟现实用户临场感研究
Study on Information Perception Based User Presence in Virtual Reality
计算机科学, 2022, 49(9): 146-154. https://doi.org/10.11896/jsjkx.220500200
[15] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!