计算机科学 ›› 2018, Vol. 45 ›› Issue (6A): 46-49.
拥措1,2,史晓东2,尼玛扎西1
YONG Tso1,2,SHI Xiao-dong2,NyimaTrashi1
摘要: 随着社交网络的逐渐成熟,各类语种的文本出现在社交网络上。而这些非规范的短文本蕴藏着人们对事物的褒贬、需求等意见,是国家政府和企业了解公众舆论的重要参考信息,具有重大的研究价值和应用价值。首先,对目前互联网短文本情感分析领域常用的神经网络、跨语言和应用语言学知识等研究方法进行归纳和总结;其次,对当前短文本情感分析研究的热点领域——社交媒体和资源稀缺语言的情感分析进行现状分析;最后,对短文本情感分析研究的趋势进行总结,分析存在的问题,并对未来进行展望。
中图分类号:
[1]TURNEY P D.Thumps Up or Thumps Down? Semantic Orien- tation Applied to Unsupervised Classification of Reviews[C]∥Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics.Philadelphia,2002:417-424. [2]PANG B,LEE L,VAITHYANATHAN S.Thumps Up? Sentiment Classification using Machine Learning Techniques[C]∥Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing.Philadelphia,2002:79-86. [3]KIM Y.Convolutional Neural Networks For Sentence Classification[C]∥2014 Conference on Empirical Methods in Natural Language Processing (EMNLP 2014).2014:1746-1751. [4]SOCHER R,PENNINGTON J,HUANG E H,et al.Semi-supervised recursive autoencoders for predicting sentiment distributions[C]∥2011 Conference on Empirical Methods in Natural Language Processing.Edinburgh,Scotland,UK,2011:151-161. [5]MIKOLOV T,ZWEIG G.Context dependent recurrent neural network language model[C]∥SLT Workshop.2012. [6]TAI K,SOCHER R,MANNING C D.Improved semantic representations from tree-structured long short-term memory networks[J].Computer Science,2015,5(1):36. [7]POLANYI L,ZAENEN A.Contextual Valence Shifters[J].Information Retrieval,2004,20:1-10. [8]ZHU X,GUO H,MOHAMMAD S,et al.An Empirical Study on the Effect of Negation Words on Sentiment[C]∥Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics.Baltimore,Maryland,USA,2014:304-313. [9]KIRITCHENKO,MOHAMMAD.Sentiment Composition of Words with Opposing Polarities[C]∥Proceedings of NAACL-HLT 2016.San Diego,California,2016:1102-1108. [10]TABOADA M,BROOKE,TOFILOSKI,et al.Lexicon-Based Methods for Sentiment Analysis[J].Computational Linguistics,2011,37(2):267-307. [11]WEI W,WU C H,LIN J C.A regression approach to affective rating of chinese words from anew[M]∥Affective Computing and Intelligent Interaction.Springer Berlin Heidelberg,2011:121-131. [12]MALANDRAKIS N,POTAMIANOS A,LOSIF E,,et al.Distributional semantic models for affective text analysis[J].IEEE transactions on audio,speech,and language processing,2013,21(11):2379-2392. [13]QIAN Q,HUANG M L,ZHU X Y.Linguistically Regularized LSTMs for Sentiment Classification[J].arXiv:1611.03949v1 [14]WAN X J.Using Bilingual Knowledge and Ensemble Technics for Unsupervised Chinese Sentiment Analysis[C]∥2008 Conference on Empirical Methods in Natual Language Processing.Honolulu,China,2008:553-561. [15]WAN X J.Co-Training for Cross-Lingual Sentiment Classification[C]∥Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP.Suntec,Singapore,2009:235-243. [16]MENG X F,WEI F R,LIU X H,et al.Cross-Lingual Mixture Model for Sentiment Classification[C]∥Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics.Jeju,Republic of Korea,2012:572-581. [17]POPAT K,BALAMURALI A R,BHATTACHARYYA P, et al.Thehaves and the have-nots:Leveraging unlabelled corporafor sentiment analysis[C]∥Proceedings of the AnnualMeeting of the Association for Computational Linguistics.Sofia,Bulga-ria,2013:412-422. [18]XU R F,XU J,WANG X L.Instance Level Transfer Learning for Cross LingualOpinion Analysis[C]∥Proceedings of the 2nd Workshopon Computational Approaches to Subjectivityand Sentiment Analysis(ACL-HLT 2011).Portland,Oregon,USA,2011:182-188. [19]GUI L,XU R F,LU Q,et al.Cross-lingual OpinionAnalysis via Negative Transfer Detection[C]∥Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers).2014:860-865. [20]CHEN Q,LI W,LEI Y,et al.Learning to Adapt Credible Knowledge in Cross-lingual SentimentAnalysis[C]∥Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing.Beijing,China,2015:419-429. [21]XIAO M,GUO Y H.Semi-supervisedrepresentation learning for cross-lingual text classification[C]∥Proceedings of EMNLP 2013.2013:1465-1475. [22]ZHOU H,CHEN L,SHI F,et al.Learning bilingual sentiment word embeddings for cross-language sentiment classification[C]∥Proceedings of 52rd Annual Meeting of the Association for Computational Linguistic.2015:430-440. [23]CHANDAR A P S,KHAPRA M M,RAVINDRAN B,et al. Multilingual deep learning[C]∥Deep LearningWorkshop at NIPS 2013.2013. [24]SARATH CHANDAR A P,LAULY S,LAROCHELLE H, et al.An autoencoder approachto learning bilingual word representations[C]∥Advances in Neural Information Processing Systems.2014:1853-1861. [25]ZHOU G Y,HE T T,ZHAO J.Bridging the Language Gap: Learning Distributed Semantics for Cross-Lingual Sentiment Classification[C]∥Proceedings of Natural Language Processingand Chinese Computing.Springer Verlag,2014:138-149. [26]MAAS A L,DALY R E,PHAM P T,et al.Learning Word Vectors for Sentiment Analysis[C]∥Proceedings of the 49thAnnualMeeting of the Association for Computational Linguistics.2011:142-150. [27]WANG Y,LI ZH,LIU J,et al.Word Vector Modeling for Sentiment Analysis of Product Reviews[C]∥Proceedings of Natural Language Processing and Chinese Computing.Springer Verlag,2014:168-180. [28]TANG X W,WAN X J.Learning Bilingual Embedding Model for Cross-language Sentiment Classification[C]∥Proceedings of 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies(IAT).IEEE,2014:134-141. [29]ZHOU X J,WAN X J,XIAO J G.Cross-Lingual Sentiment Classification with Bilingual DocumentRepresentation Learning[C]∥Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics.Berlin,Germany,2016:1403-1412. [30]ZHOU X J,WAN X J,XIAO J G.Attention-based LSTM Network for Cross-Lingual Sentiment Classification[C]∥Procee-dings of the 2016 Conference on Empirical Methods in Natural Language Processing.Austin,Texas,2016:247-256. [31]TANG D Y,WEI F R,YANG N,et al.Learning Sentiment- Specific Word Embedding for Twitter Sentiment Classification[C]∥Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistic.2014:1555-1565. [32]MOHAMMAD S M,KIRITCHENKO S,ZHU X D.Nrc-canada:Building the state-of-the-art in sentiment analysis of tweets[C]∥Proceedings of SemEval-2013.2013. [33]RETRIEVAL F.Opinion mining and sentiment analysis[J].Founda- tions and Trends in Information Retrieval,2008,2(1/2):1-135. [34]OWOPUTI O,DYER C,GIMPEL K,et al.Part-of-speech tagging for twitter:Word clusters and other advances[R].CMU,2012. [35]MOHAMMAD S,KIRITCHENKO S,ZHU X.Nrc-canada:Buil- ding the state-of-the-art in sentiment analysis of tweets[C]∥Proceedings of the Seventh International Workshop on Semantic Evaluation Exercises (SemEval-2013).Atlanta,Georgia,USA,2013. [36]NABIL M,ALY M,ATIYA A F.Astd:Arabic sentiment tweets dataset[C]∥Proceedings of EMNLP.2015:2515-2519. [37]AL-TWAIRESH N,AL-KHALIFA H,AL-SALMAN A.AraSenTi:Large-Scale Twitter-Specific Arabic Sentiment Lexicons[C]∥Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics.2016:697-705. [38]VO D T,ZHANG Y.Don’t Count,Predict! An Automatic Approach to Learning Sentiment Lexicons for Short Text[C]∥Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics.2016:219-224. [39]李海刚,于洪志.藏文文本情感分类系统设计[J].甘肃科技纵横,2011,40(1):106-107. [40]张俊,李应兴.基于情感词典的藏文微博情感分析研究[J].硅谷,2014(20):220-222. [41]袁斌,江涛,于洪志.基于语义空间的藏文微博情感分析方法[J].计算机应用研究,2016,33(3):682-685. |
[1] | 吕晓锋, 赵书良, 高恒达, 武永亮, 张宝奇. 基于异质信息网的短文本特征扩充方法 Short Texts Feautre Enrichment Method Based on Heterogeneous Information Network 计算机科学, 2022, 49(9): 92-100. https://doi.org/10.11896/jsjkx.210700241 |
[2] | 周旭, 钱胜胜, 李章明, 方全, 徐常胜. 基于对偶变分多模态注意力网络的不完备社会事件分类方法 Dual Variational Multi-modal Attention Network for Incomplete Social Event Classification 计算机科学, 2022, 49(9): 132-138. https://doi.org/10.11896/jsjkx.220600022 |
[3] | 王剑, 彭雨琦, 赵宇斐, 杨健. 基于深度学习的社交网络舆情信息抽取方法综述 Survey of Social Network Public Opinion Information Extraction Based on Deep Learning 计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099 |
[4] | 邵欣欣. TI-FastText自动商品分类算法 TI-FastText Automatic Goods Classification Algorithm 计算机科学, 2022, 49(6A): 206-210. https://doi.org/10.11896/jsjkx.210500089 |
[5] | 么晓明, 丁世昌, 赵涛, 黄宏, 罗家德, 傅晓明. 大数据驱动的社会经济地位分析研究综述 Big Data-driven Based Socioeconomic Status Analysis:A Survey 计算机科学, 2022, 49(4): 80-87. https://doi.org/10.11896/jsjkx.211100014 |
[6] | 刘硕, 王庚润, 彭建华, 李柯. 基于混合字词特征的中文短文本分类算法 Chinese Short Text Classification Algorithm Based on Hybrid Features of Characters and Words 计算机科学, 2022, 49(4): 282-287. https://doi.org/10.11896/jsjkx.210200027 |
[7] | 丁锋, 孙晓. 基于注意力机制和BiLSTM-CRF的消极情绪意见目标抽取 Negative-emotion Opinion Target Extraction Based on Attention and BiLSTM-CRF 计算机科学, 2022, 49(2): 223-230. https://doi.org/10.11896/jsjkx.210100046 |
[8] | 张虎, 柏萍. 融入句子中远距离词语依赖的图卷积短文本分类方法 Graph Convolutional Networks with Long-distance Words Dependency in Sentences for Short Text Classification 计算机科学, 2022, 49(2): 279-284. https://doi.org/10.11896/jsjkx.201200062 |
[9] | 袁景凌, 丁远远, 盛德明, 李琳. 基于视觉方面注意力的图像文本情感分析模型 Image-Text Sentiment Analysis Model Based on Visual Aspect Attention 计算机科学, 2022, 49(1): 219-224. https://doi.org/10.11896/jsjkx.201000074 |
[10] | 胡艳丽, 童谭骞, 张啸宇, 彭娟. 融入自注意力机制的深度学习情感分析方法 Self-attention-based BGRU and CNN for Sentiment Analysis 计算机科学, 2022, 49(1): 252-258. https://doi.org/10.11896/jsjkx.210600063 |
[11] | 戴宏亮, 钟国金, 游志铭, 戴宏明. 基于Spark的舆情情感大数据分析集成方法 Public Opinion Sentiment Big Data Analysis Ensemble Method Based on Spark 计算机科学, 2021, 48(9): 118-124. https://doi.org/10.11896/jsjkx.210400280 |
[12] | 张瑾, 段利国, 李爱萍, 郝晓燕. 基于注意力与门控机制相结合的细粒度情感分析 Fine-grained Sentiment Analysis Based on Combination of Attention and Gated Mechanism 计算机科学, 2021, 48(8): 226-233. https://doi.org/10.11896/jsjkx.200700058 |
[13] | 史伟, 付月. 考虑语境的微博短文本挖掘:情感分析的方法 Microblog Short Text Mining Considering Context:A Method of Sentiment Analysis 计算机科学, 2021, 48(6A): 158-164. https://doi.org/10.11896/jsjkx.210200089 |
[14] | 潘芳, 张会兵, 董俊超, 首照宇. 基于高效Transformer的中文在线课程评论方面情感分析 Aspect Sentiment Analysis of Chinese Online Course Review Based on Efficient Transformer 计算机科学, 2021, 48(6A): 264-269. https://doi.org/10.11896/jsjkx.200800116 |
[15] | 张明阳, 王刚, 彭起, 张岩峰. 学术论文公开评审平台数据分析 Data Analysis of OpenReview 计算机科学, 2021, 48(6): 63-70. https://doi.org/10.11896/jsjkx.200500138 |
|