Computer Science ›› 2024, Vol. 51 ›› Issue (6A): 230600111-8.doi: 10.11896/jsjkx.230600111

• Artificial Intelligenc • Previous Articles     Next Articles

Text Emotional Analysis Model Fusing Theme Characteristics

YANG Junzhe1, SONG Ying2, CHEN Yifei2   

  1. 1 School of Automation,Nanjing University of Information Science and Technology,Nanjing 210044,China
    2 School of Automation,Wuxi University,Wuxi,Jiangsu 214105,China
  • Published:2024-06-06
  • About author:YANG Junzhe,born in 1999,postgra-duate,is a member of CCF(No.P2586G).His main research interests include sentiment analysis and topic classification.
    SONG Ying,born in 1979,Ph.D,postgraduate supervisor,is a member of CCF(No.P2602M).Her main research interests include computer vision and digital twins.
  • Supported by:
    Natural Science Foundation of the Jiangsu Higher Education Institutions of China(19KJB520044) and Postgraduate Research & Practice Innovation Program of Jiangsu Province(SJCX23_0392).

Abstract: With the rapid development of large-scale language models,how to reduce the number of model parameters while ensuring model performance has become an important challenge in the field of natural language processing.However,the existing parameter compression techniques are often difficult to balance the stability and generalization ability of the model.To this end,this paper proposes a new framework for sentiment analysis that integrates topic features,aiming to use topic information to enhance the model’s ability to judge text sentiment polarity.Specifically,a method combining LDA and K-means is used to extract the topic features of the text,and it is spliced with word embeddings as a fixed-dimensional vector to obtain a new word vector representation.Sentence-level representation vectors are then constructed using average pooling techniques and fed into a fully connected layer for sentiment classification.To verify the effectiveness of the proposed model,comparative experiments with multiple benchmark algorithms are carried out on public sentiment analysis datasets.Experimental results show that the proposed model is significantly better than ALBERT in multiple data sets,with an accuracy rate increases by about 3.5%,and it maintains high stability and generalization ability with only a small increase in the number of parameters.

Key words: Emotional analysis, ALBERT model, Latent dirichlet allocation, Theme features, Average pooling

CLC Number: 

  • TP391
[1]WANKHADE M,RAO A C S,KULKARNI C.A survey onsentiment analysis methods,applications,and challenges[J].Artificial Intelligence Review,2022,55(7):5731-5780.
[2]TAHERDOOST H,MADANCHIAN M.Artificial Intelligence and Sentiment Analysis:A Review in Competitive Research[J].Computers,2023,12(2):37.
[3]ZHOU J,YE J M.Sentiment analysis in education research:a review of journal publications[J].Interactive learning environments,2023,31(3):1252-1264.
[4]LAN Y X,ZHANG L W,WANG H W,et al.Risk-oriented online public opinion abnormal perception and empirical research[J].Modern intelligence,2022,42(3):102-108.
[5]OSMANI A,MOHASEFI J B,SHI Y.Opinion Mining Using Enriched Joint Sentiment-Topic Model[J].International Journal of Information Technology & Decision Making,2023,22(1):313-375.
[6]CHATURVEDI I,CAMBRIA E,WELSCH R E,et al.Distinguishing between facts and opinions for sentiment analysis:Survey and challenges[J].Information Fusion,2018,44:65-77.
[7]ZHOU C,LI Q,LI C,et al.A comprehensive survey on pretrained foundation models:A history from bert to chatgpt[J/OL].https://arxiv.org/abs/2302.09419.
[8]DEVLIN J,CHANG M W,LEE K,et al.BERT:Pre-training of Deep Bidirectional Transformers for Language Understanding[J].arXiv:1810.04805,2018.
[9]LAN Z,CHENM,GOODMAN S,et al.ALBERT:A Lite BERT for Self-supervised Learning of Language Representations[J].arXiv:1909.11942,2019.
[10]TURNEY P D,LITTMAN M L.Measuring praise and criticism:Inference of semantic orientation from association[J].ACM Transactions on Information Systems(TOIS),2003,21(4):315-346.
[11]DEY S,WASIF S,TONMOY D S,et al.A comparative study of support vector machine and Naive Bayes classifier for sentiment analysis on Amazon product reviews[C]//2020 International Conference on Contemporary Computing and Applications(IC3A),2020:217-220.
[12]CHEN Y.Convolutional neural network for sentence classification[D].University of Waterloo,2015.
[13]ALROOBAEAl R.Sentiment Analysis on Amazon Product Reviews using the Recurrent Neural Network(RNN)[J].International Journal of Advanced Computer Science and Applications,2022,13(4):314-318.
[14]LIN X,CHEN Z Z,WANG Z Q.Attribute-level emotional classification based on unbalanced data and integrated learning[J].Computer Science,2022,49(S1):144-149.
[15]HU Y L,TONG T Q,ZHANG X Y,et al.In-depth learningemotional analysis method of integrating self-attention mechanism[J].Computer Science,2022,49(1):252-258.
[16]YAHAV A,VISHWAKARMAI D K.Sentiment analysis using deep learning architectures:a review[J].Artificial Intelligence Review,2020,53(6):4335-4385.
[17]MIKOLOV T,CHEN K,COEEADO G,et al.Efficient estimation of word representations in vector space[J].arXiv:1301.3781,2013.
[18]VASWANI A,SHAZEER N,PARMAR N,et al.Attention Is All You Need[J].arXiv:1706.03762,2017.
[19]HUANG S C,HAN D H,QIAO B Y,et al.Insumer emotional analysis method based on ERNIE2.0-BILSTM-ATTENTION[J].Journal of Chinese Computer Systems,2021,42(12):2485-2489.
[20]LIU P,YUAN W,FU J,et al.Pre-train,prompt,and predict:A systematic survey of prompting methods in natural language processing[J].ACM Computing Surveys,2023,55(9):1-35.
[21]LI M,LI W,WANG F,et al.Applying BERT to analyze investor sentiment in stock market[J].Neural Computing and Applications,2020(3):1-14.
[22]SONG M,LIU Y L.Bert in the application and optimization of the emotional classification of Weibo short text[J].Journal of Chinese Computer Systems,2021,42(4):714-718.
[23]WANG H,HU X,ZHANG H.Sentiment analysis of com modity reviews based on ALBERT-LSTM[C]//Journal of Physics:Conference Series.Bristol,UK,2020:012022.
[24]GAO X,DING G,LIU C,et al.Research on high precision Chinese text sentiment Classification based on ALBERT Optimization[C]//2023 15th International Conference on Advanced Computational Intelligence(ICACI).Nanjing,China,2023:1-6.
[25]BLEI D M,NG A Y,JORDAN M I.Latent Dirichlet Allocation[J].The Annals of Applied Statistics,2003,3:993-1022.
[26]YAN F,DU T F,MAO J H,et al.Emotional analysis of the stock market text based on emotional dictionary and LDA model[J].Electronic Measurement Technology,2017,40(12):82-87.
[27]XUE J,CHEN J,HU R,et al.Twitter discussions and emotions about the COVID-19 pandemic:Machine learning approach[J].Journal of Medical Internet Research,2020,22(11):e20550.
[28]BUI Q V,SAYADI K,BUI M.A multi-criteria document clustering method based on topic modeling and pseudoclosure function[C]//Proceedings of the Sixth International Symposium on Information and Communication Technology.Ho Chi Minh City,Vietnam,2015:38-45.
[29]SUN Y,WANG S,FENG S,et al.ERNIE 3.0:Large-scaleKnowledge Enhanced Pre-training for Language Understanding and Generation[J].arXiv:2107.02137,2021.
[1] JIANG Haoda, ZHAO Chunlei, CHEN Han, WANG Chundong. Construction Method of Domain Sentiment Lexicon Based on Improved TF-IDF and BERT [J]. Computer Science, 2024, 51(6A): 230800011-9.
[2] XU Yuan-yin,CHAI Yu-mei,WANG Li-ming,LIU Zhen. Emotional Sentence Classification Method Based on OCC Model and Bayesian Network [J]. Computer Science, 2020, 47(3): 222-230.
[3] ZHOU Kai, REN Yi, WANG Zhe, GUAN Jian-bo, ZHANG Fang, ZHAO Yan-kang. Classification and Analysis of Ubuntu Bug Reports Based on Topic Model [J]. Computer Science, 2020, 47(12): 35-41.
[4] LI Yuan,LI Zhi-xing,TENG Lei,WANG Hua-ming,WANG Guo-yin. Comment Sentiment Analysis and Sentiment Words Detection Based on Attention Mechanism [J]. Computer Science, 2020, 47(1): 186-192.
[5] JU Ya-ya, YANG Lu, YAN Jian-feng. LDA Algorithm Based on Dynamic Weight [J]. Computer Science, 2019, 46(8): 260-265.
[6] DU Hui, CHEN Yun-fang and ZHANG Wei. Survey for Methods of Parameter Estimation in Topic Models [J]. Computer Science, 2017, 44(Z6): 29-32.
[7] LUO Hai-jiao and KE Xiao-hua. Automated Scoring Chinese Subjective Responses Based on Improved-LDA [J]. Computer Science, 2017, 44(Z11): 102-105.
[8] ZHANG Jian-wei, YAN Jian-feng, LIU Xiao-sheng and YANG Lu. Online LDA on Dynamic Vocabulary [J]. Computer Science, 2016, 43(12): 120-124.
[9] GAO Yang, YANG Lu, LIU Xiao-sheng and YAN Jian-feng. Study of Semantic Understanding by LDA [J]. Computer Science, 2015, 42(8): 279-282.
[10] GAO Yang, YAN Jian-feng and LIU Xiao-sheng. Nave Parallel LDA [J]. Computer Science, 2015, 42(6): 243-246.
[11] ZHOU Li-juan,LIN Hong-fei and YAN Jun. Tags Know You Better:A New Approach to Enhancing MIR System [J]. Computer Science, 2014, 41(2): 174-178.
[12] WANG Zhen-zhen,HE Ming and DU Yong-ping. Text Similarity Computing Based on Topic Model LDA [J]. Computer Science, 2013, 40(12): 229-232.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!