计算机科学 ›› 2013, Vol. 40 ›› Issue (6): 206-210.

• 人工智能 • 上一篇    下一篇

并行化的情感分类算法的研究

余永红,向小军,商琳   

  1. 南京邮电大学通达学院 南京210003;南京大学计算机科学与技术系 南京210093;南京大学计算机科学与技术系 南京210093
  • 出版日期:2018-11-16 发布日期:2018-11-16
  • 基金资助:
    本文受国家自然科学基金项目(61035003),科技部国际科技合作计划项目(2010DFA11030),江苏省自然科学基金项目(BK2010054)资助

Research on Parallelized Sentiment Classification Algorithms

YU Yong-hong,XIANG Xiao-jun and SHANG Lin   

  • Online:2018-11-16 Published:2018-11-16

摘要: 在海量数据集上执行情感分类任务时,传统的单机情感分类算法的扩展性成为系统的瓶颈。在云计算平台Hadoop上,实现了情感分类任务中特征提取、特征向量加权和情感分类等算法的MapReduce化。在情感语料数据集上,对各种子步骤组合下情感分类算法的精度及每种算法的时间开销进行了对比分析。实验结果验证了实现的并行化情感分类算法的有效性,同时它为用户选择合适算法实现情感分类任务提供了有价值的参考信息。

关键词: 情感分类,Hadoop,云计算,MapReduce

Abstract: The scalability problem becomes a bottleneck for traditional stand-alone sentiment classification algorithms due to the massive data.We implemented feature extraction,feature weighting and classification algorithms involved in sentiment classification task by using MapReduce technique on Hadoop platform.We evaluated our proposed paralle-lized sentiment classification algorithms on real data sets in terms of precision and time costs.Experimental results show the effectiveness of these parallelized sentiment classification algorithms and also provide valuable references for users to select suitable sentiment classification algorithms according to user requirements.

Key words: Sentiment classification,Hadoop,Cloud computing,MapReduce

[1] Pang B,Lee L,Vaithyanathan S.Thumbs up? Sentiment Classification Using Machine Learning Techniques[C]∥Proceedings of the EMNLP’02.2002:79-86
[2] Pang B,Lee L.A Sentimental Education:Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts[C]∥Proceeding of ACL.2004:271-278
[3] Dave K,Lawrence S,Pennock D.Mining the peanut gallery:opinion extraction and semantic classification of product reviews[C]∥Proceedings of WWW2003.2003
[4] Mullen T,Collier N.Sentiment analysis using support vectormachines with diverse information sources[C]∥Proceedings of EMNLP’2004.2004
[5] Li J,Sun M.Experimental Study on Sentiment Classification of Chinese Review using Machine Learning Techniques[C]∥Proceedings of IEEE NLP-KE’2007.2007
[6] Zhai Zhong-wu,Xu Hua,Li Juu,et al.Sentiment Classification for Chinese Reviews Based on Key Substring Features[C]∥Proceedings of Natural Language Processing and Knowledge Engineering.2009:24-27
[7] Devitt A,Ahmad K.Sentiment polarity identification in financial news:a cohesion-based approach[C]∥Proceedings of ACL.2007:984-991
[8] Shein K P P,Nyunt T T S.Sentiment classification based on Ontology and SVM Classifier[C]∥Proceedings of ICCSN.2010:169-172
[9] Zhai Zhong-wu,Xu Hua,Kang Ba-da,et al.Exploiting effective features for Chinese sentiment classification[J].Expert Syst.Appl.,2011,38(8):9139-9146
[10] Jeffrey D,Sanjay G.MapReduce:simplified data processing onlarge clusters[J].Commun.ACM,2008,1(1):107-113
[11] Rocchio J.Relevance Feedback in Information Retrieval[M].The SMART Retrieval System:Experiments in Automatic Do-cument Processing,Chapter 14,Prentice-Hall,1971:313-323

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!