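Abstract: When sentiment classification is performed on massive datasets, the limited scalability of traditional single-machine sentiment classification algorithms becomes the system bottleneck. On the Hadoop cloud computing platform, we implemented MapReduce versions of the algorithms for feature extraction, feature-vector weighting, and sentiment classification. On a sentiment corpus, we compared the accuracy of the sentiment classification algorithm under each combination of these sub-steps, as well as the time cost of each algorithm. The experimental results confirm the effectiveness of the parallelized sentiment classification algorithms and give users useful reference information for choosing suitable algorithms for sentiment classification tasks.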
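As a rough illustration of what expressing the feature-extraction step in MapReduce form can look like, the following sketch counts per-class document frequencies for terms, a statistic that feature selection commonly builds on. It simulates the map, shuffle, and reduce phases in plain Python rather than on Hadoop. The function names, the `(term, label)` key layout, and the toy corpus are assumptions made for illustration and are not taken from the paper.

```python
from collections import defaultdict
from itertools import chain


def map_doc(doc_id, label, tokens):
    """Map step (hypothetical): emit ((term, label), 1) once per distinct term."""
    for term in set(tokens):
        yield (term, label), 1


def shuffle(pairs):
    """Group emitted values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    """Reduce step (hypothetical): sum the counts for one (term, label) key."""
    return key, sum(values)


def document_frequencies(corpus):
    """Run map, shuffle, and reduce in-process over (doc_id, label, tokens) records."""
    emitted = chain.from_iterable(map_doc(*doc) for doc in corpus)
    return dict(reduce_counts(k, v) for k, v in shuffle(emitted).items())


# Toy labeled corpus, invented for this sketch.
corpus = [
    (1, "pos", ["good", "phone", "good"]),
    (2, "pos", ["good", "screen"]),
    (3, "neg", ["bad", "phone"]),
]
df = document_frequencies(corpus)
```

Because each map call sees only one document and each reduce call sees only one key, both phases can be distributed across machines without shared state, which is the property that removes the single-machine bottleneck described above.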