Computer Science, 2014, Vol. 41, Issue (12): 197-201. doi: 10.11896/j.issn.1002-137X.2014.12.043


MapReduce-based SOINN Clustering Algorithm for Web Tag

WANG Jie, YU Yan-shuo, ZHOU Kuan-jiu and HOU Gang

  • Online:2018-11-14 Published:2018-11-14

Abstract: Web tags help users classify, organize and search internet resources according to their interests. Tag clustering can help solve problems caused by the openness and freedom of Web tag systems, such as inaccurate information description, disorganized tags and ambiguity. Three tag feature vector representation (FVR) methods are presented: resource-based FVR, other tag co-occurrence FVR and total tag co-occurrence FVR. All three can be applied to the SOINN clustering algorithm, and SOINN clustering can be parallelized with the MapReduce model. Experiments show that the three tag FVRs achieve higher accuracy and recall than the original tag co-occurrence FVR, and that MapReduce-based SOINN tag clustering performs best when the number of class centers exceeds 2000. The experimental results indicate that the distributed clustering algorithms proposed in this paper scale well and can be applied to larger-scale Web tag analysis systems.

Key words: Web tag clustering, SOINN algorithm, MapReduce
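
The abstract names the three feature vector representations but does not define them, so the following Python sketch is only one plausible reading, not the authors' method. It assumes the input is a list of (resource, tag) pairs, that a resource-based vector counts how often a tag is attached to each resource, and that a co-occurrence vector counts the resources two tags share. The function names and the `include_self` flag are hypothetical: the flag is a guess at the difference between the "other tag" variant (the tag itself excluded) and the "total tag" variant (the tag itself included).

```python
from collections import defaultdict


def resource_vectors(assignments):
    """Return {tag: {resource: count}} from (resource, tag) pairs.

    Assumed reading of a resource-based representation: each tag is
    described by the resources it has been attached to.
    """
    vectors = defaultdict(lambda: defaultdict(int))
    for resource, tag in assignments:
        vectors[tag][resource] += 1
    return {tag: dict(vec) for tag, vec in vectors.items()}


def cooccurrence_vectors(assignments, include_self=False):
    """Return {tag: {other_tag: shared_resource_count}}.

    include_self is a hypothetical switch: False leaves out the tag's own
    entry, True also counts the resources carrying the tag itself.
    """
    tags_by_resource = defaultdict(set)
    for resource, tag in assignments:
        tags_by_resource[resource].add(tag)

    vectors = defaultdict(lambda: defaultdict(int))
    for tags in tags_by_resource.values():
        for a in tags:
            row = vectors[a]  # ensures every tag gets a (possibly empty) row
            for b in tags:
                if a != b or include_self:
                    row[b] += 1
    return {tag: dict(vec) for tag, vec in vectors.items()}


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    sample = [
        ("r1", "python"), ("r1", "code"),
        ("r2", "python"), ("r2", "snake"),
        ("r3", "code"),
    ]
    print(resource_vectors(sample))
    print(cooccurrence_vectors(sample))
    print(cooccurrence_vectors(sample, include_self=True))
```

Sparse dictionaries are used here because most tags share resources with only a few others; vectors in this form could then be passed to any clustering step, including a SOINN implementation.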

