计算机科学 ›› 2014, Vol. 41 ›› Issue (12): 197-201.doi: 10.11896/j.issn.1002-137X.2014.12.043
王洁,于颜硕,周宽久,侯刚
WANG Jie,YU Yan-shuo,ZHOU Kuan-jiu and HOU Gang
摘要: Web标签有助于用户根据自己特定的兴趣完成信息资源的分类、组织和检索。然而,正是由于协同标记系统特有的公开性、自由化的特点,采用其对信息资源进行描述、组织、分类和检索,存在着信息描述不精确、标签组织混乱和标签语意模糊等问题。在此背景下提出了3种基于特征向量表示法(FVR)的Web标签SOINN聚类算法:基于资源的特征向量表示法、基于其他共现标签的特征向量表示法和基于全集共现标签的特征向量表示法。同时应用MapReduce框架将SOINN算法进行并行化。实验表明,当类中心数量超过2000时,3种分布式聚类FVR算法的召回率和准确度优于原始算法,可获得很好的加速比。从而证明此分布式聚类算法具有很好的可扩展性,可以用于更为海量的Web日志聚类分析系统。
[1] Kamel B,Wheeler S.The emerging Web 2.0 social software:an enabling suite of sociable technologies in health and health care education1[J].Health Information & Libraries Journal,2007,24(1):2-23 [2] Li Y,An J.Analysis on the Online Public Opinion Management in the Context of Web 2.0[C]∥Proceedings of 2010 International Conference on Public Administration(6th).2010:418-422 [3] Kipp M E I,Campbell D G.Patterms and inconsistencies in collaborative tagging systems:An examination of tagging prattices[C]∥Proceedings of the American Society for Information Science and Technology.2006:1-18 [4] Grigory B,Philipp K,Frank S.Automated Tag Clustering:Improving search and exploration in the tag space[C]∥ Collaborative Web Tagging Workshop at www 2006.Edinburgh,Scotland,2006:15-33 [5] Paul H,Hector G.Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems[R].Stanford,2006 [6] Ramage D,Heymann P,Manning C D,et al.Clustering thetagged web[C]∥International Conference on Web Search and Web Data Mining(WSDM).ACM,2009:54-63 [7] Gunarathne T,Zhang B J,Wu T L,et al.Scalable parallel computing on clouds using Twister4Azure iterative MapReduce[J].Future Generation Computer Systems,2013,29(4):1035-1048 [8] Furao S,Hui Y,Sakurai K,et al.An incremental online semi-supervised ac-tive learning algorithm based on self-organizing incremental neural network[J].Neural Computing & Applications,2011,0(7):1061-74 [9] Kawewong A,Honda Y,Tsuboyama M,et al.Reasoning on the Self-organizing Incremental Associative Memory for Online Robot Path Planning[J].IEICE transactions on information and systems,2010,3(3):569-582 [10] Ching-man A,Nicholas G,Nigel S.Contextualising Tags in Collaborative Tagging Systems[C]∥20th ACM Conference on Hypertext and Hypermedia.ACM,2009:251-260 [11] Gunarathne T,Zhang B,Wu T L,et al.Scalable parallel computing on clouds using Twister4Azure iterative MapReduce[J].Future Generation Computer Systems,2013,29(4):1035-1048 [12] Shen F,Osamu H.An incremental network for on-line unsupervised classification and topology learning[J].Neural Networks,2006,9(1):90-106 |
No related articles found! |
|