Computer Science ›› 2015, Vol. 42 ›› Issue (3): 206-209. doi: 10.11896/j.issn.1002-137X.2015.03.042

• Artificial Intelligence •

Web Content Rating Algorithm Based on Network Community Structure and Its Performance Analysis

LIU Yan, WANG Tai

  1. National Engineering Research Center for E-Learning, Central China Normal University, Wuhan 430079; Department of Educational Technology, Wuhan University of Technology, Wuhan 430079, National Engineering Research Center for E-Learning, Central China Normal University, Wuhan 430079
  • Online: 2018-11-14 Published: 2018-11-14

Abstract: Web content is massive, diverse in form, and lacking in semantic description, which poses major challenges for real-time, automated content rating. The proposed algorithm exploits a structural property of the Web, namely that webpages on similar topics aggregate into content communities. While processing the webpage for which a rating is requested, it uses a network community detection method to automatically obtain other webpages with similar content, which improves the efficiency of content rating. In addition, it can be integrated into existing third-party Web content rating systems. Theoretical analysis shows that the algorithm significantly improves the efficiency of Web content rating.

Key words: Web content rating, Network community structure, Third-party rating
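
The abstract gives no implementation detail, so the following is only a minimal Python sketch of the general idea, under stated assumptions. It assumes the link structure is available as an undirected adjacency map, uses a simple greedy local expansion as a stand-in for whatever community detection method the paper actually employs, and assumes that each page in the detected community is rated in advance and cached. The names `local_community`, `rate_with_community`, and `rate_page` are hypothetical and do not come from the paper.

```python
from typing import Callable, Dict, Set

Graph = Dict[str, Set[str]]  # undirected link graph: page -> linked pages


def local_community(graph: Graph, seed: str) -> Set[str]:
    """Grow a community around `seed` by greedy local expansion.

    Stand-in heuristic (an assumption, not the paper's method): a frontier
    page joins when at least half of its links point into the community.
    """
    community = {seed}
    changed = True
    while changed:
        changed = False
        frontier = {n for m in community for n in graph.get(m, set())}
        for node in sorted(frontier - community):  # sorted for determinism
            neighbours = graph.get(node, set())
            inside = len(neighbours & community)
            if inside > 0 and 2 * inside >= len(neighbours):
                community.add(node)
                changed = True
    return community


def rate_with_community(
    graph: Graph,
    url: str,
    rate_page: Callable[[str], str],  # hypothetical single-page rater
    ratings: Dict[str, str],          # cache of ratings already computed
) -> str:
    """Rate `url` and, in the same pass, pre-rate its community members."""
    if url in ratings:
        return ratings[url]
    for page in sorted(local_community(graph, url)):
        if page not in ratings:
            ratings[page] = rate_page(page)
    return ratings[url]
```

With this structure, a later request for any page in an already processed community is answered from the cache, which is one plausible reading of the efficiency gain the abstract claims.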

