计算机科学 ›› 2013, Vol. 40 ›› Issue (Z11): 379-382.
彭红超,童名文,邹军华,郝秋红
PENG Hong-chao,TONG Ming-wen,ZOU Jun-hua and HAO Qiu-hong
摘要: 针对国家精品课程网站中网页内容和样式独立设计,网页分割算法难以运行的问题,基于规则提出了一种网页分割预处理算法,建立了网页标签和样式信息的关联。算法包括3个步骤:第一,获取样式信息;第二,关联样式信息和标签;第三,输出HTML和PerfectNode关联类列表。随机选取了100个国家精品课程网站的网页运行预处理算法,实验结果表明该算法可以有效地 融合 网页标签和样式信息,解决了网页分割算法无法运行的问题。
[1] Sano H,Shiramatsu S,Ozono T,et al.A Web Page Segmentation Method based on Page Layouts and Title Blocks[J].International Journal of Computer Science and Network Security,2011,11(10):84-90 [2] Chibane I,Doan B L.A Web page topic segmentation algorithm based on visual criteria and content layout[C]∥Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval.ACM,2007:817-818 [3] Simon K,Lausen G.ViPER:augmenting automatic information extraction with visual perceptions[C]∥Proceedings of the 14th ACM international conference on Information and knowledge management.ACM,2005:381-388 [4] Cai D,Yu S,Wen J R,et al.VIPS:a visionbased page segmentation algorithm[R].Microsoft technical report,MSR-TR-2003-79.2003 [5] Gupta A,Kumar A,Tripathi V N,et al.Mobile web:web manipulation for small displays using multi-level hierarchy page segmentation[C]∥Proceedings of the 4th international conference on mobile technology,applications,and systems and the 1st international symposium on Computer human interaction in mobile technology.ACM,2007:599-606 [6] Yang S J H,Zhang J,Chen R C S,et al.A unit of information-based content adaptation method for improving web content accessibility in the mobile Internet[J].ETRI journal,2007,29(6):794-807 [7] Chen Y,Xie X,Ma W Y,et al.Adapting web pages for small-screen devices[J].Internet Computing,IEEE,2005,9(1):50-56 [8] Artail A,Raydan M.Device-aware desktop web page transformation for rendering on handhelds[J].Personal and Ubiquitous Computing,2005,9(6):368-380 |
No related articles found! |
|