计算机科学 ›› 2006, Vol. 33 ›› Issue (3): 191-193.

• • 上一篇    下一篇

基于网页结构挖掘的信息提取

  

  • 出版日期:2018-11-17 发布日期:2018-11-17

  • Online:2018-11-17 Published:2018-11-17

摘要: 本文提出了两种细粒度的、基于网页结构挖掘的信息提取方法,比较了它们的优缺点,并给出了相应具体实现的性能测试和结果分析.

关键词: 信息提取 网页结构挖掘 重复模式 时间特征 RSS

Abstract: To simplify the task of obtaining information from the vast number of information sources that are available on the WWW, we have developed two different methods to extract information of fine grain. This paper firstly describes the principles of the two m

Key words: Information extraction, Mining structures of Web pages, Repeated pattern, Time characteristic, RSS

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!