计算机科学 ›› 2014, Vol. 41 ›› Issue (Z11): 455-460.

• 智能系统及应用 • 上一篇    下一篇

手机产品垂直搜索引擎的研究与实现

苏永红,张玉蓉   

  1. 武汉理工大学华夏学院 武汉430223;武汉理工大学华夏学院 武汉430223
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受武汉理工大学华夏学院院级科研基金项目(11030)资助

Research and Implementation of Mobile Phone Vertical Search Engine

SU Yong-hong and ZHANG Yu-rong   

  • Online:2018-11-14 Published:2018-11-14

摘要: 随着网络技术的快速发展,通用搜索引擎已经不能满足用户的一些需求,特别是当用户需要搜索某一领域内的信息时,垂直搜索引擎就正好符合这种需求。以手机资源为背景,通过运用扩展Heritrix和Lucene,构建了一个检索结果比较精准的垂直搜索引擎。研究了通过定制和扩展Heritrix从互联网上爬取相关的信息资源,利用HtmlParser工具对爬取的信息进行分析和抽取,运用Lucene建立全文索引和提供检索服务,并设计了MVC的查询接口。通过响应时间、查全率和查准率的测试实验表明,系统达到了设计目标。

关键词: 垂直搜索,Heritrix,抽取,索引

Abstract: With the fast development of network technology,universal search engine always can not meet many user demands,especially when user needs to search some information in a field,vertical search engine accords with user demands.Cell phone resource search was discussed.It initially comes up with a vertical search with fairly precise outcome through expanding the use of Heritrix and Lucene.The major research work of this paper is divided into four parts.Firstly,by customizing and extending the Heritrix,it crawled some information from Internet.Secondly,the crawled information was analyzed and cramped out,some of that with the tool of HtmlParser.Thirdly,Lucene used to build a full-text index and retrieval service for the system.Finally,the system design a MVC connector.The system achieves design goals through the tests of response time,recall ratio and precision ratio.

Key words: Vertical search,Heritrix,Extraction,Index

[1] Lei Xiang,Xin Meng.A Data Mining Approach to Topic-Specific Web Resource Discovery[C]∥Second International Conference on Intelligent Computation Technology and Automation.2009,2:595-599
[2] Jia Y,Fan H,et al.Design of an Application Model Based onVertical Search Engine[C]∥Second International Conference on Networking and Distributed Computing.2011:57-60
[3] Wang Chuan,Chang Gui-ran,et al.An Architecture for Improving the Efficiency of Specialized Vertical Search Engine Based on GPGPUs[C]∥Fourth International Conference on Genetic and Evolutionary Computing.2010:67-70
[4] 王晔.垂直搜索引擎若干问题研究[D].上海:复旦大学,2011
[5] 刘育莲.手机产品垂直搜索引擎的设计与实现[D].西安:西安电子科技大学,2012
[6] 刘丽杰.垂直搜索引擎中聚焦爬虫技术的研究[D].哈尔滨:哈尔滨工程大学,2012
[7] 奉国和,郑伟.国内中文自动分词技术研究综述[J].图书情报工作,2011(2):43-47
[8] 刘琦.垂直搜索引擎的设计与开发[D].广州:中山大学,2010
[9] 罗刚.解密搜索引擎技术实战[M].北京:电子工业出版社,2011
[10] 邱哲,符涛涛,王学松.开发自己的搜索引擎[M].北京.人民邮电出版社,2010

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!