Computer Science ›› 2017, Vol. 44 ›› Issue (Z11): 411-413.doi: 10.11896/j.issn.1002-137X.2017.11A.087

Previous Articles     Next Articles

Research on Text Data Topic Mining and Association Search

ZHU Wei-xing, XU Wei-guang, HE Hong-yue and LI Wen   

  • Online:2018-12-01 Published:2018-12-01

Abstract: Text data is the most natural way of storing and exchanging information.Text mining technology can disco-ver knowledge patterns hidden in massive text data.The text data mining and related search technology were studied in the paper,Firstly,text information is extracted by text parsing and extraction,word preprocessing and indexing.Then the theme information model based on latent semantic relations is used to mine the hidden topic information in large amount of text data.Finally,the topic model is used to calculate the relevance degree of keywords.In order to achieve the associated search,a prototype system of text data mining and association search is implemented.Subject discovery and association search were performed on Tancorp dataset,and the process of association search was displayed synchronously with visualization and Web page.

Key words: Text mining,Topic discovery,Association search

[1] 曹波伟,薛青.面向军事基础数据的数据挖掘研究[C]∥2009年系统仿真技术及其应用学术会议(CCSSTA’2009)论文集.2009.
[2] CORMEN T H,LEISERSON C E,RIVEST R L,et al.Introduction to Algorithms(Second Edition)[M].The MIT Press,2001.
[3] FELDMAN R,DAGAN I.KDT-Knowledge Discovery in Tex-tual Database [C]∥Proceedings of the 1st Annual Conference on Knowledge Discovery and DataMining.1995:112-117.
[4] MOTHE J,CHRISMENT C,DKAKI T.Information mining-use of the document dimensions to analyze interactively a document set[C]∥European Colloquium on Information Retrieval Research.2001:6-20.
[5] GHANEM M,CHORTARAS A,GUO Y,et al.A grid of infrastructure for mixed bioinformatics data and text mining[J].Computer Systems and Applications,2005,4(1):116-130.
[6] KARANIKAS H,TJORTJIS C,THEODOULIDIS B.An ap-proach to Text Mining using Information Extraction[C]∥Proceeding of the Fourth European Conference on Principles and Practice of Knowledge Discovery in Database.Lyon,France,2000:13-16.
[7] HU Q,YU D,DUAN Y,et al.A novel weighting formula and feature selection for text classification based on rough set theory [C]∥Proceedings of Natural Language Processing and Know-ledge Engineering.2003:638-645.
[8] KOSALA R,BLOCKEEL H.Web Mining Research:A Survey [C]∥ACM SIGKDD.2000:1-15.
[9] LI H,YAMANISHI K.Mining from Open Answers in Questionaire Data [C]∥Proc.of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2001:443-449.
[10] PONS-PORRATA A,BERLANGA-LAVORI R,RUI-SHU-LCLOPER J.Topic discovery based on text mining techniques[J].Information Processing and Management,2007,43(3):752-768.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!