Computer Science ›› 2017, Vol. 44 ›› Issue (12): 232-238.doi: 10.11896/j.issn.1002-137X.2017.12.042

Previous Articles     Next Articles

Text Mining Algorithm and Application of Telecom Big Data

WANG Dong-sheng, HUANG Chuan-he, HUANG Xiao-peng and NI Qiu-fen   

  • Online:2018-12-01 Published:2018-12-01

Abstract: Major telecom data contain a large number of unstructured text data,which are difficult for conventional methods to mine information.Text mining can do better than conventional methods under this circumstance.Based on the text data,this paper proposed a new word identification algorithm and a named entity recognition algorithm.At this process,we analyzed the customers’ complaint texts and judged their categories,and then identified the user’s terminal types from their information,which provides better user supports and experiences for the telecom industry.Experiment results validate that the proposed algorithm achieves good performance for the identification of customers’ complaint texts in the telecom.

Key words: Telecom,Big data,Text mining,Pattern recognition,User’s terminal types

[1] SENBALC C,ALTUNTAS S,BOZKUS Z,et al.Big data paltform development with a domain specific language for telecom industries [C]∥High Capacity Optical Networks and Emerging/Enabling Technologies.2013:116-120.
[2] TSENG J C,TSENG H C,LIU C W.A successful application of big storage techniques implemented to criminal investigation for telecom [C]∥Network Operations and Management Sympo-sium (APNOMS).2013:1-3.
[3] JONY R I,HABIB A,MOHANMMED N,et al.Big Data Use Case Domains for Telecom Operates [C]∥IEEE International Conference on Smart City/SocialCom/SustainCom.2015:850-855.
[4] ZHONG N,LI Y F.Effective Pattern Discovery for Text Mining [J].IEEE Transactions on Knowledge and Data Engineering,2012,24(1):30-44.
[5] ELAGIB S B,HASHIM A H A,OLANREWAJU R F.CDRanalysis using Big Data technology[C]∥International Confe-rence on Computing,Control,Networking,Electronics and Embedded Systems Engineering (ICCNEEE).2015:467-471.
[6] DAM R V D.Big Data a Sure Thing for Telecommunications:Telecom’s Future in Big Data [C]∥Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC).2013:148-154.
[7] OUYANG Y,HU M M,HUET A,et al.Mining of leaders in mobile telecom social networks [C]∥Wireless Telecommunications Symposium (WTS).2016:1-4.
[8] HUANG W L,CHEN Z,DONG W Y,et al.Mobile Internet big data platform in China Unicom [J].Tsinghua Science and Technology,2014,19(1):95-101.
[9] CHETAN S B J,SRINIVASA K G.Large Scale Multi-labelText Classification of a Hierarchical Dataset using Rocchio algorithm [C]∥International Conference on Computational Systems and Information Systems for Sustainable Solutions.2016:291-296.
[10] YANG W C,FU Y M,ZHANG D.An Improved Parallel Algorithm for Text Categorization [C]∥International Symposium on Computer,Consumer and Control.2016:451-454.
[11] SANTOSO J,YUNIARNO E M,HARIADI M.Large ScaleText Classification using Map Reduce and Nave Bayes Algorithm for Domain Specified Ontology Building [C]∥7th International Conference on Intelligent Human-Machine Systems and Cybernetics.2015:428-432.
[12] YANG J,YANG M H.Top-Down Visual Saliency via Joint CRF and Dictionary Learning[C]∥ Computer Vision and Pattern Recognition.IEEE,2012:2296-2303.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!