Computer Science ›› 2020, Vol. 47 ›› Issue (6A): 429-435.doi: 10.11896/JsJkx.190700161

• Database & Big Data & Data Science • Previous Articles     Next Articles

Application of Improved GHSOM Algorithm in Civil Aviation Regulation Knowledge Map Construction

ZHANG Hao-yang and ZHOU Liang   

  1. School of Computer Science and Technology,NanJing University of Aeronautics and Astronautics,NanJing 211100,China
  • Published:2020-07-07
  • About author:ZHANG Hao-yang, born in 1994, postgraduate.His main research interests include natural language processing and so on.

Abstract: Aiming at the problems that the number of clusters cannot be dynamically changed and the text classification results are not accurate enough during the text clustering process,this paper introduces and improves the Growing Hierarchical Self-Organizing Map (GHSOM) algorithm to improve text clustering accuracy,and tries to use the improved GHSOM algorithm to build a knowledge map of civil aviation regulations.The GHSOM algorithm has a multi-level hierarchical structure,and each layer contains several independent growing SOMs.Through the growth of the scale,the data set is described in more detail to a certain extent,and the classification effect is improved.Based on this,taking various laws and regulations in the field of civil aviation as the sample data set,combined with Chinese word segmentation,keyword extraction,file vector and other technical means,the text is clustered and analyzed using the improved GHSOM algorithm,and finally the construction of civil aviation regulation knowledge map is completed.Experimental results show that the proposed algorithm has significant text clustering ability.The civil aviation regulation knowledge map constructed by this algorithm has achieved good classification results,and its evaluation indicators such as accuracy and recall rate have been further improved.

Key words: GHSOM, Knowledge map, Natural language processing, Text clustering, word2vec

CLC Number: 

  • TP391.1
[1] CHEN Q,LIAO K J,XI J Q.Research status and prospect of knowledge map.Intelligence magazine,2006,25(5):43-46.
[2] DAVENPORT T,PRUSAK L.Working knowledge:how orga
[3] nizations manage what they know.Boston:Harvard Busi-ness School Press,2008.
[4] EPPLER M J.Toward a pragmatic taxonomy of knowledge
[5] map-s:classification principles,sample typologies and applicationexamples//Tenth International Conference on InformationVisualization.2014.
[6] YE F B,TANG R Z.Knowledge Matching Method of Product Design Process Based on Knowledge Map.Journal of ZheJiang University:Engineering Science,2008,42(6):927-932.
[7] PAN C,WANG J,LIU L.A Knowledge Map Model Based on Concept Clustering.Systems Engineering-Theory & Practice,2007(2):126-132.
[8] LI M,LU X Z,CHEN L S,et al.Knowledge map construction for question and answer archives.Expert Systems With Applications,2020,141:112923.
[9] CHA W W.Disciplinary knowledge representation and quantification method of multiple independent parameters in knowledge maps .Wuhan:Central China Normal University,2017.
[10] FAN Y X.Design and implementation of personalized learning system based on knowledge map .Changsha:Hunan Normal University,2018.
[11] ZHAO T,YU L,ZHAO Q.Research on Digital Education Resource Service Model Based on Knowledge Map .Journal of Southwest China Normal University (Natural Science Edition),2019,44(11):136-141.
[12] BU W X.Based on improved GHSOM intrusion detection technology .TianJin:TianJin University,2016.
[13] CHEN L.Application of Improved GHSOM Algorithm in Text Clustering.Computer and Telecom,2016(5):57-61.
[14] TIAN W F.A Method of Feature Selection Based on Word2Vec in Text Categorization.https://kns.cnki.net/KCMS/detail/detail.aspx?dbcode=CPFD&dbname=CPFDLAST2018&filename=KZLL201807006157&v=MTc5MJNIWXJHNEg5bk-1xSTlGWXVvS0N4Tkt1aGRobmo5OFRuanFxeGRFZU1PVU-tyaWZaZVp2RUNubFU3Zk5KbG9VTGpm.
[15] XIE Z L,LI N,ZHOU C J.Research on Emotion Classification of Hotel Reviews Based on Word2vec.Journal of BeiJing Union University,2018,32(4):34-39.
[16] ZHAO Z B,SHI Y X,LI B Y.Newly-emerging Domain Word Detection Method Based on Syntactic Analysis and Term Vector.Computer Science,2019,46(6):29-34.
[17] LIU W J,LUO J X.Image Retrieval Based on Improved GHSOM Clustering Algorithm.Journal of East China University of Science and Technology(Natural Science),2015,41(2):216-221.
[1] YAN Jia-dan, JIA Cai-yan. Text Classification Method Based on Information Fusion of Dual-graph Neural Network [J]. Computer Science, 2022, 49(8): 230-236.
[2] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
[3] XIONG Luo-geng, ZHENG Shang, ZOU Hai-tao, YU Hua-long, GAO Shang. Software Self-admitted Technical Debt Identification with Bidirectional Gate Recurrent Unit and Attention Mechanism [J]. Computer Science, 2022, 49(7): 212-219.
[4] LI Xiao-wei, SHU Hui, GUANG Yan, ZHAI Yi, YANG Zi-ji. Survey of the Application of Natural Language Processing for Resume Analysis [J]. Computer Science, 2022, 49(6A): 66-73.
[5] ZHANG Hu, BAI Ping. Graph Convolutional Networks with Long-distance Words Dependency in Sentences for Short Text Classification [J]. Computer Science, 2022, 49(2): 279-284.
[6] CHEN Zhi-yi, SUI Jie. DeepFM and Convolutional Neural Networks Ensembles for Multimodal Rumor Detection [J]. Computer Science, 2022, 49(1): 101-107.
[7] WANG Li-mei, ZHU Xu-guang, WANG De-jia, ZHANG Yong, XING Chun-xiao. Study on Judicial Data Classification Method Based on Natural Language Processing Technologies [J]. Computer Science, 2021, 48(8): 80-85.
[8] WU Yu, LI Zhou-jun. Survey on Retrieval-based Chatbots [J]. Computer Science, 2021, 48(12): 278-285.
[9] TONG Xin, WANG Bin-jun, WANG Run-zheng, PAN Xiao-qin. Survey on Adversarial Sample of Deep Learning Towards Natural Language Processing [J]. Computer Science, 2021, 48(1): 258-267.
[10] TIAN Ye, SHOU Li-dan, CHEN Ke, LUO Xin-yuan, CHEN Gang. Natural Language Interface for Databases with Content-based Table Column Embeddings [J]. Computer Science, 2020, 47(9): 60-66.
[11] LU Long-long, CHEN Tong, PAN Min-xue, ZHANG Tian. CodeSearcher:Code Query Using Functional Descriptions in Natural Languages [J]. Computer Science, 2020, 47(9): 1-9.
[12] ZHANG Ying, ZHANG Yi-fei, WANG Zhong-qing and WANG Hong-ling. Automatic Summarization Method Based on Primary and Secondary Relation Feature [J]. Computer Science, 2020, 47(6A): 6-11.
[13] WU Xiao-kun, ZHAO Tian-fang. Application of Natural Language Processing in Social Communication:A Review and Future Perspectives [J]. Computer Science, 2020, 47(6): 184-193.
[14] HU Chao-wen, YANG Ya-lian, WU Chang-xing. Survey of Implicit Discourse Relation Recognition Based on Deep Learning [J]. Computer Science, 2020, 47(4): 157-163.
[15] YU Shan-shan, SU Jin-dian, LI Peng-fei. Sentiment Classification Method for Sentences via Self-attention [J]. Computer Science, 2020, 47(4): 204-210.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!