Computer Science ›› 2018, Vol. 45 ›› Issue (8): 203-207.doi: 10.11896/j.issn.1002-137X.2018.08.036

• Artificial Intelligence • Previous Articles     Next Articles

Improved Bayesian Algorithm Based Automatic Classification Method for Bibliography

YANG Xiao-hua1, GAO Hai-yun2   

  1. Zhicheng College,Fuzhou University,Fuzhou 350002,China1
    College of Physics and Information Engineering,Fuzhou University,Fuzhou 350016,China2
  • Received:2018-02-28 Online:2018-08-29 Published:2018-08-29

Abstract: Bayesian algorithm is widely used in the field of automatic classification for bibliography.This method usually adopts differential evolution method to estimate the probability items.However,the traditional differential evolution method is easy to fall into the local optimum when estimating the probability items,which reduces the accuracy of Bayesian classifcation.In view of this problem,this paper proposed an improved Bayesian algorithm based automatic classification method for bibliography.In this method,the optimal solution of probability items is estimated through multi-parent mutation and crossover operation,which improves the accuracy of Bayesian classification.In the process of automatic classification for bibliography,the ICTCLAS system is used to preprocess the text and then extract the term frequency-inverse document frequency features of texts.Then,the improved Bayesian estimation method is utilized to train and classify the features.Finally,the automatic classification for bibliography is achieved.Simulation results show that this method has a high classification accuracy.

Key words: Automatic classification for bibliography, Bayesian algorithm, Differential evolution, Feature extraction

CLC Number: 

  • TP391
[1]MURTAGH F,KURTZ M J.The Classification Society’s Biblio-graphy Over Four Decades:History and Content Analysis[J].Journal of Classification,2016,33(1):6-29.
[2]KLEIN K.A Review of Bibliography Complex:Fundamentals of Librarianship and Knowledge Management[J].Cataloging & Classification Quarterly,2014,52(3):341-342.
[3]WELDON S P.Organizing knowledge in the Isis bibliographyfrom Sarton to the early twenty-first century[J].Isis,2013,104(3):540-550.
[4]ARGAMON S,WHITELAW C,CHASE P,et al.Stylistic textclassification using functional lexical features[J].Journal of the Association for Information Science & Technology,2014,58(6):802-822.
[5]LIN Y S,JIANG J Y,LEE S J.A Similarity Measure for Text Classification and Clustering[J].IEEE Transactions on Know-ledge & Data Engineering,2015,26(7):1575-1590.
[6]UYSAL A K,GUNAL S.The impact of preprocessing on text classification[J].Information Processing & Management,2014,50(1):104-112.
[7]D’ASPREMONT A.Predicting abnormal returns from newsusing text classification[J].Quantitative Finance,2015,15(6):999-1012.
[8]SHANG C,LI M,FENG S,et al.Feature selection via maximizing global information gain for text classification[J].Know-ledge-Based Systems,2013,54(4):298-309.
[9]KANAAN G,AL-SHALABI R,GHWANMEH S,et al.A comparison of text-classification techniques applied to Arabic text[J].Journal of the American Society for Information Science & Technology,2014,60(9):1836-1844.
[10]KHORSHEED M S.Comparative evaluation of text classification techniques using a large diverse Arabic dataset[J].Language Resources & Evaluation,2013,47(2):513-538.
[11]ABUERRUB A.Arabic Text Classification Algorithm usingTFIDF and Chi Square Measurements[J].International Journal of Computer Applications,2014,93(6):40-45.
[12]HU J,YAO Y.Research on the Application of an ImprovedTFIDF Algorithm in Text Classification[J].Journal of Convergence Information Technology,2013,8(7):639-646.
[13]GHAG K,SHAH K.SentiTFIDF-Sentiment Classification using Relative Term Frequency Inverse Document Frequency[J].International Journal of Advanced Computer Science & Applications,2014,5(2):36-43.
[14]BILAL M,ISRAR H,SHAHID M,et al.Sentiment classification of Roman-Urdu opinions using Navie Baysian,Decision Tree and KNN classification techniques[J].Journal of King Saud University-Computer and Information Sciences,2016,28(3):330-344.
[15]CHEN R,CHEN F,SUN Y.Research on Automatic Text Classification Algorithm Based on ITF-IDF and KNN[J].Applied Mechanics & Materials,2015,713-715:1830-1834.
[16]FENG G,WANG H,SUN T,et al.A Term Frequency BasedWeighting Scheme Using Naïve Bayes for Text Classification[J].Journal of Computational & Theoretical Nanoscience,2016,13(1):319-326.
[17]GONG W,CAI Z.Differential evolution with ranking-based mutation operators[J].IEEE Transactions on Cybernetics,2013,43(6):2066-2081.
[18]YANG M,GU J.Study and Apply of Chinese BibliographiesAutomatic Classification Based on Support Vector Machine[J].Library and Information Service,2012,56(9):114-119.(in Chinese)杨敏,谷俊.基于SVM的中文书目自动分类及应用研究[J].图书情报工作,2012,56(9):114-119.
[19]PAULINAS M.A survey of genetic algorithms applications for image enhancement and segmentation[J].Information Techno-logy & Control,2015,36(3):278-284.
[20]JIN X R,QI J D,WANG L C,et al.Approach of classification mapping between international patent--classification and chinese library classification based on machine learning[J].Journal of Computer Applications,2011,31(7):1781-1784.(in Chinese)靳雪茹,齐建东,王立臣,等.基于机器学习的类目映射方法——国际专利分类法与中国图书馆分类法[J].计算机应用,2011,31(7):1781-1784.
[21]YANG B,HAN Q W,LEI M,et al.Short Text Clssification Algorithm Based on Improved TF-IDF Weight.Journal of Chongqing University of Technology(Natural Science),2016,30(12):103-113.(in Chinese)杨彬,韩庆文,雷敏,等.基于改进TF-IDF权重的短文本分类算法.重庆理工大学学报(自然科学),2016,30(12):103-113.
[1] ZHANG Yuan, KANG Le, GONG Zhao-hui, ZHANG Zhi-hong. Related Transaction Behavior Detection in Futures Market Based on Bi-LSTM [J]. Computer Science, 2022, 49(7): 31-39.
[2] ZENG Zhi-xian, CAO Jian-jun, WENG Nian-feng, JIANG Guo-quan, XU Bin. Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism [J]. Computer Science, 2022, 49(7): 106-112.
[3] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[4] LIU Wei-ye, LU Hui-min, LI Yu-peng, MA Ning. Survey on Finger Vein Recognition Research [J]. Computer Science, 2022, 49(6A): 1-11.
[5] LI Dan-dan, WU Yu-xiang, ZHU Cong-cong, LI Zhong-kang. Improved Sparrow Search Algorithm Based on A Variety of Improved Strategies [J]. Computer Science, 2022, 49(6A): 217-222.
[6] LIU Bao-bao, YANG Jing-jing, TAO Lu, WANG He-ying. Study on Prediction of Educational Statistical Data Based on DE-LSTM Model [J]. Computer Science, 2022, 49(6A): 261-266.
[7] GAO Yuan-hao, LUO Xiao-qing, ZHANG Zhan-cheng. Infrared and Visible Image Fusion Based on Feature Separation [J]. Computer Science, 2022, 49(5): 58-63.
[8] ZUO Jie-ge, LIU Xiao-ming, CAI Bing. Outdoor Image Weather Recognition Based on Image Blocks and Feature Fusion [J]. Computer Science, 2022, 49(3): 197-203.
[9] REN Shou-peng, LI Jin, WANG Jing-ru, YUE Kun. Ensemble Regression Decision Trees-based lncRNA-disease Association Prediction [J]. Computer Science, 2022, 49(2): 265-271.
[10] ZHANG Shi-peng, LI Yong-zhong. Intrusion Detection Method Based on Denoising Autoencoder and Three-way Decisions [J]. Computer Science, 2021, 48(9): 345-351.
[11] FENG Xia, HU Zhi-yi, LIU Cai-hua. Survey of Research Progress on Cross-modal Retrieval [J]. Computer Science, 2021, 48(8): 13-23.
[12] ZHANG Li-qian, LI Meng-hang, GAO Shan-shan, ZHANG Cai-ming. Summary of Computer-assisted Tongue Diagnosis Solutions for Key Problems [J]. Computer Science, 2021, 48(7): 256-269.
[13] BAO Yu-xuan, LU Tian-liang, DU Yan-hui, SHI Da. Deepfake Videos Detection Method Based on i_ResNet34 Model and Data Augmentation [J]. Computer Science, 2021, 48(7): 77-85.
[14] CHEN Yang, WANG Jin-liang, XIA Wei, YANG Hao, ZHU Run, XI Xue-feng. Footprint Image Clustering Method Based on Automatic Feature Extraction [J]. Computer Science, 2021, 48(6A): 255-259.
[15] LI Na-na, WANG Yong, ZHOU Lin, ZOU Chun-ming, TIAN Ying-jie, GUO Nai-wang. DDoS Attack Random Forest Detection Method Based on Secondary Screening of Feature Importance [J]. Computer Science, 2021, 48(6A): 464-467.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!