Computer Science ›› 2015, Vol. 42 ›› Issue (Z11): 7-9.

Previous Articles     Next Articles

Disambiguation Algorithm Design and Implementation of Food Safety Issues in Network

LIU Jin-shuo, DENG Ying-ying and DENG Juan   

  • Online:2018-11-14 Published:2018-11-14

Abstract: The article aimed to put forward a disambiguation algorithm which can correctly classify the unknown terms,based on the food safety information in network.The disambiguation algorithms used in this paper combines the hidden Markov model(HMM) and SVM classifier to achieve terminology disambiguation,based on the improved TF-IDF feature selection algorithm.This paper proposed a new feature extraction algorithm LN-TF-IDF with two additional weighting factors on traditional TF-IDF.Experiments show that,the improved TF-IDF disambiguation algorithm designed in the field of food safety enhances the effect of disambiguation by average 7.31% on the 202831 texts.It was compared with the traditional TF-IDF text feature selection algorithm,with the F-measure as evaluation criteria.At the same time,the effect of the algorithm is relatively stable on different experimental data sets obtained from different time.

Key words: Food safety,Disambiguation,HMM,TF-IDF,SVM

[1] 龚凌晖.中文命名实体识别与歧义消解研究[D].上海:复旦大学,2011
[2] 何径舟,王厚峰.基于特征选择和最大熵模型的汉语词义消歧[J].软件学报,2010(6):1287-1295
[3] Pedersen T.A Decision Tree of Bigrams is an Accurate Predictor of Word Sense [C]∥Proceedings of the Second Meeting of the North American Chapter of the Association for Computational Linguistics(NAACL-01).Pittsburgh,PA,2001
[4] Hoffart J,Yosef M A,Bordino H,et al.Robust Disambiguation of Named Entities in Text[C]∥Proceedings of the 2011 Con-ference on Empirical Methods in Natural Language Processing.Edinburgh,Scotland,UK,2011:782-792
[5] 戴祥鹰.文本聚类在话题检测与人名消歧中的应用研究[D].哈尔滨:哈尔滨工业大学,2010
[6] 韩伟.人名消歧研究与实现[D].北京:北京大学,2014
[7] 李永亮,黄曙光,鲍蕾.一种基于PageRank算法和知网的词义消歧方法[J].计算机应用与软件,2011,8(4):213-215
[8] 徐钟.隐含马尔科夫模型在中文实体分类中的应用及研究[D].南昌:南昌大学,2012
[9] Mena B H,van K M.A Hybrid Approach for Robust Multilingual Toponym Extraction and Disambiguation [C]∥International Conference on Language Processing and Intelligent Information Systems.Warsaw,Poland,2013
[10] 廖浩,李志蜀,王秋野,等.基于词语关联的文本特征词提取方法[J].计算机应用,2007,27(12):3009-3012
[11] 平源.基于支持向量机的聚类及文本分类研究[D].北京:北京邮电大学,2012
[12] 范昕炜.支持向量机算法的研究及其应用[D].杭州:浙江大学,2003

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] . [J]. Computer Science, 2018, 1(1): 1 .
[2] LEI Li-hui and WANG Jing. Parallelization of LTL Model Checking Based on Possibility Measure[J]. Computer Science, 2018, 45(4): 71 -75 .
[3] SUN Qi, JIN Yan, HE Kun and XU Ling-xuan. Hybrid Evolutionary Algorithm for Solving Mixed Capacitated General Routing Problem[J]. Computer Science, 2018, 45(4): 76 -82 .
[4] ZHANG Jia-nan and XIAO Ming-yu. Approximation Algorithm for Weighted Mixed Domination Problem[J]. Computer Science, 2018, 45(4): 83 -88 .
[5] WU Jian-hui, HUANG Zhong-xiang, LI Wu, WU Jian-hui, PENG Xin and ZHANG Sheng. Robustness Optimization of Sequence Decision in Urban Road Construction[J]. Computer Science, 2018, 45(4): 89 -93 .
[6] SHI Wen-jun, WU Ji-gang and LUO Yu-chun. Fast and Efficient Scheduling Algorithms for Mobile Cloud Offloading[J]. Computer Science, 2018, 45(4): 94 -99 .
[7] ZHOU Yan-ping and YE Qiao-lin. L1-norm Distance Based Least Squares Twin Support Vector Machine[J]. Computer Science, 2018, 45(4): 100 -105 .
[8] LIU Bo-yi, TANG Xiang-yan and CHENG Jie-ren. Recognition Method for Corn Borer Based on Templates Matching in Muliple Growth Periods[J]. Computer Science, 2018, 45(4): 106 -111 .
[9] GENG Hai-jun, SHI Xin-gang, WANG Zhi-liang, YIN Xia and YIN Shao-ping. Energy-efficient Intra-domain Routing Algorithm Based on Directed Acyclic Graph[J]. Computer Science, 2018, 45(4): 112 -116 .
[10] CUI Qiong, LI Jian-hua, WANG Hong and NAN Ming-li. Resilience Analysis Model of Networked Command Information System Based on Node Repairability[J]. Computer Science, 2018, 45(4): 117 -121 .