Computer Science ›› 2015, Vol. 42 ›› Issue (Z11): 49-54.

Previous Articles     Next Articles

Bi-direction Maximum Matching Method Based on Hash Structural Dictionary

CHEN Zhi-yan, LI Xiao-jie, ZHU Shu-hua, FU Dan-long and XING Yi-hai   

  • Online:2018-11-14 Published:2018-11-14

Abstract: In the Chinese natural language processing,aimming at the problem that ordinary dictionary cannot be used for reverse maximum matching method and it is difficult to maintain a reverse dictionary,we put forward a new kind of dictionary structure and corresponding bi-direction maximum matching method,and added mutual information ambiguity processing block in the algorithm.Compared with the previous maximum matching method,this algorithm can increase the segmentation accuracy significantly.It is applicable to some Chinese natural language processing systems which have high segmentation accuracy requirement.

Key words: Segmentation dictionary,Bi-direction maximum matching method,Single word index based on Hash structure,Mutual information ambiguity processing

[1] 奉国和,郑伟.国内中文自动分词技术研究综述[J].图书情报工作,2011,5(2):41-45
[2] 罗智勇,宋柔.现代汉语通用分词系统中歧义切分的实用技术[J].计算机研究与发展,2006,3(6):1122-1128
[3] 吴育良.百度中文分词技术浅析[J].河南图书馆学刊,2008(8):115-117
[4] 莫建文,郑阳,首照宇,等.改进的基于词典的中文分词方法[J].计算机工程与设计,2013,4(5):1802-1807
[5] 吴旭东.正向最大匹配分词算法的分析与改进[J].科技传播,2011(20)
[6] 王瑞雷,栾静,潘晓花,等.一种改进的中文分词正向最大匹配算法[J].计算机应用与软件,2011,8(3):195-197
[7] 张李义,李亚子.基于反序词典的中文逆向最大匹配分词系统设计[J].现代图书情报技术,2006(8):42-45
[8] 赵艳红,费洪晓.一个基于改进的反序分词词典的中文分词算法[J].深圳职业技术学院学报,2004,3(4):28-31
[9] 罗桂琼,费洪晓,戴弋.基于反序词典的中文分词技术研究[J].计算机技术与发展,2008,8(1):80-83
[10] 丁振国,张卓,黎靖.基于Hash结构的逆向最大匹配分词算法的改进[J].计算机工程与设计,2008,9(12):3208-3211

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!