Computer Science ›› 2011, Vol. 38 ›› Issue (1): 240-245.
• Artificial Intelligence •
丁建立,慈祥,黄剑雄
DING Jian-li, CI Xiang, HUANG Jian-xiong
Abstract: As the Internet develops, new Internet words emerge continually, yet current word segmentation methods struggle to identify them promptly and accurately. To address this, a method for identifying new Internet words using an immune genetic algorithm is proposed. Based on an analysis of the characteristics of new Internet words, the method uses the Chinese word-group phenomenon and the concept of word position to extract exemplary antibodies, and injects these antibodies in a targeted manner while the genetic algorithm runs. Experiments show that the method achieves a very high recognition rate for new words in segmentation fragments that conform to the word-group phenomenon, and that its recognition rate for ordinary new Internet words is also largely satisfactory.
Key words: Immune genetic algorithm, Chinese word group, Word position, Antibody, Internet new word identification
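The abstract gives no implementation details, so the following is only a minimal illustrative sketch of the general idea of injecting an exemplary antibody into a genetic algorithm. Everything in it is an assumption rather than the authors' method: the boundary bit-vector encoding, the `GROUP_SUFFIXES` set standing in for word-group knowledge, the hypothetical corpus `counts` used as a fitness signal, and all function names and parameter values.

```python
import random

# Hypothetical word-group suffixes (words such as "月光族" and "上班族" share "族").
# Illustrative only; not taken from the paper.
GROUP_SUFFIXES = {"族", "门", "奴"}

def decode(chars, bits):
    """Turn a boundary bit-vector into words; bits[i] == 1 means a cut after chars[i]."""
    words, start = [], 0
    for i, b in enumerate(bits):
        if b:
            words.append("".join(chars[start:i + 1]))
            start = i + 1
    words.append("".join(chars[start:]))
    return words

def vaccinate(chars, bits):
    """Inject the exemplar pattern: a group suffix attaches to the left and ends a word."""
    bits = list(bits)
    for j, ch in enumerate(chars):
        if ch in GROUP_SUFFIXES and j > 0:
            bits[j - 1] = 0          # no cut just before the suffix character
            if j < len(bits):
                bits[j] = 1          # cut right after the suffix character
    return bits

def fitness(chars, bits, counts):
    """Toy score (assumed): corpus count times length, summed over decoded words."""
    return sum(counts.get(w, 0) * len(w) for w in decode(chars, bits))

def immune_ga(text, counts, pop_size=20, generations=30,
              p_mut=0.1, p_vaccine=0.3, seed=0):
    rng = random.Random(seed)
    chars = list(text)
    n = len(chars) - 1               # number of possible cut positions
    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        # Elitist selection: the better half survives unchanged.
        pop.sort(key=lambda b: fitness(chars, b, counts), reverse=True)
        parents = pop[:pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randint(0, n)  # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ 1 if rng.random() < p_mut else bit for bit in child]
            if rng.random() < p_vaccine:
                child = vaccinate(chars, child)   # targeted antibody injection
            children.append(child)
        pop = parents + children
    best = max(pop, key=lambda b: fitness(chars, b, counts))
    return decode(chars, best)

if __name__ == "__main__":
    # Made-up counts for a made-up fragment.
    toy_counts = {"他": 5, "是": 5, "月光": 2, "月光族": 3, "族": 1}
    print(immune_ga("他是月光族", toy_counts))
```

In this toy setting the injection step simply biases some offspring toward segmentations in which a word-group suffix closes a word, while ordinary crossover and mutation continue to explore the remaining cut positions.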
丁建立,慈祥,黄剑雄. 一种基于免疫遗传算法的网络新词识别方法[J]. 计算机科学, 2011, 38(1): 240-245. https://doi.org/
DING Jian-li, CI Xiang, HUANG Jian-xiong. Approach of Internet New Word Identification Based on Immune Genetic Algorithm[J]. Computer Science, 2011, 38(1): 240-245. https://doi.org/
Link to this article: https://www.jsjkx.com/CN/
https://www.jsjkx.com/CN/Y2011/V38/I1/240