Computer Science ›› 2011, Vol. 38 ›› Issue (1): 240-245.

• Artificial Intelligence •

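Approach of Internet New Word Identification Based on Immune Genetic Algorithm

DING Jian-li, CI Xiang, HUANG Jian-xiong

  1. (College of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300); (Information Technology Research Base of Civil Aviation Administration of China, Tianjin 300300); (Information Management Department, Air China Limited, Beijing 100071)
  • Online: 2018-11-16 Published: 2018-11-16
  • Supported by:
    This work was supported by the National High Technology Research and Development Program of China (863 Program) (2006AA12A106) and the National Natural Science Foundation of China (60879015, 60572167).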

Abstract: As the Internet develops, new words keep emerging online, yet current word segmentation methods struggle to identify them promptly and accurately. This paper proposes an Internet new word identification method based on an immune genetic algorithm. Building on an analysis of the characteristics of Internet new words, the method uses the Chinese word group phenomenon and the concept of word position to extract exemplary antibodies, which are then injected in a targeted manner while the genetic algorithm runs. Experiments show that the method achieves a very high recognition rate for new words in segmentation fragments that conform to the word group phenomenon, and a largely satisfactory recognition rate for ordinary Internet new words.

Key words: Immune genetic algorithm, Chinese word group, Word position, Antibody, Internet new word identification
