Multi-Language Identification Based on Character-Level Markov Models

计算机科学 (Computer Science) ›› 2006, Vol. 33 ›› Issue (1): 226-228.

Online: 2018-11-17 Published: 2018-11-17
Funding:
Supported by the National Natural Science Foundation of China (Grant No. 60272088).

Abstract

Abstract: Language identification is a necessary preprocessing step in machine translation and other multi-language processing tasks, but the identification of double-byte encoded languages, such as Chinese and Japanese, has not yet been adequately studied or tested experimentally. Using a Markov language model, this paper proposes and tests an efficient EM-based training algorithm. A performance analysis and a comparison with other algorithms are also given.

Key words: Character-based Markov models, Language identification, Machine translation, Multi-language, Markov model, Training algorithm, Preprocessing

Multi-Language Identification Based on Character-Level Markov Models [J]. 计算机科学 (Computer Science), 2006, 33(1): 226-228. https://doi.org/
