Computer Science ›› 2010, Vol. 37 ›› Issue (3): 230-233.
• Artificial Intelligence •
韩习武,赵铁军
HAN Xi-wu,ZHAO Tie-jun
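Abstract (translated from Chinese): Using a large-scale sentence-aligned bilingual corpus, this paper carries out a systematic experiment in the statistical analysis of Chinese-English verb subcategorization correspondence types. First, with linguistic measures as heuristics, a statistical filtering method based on a two-fold maximum likelihood test is applied to obtain a preliminary estimate of the probability distribution over 654 Chinese-English subcategorization correspondence types. Next, the correspondence types are classified linguistically according to the syntactic characteristics of Chinese and English. Finally, for each correspondence type and its supporting corpus, linguistic class labels are assigned with a support vector machine and their statistical reliability is analyzed.
Keywords (translated from Chinese): Chinese-English verb subcategorization, statistical analysis, support vector machine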
Abstract: Based on a large-scale Chinese-English parallel corpus, this paper describes a systematic experiment in the statistical analysis of bilingual verb subcategorization. First, with lexical and grammatical compatibility as heuristics, the probability distributions of 654 bilingual subcategorization frames are estimated by means of a two-fold MLE filtering method. Then, a linguistic classification of the frames is determined according to Chinese and English syntax. Finally, linguistic classes for each frame are labeled via SVM on the basis of their supporting corpus.
Key words: Chinese-English verb subcategorization, Statistical analysis, SVM
韩习武,赵铁军. 汉英动词次范畴化对应类型的统计分析[J]. 计算机科学, 2010, 37(3): 230-233. https://doi.org/
HAN Xi-wu,ZHAO Tie-jun. Statistical Analysis for Chinese-English Verb Subcategorization[J]. Computer Science, 2010, 37(3): 230-233. https://doi.org/
Link to this article: https://www.jsjkx.com/CN/
https://www.jsjkx.com/CN/Y2010/V37/I3/230