计算机科学 (Computer Science), 2015, Vol. 42, Issue 12: 23-25.
罗宇翔,邹艳珍,金庸,谢冰
LUO Yu-xiang, ZOU Yan-zhen, JIN Yong and XIE Bing
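Abstract: Open source projects usually provide mailing lists to help users better understand and use them. However, because the volume of email is huge, message content is loosely organized, questions are often stated unclearly, and answers are hard to locate, users spend a great deal of time and effort finding a specific piece of software question-and-answer information when searching a mailing list. To address this, a method for extracting software question-and-answer information from mailing lists is proposed. Through simple classification and annotation of emails, the method automatically extracts question sentences and selects answer emails, which improves the efficiency with which users query mailing lists and learn open source software projects. Finally, experiments verify the effectiveness of the method.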
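The two automated steps named in the abstract, question-sentence extraction and answer-email selection, can be illustrated with a minimal sketch. The abstract does not give the paper's actual classification or annotation rules, so everything below is an assumption made for illustration: the cue-word list, the naive sentence splitter, and the use of bag-of-words cosine similarity to rank replies are stand-ins, and the function names are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical cue words; the paper's real annotation rules are not stated.
QUESTION_CUES = ("how", "what", "why", "where", "when", "which", "who")


def split_sentences(text):
    """Naively split text after '.', '?' or '!' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.?!])\s+", text) if s.strip()]


def extract_questions(body):
    """Return sentences that end with '?' or start with a cue word."""
    questions = []
    for sentence in split_sentences(body):
        words = sentence.lower().split()
        first = words[0] if words else ""
        if sentence.endswith("?") or first in QUESTION_CUES:
            questions.append(sentence)
    return questions


def bag_of_words(text):
    """Lowercased term-frequency vector over alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_answer(question, replies):
    """Pick the reply most similar to the question, or None if nothing matches."""
    q_vec = bag_of_words(question)
    best_reply, best_score = None, 0.0
    for reply in replies:
        score = cosine(q_vec, bag_of_words(reply))
        if score > best_score:
            best_reply, best_score = reply, score
    return best_reply


if __name__ == "__main__":
    body = ("Hi all. I upgraded to version 2 yesterday. "
            "How do I configure the index writer buffer size? Thanks.")
    replies = [
        "Please unsubscribe me from this list.",
        "You can configure the buffer size on the index writer config object.",
    ]
    for question in extract_questions(body):
        print(question, "->", select_answer(question, replies))
```

A real system would likely need quoted-text stripping, signature removal, and thread reconstruction before either step; these are omitted here to keep the sketch short.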