计算机科学 ›› 2013, Vol. 40 ›› Issue (Z11): 246-250.

• 数据存储与挖掘 • 上一篇    下一篇

基于段落-句子互增强的自动文摘算法

谢浩,孙伟   

  1. 中山大学信息科学与技术学院 广州510006;中山大学软件学院 广州510006
  • 出版日期:2018-11-16 发布日期:2018-11-16

Paragraph-Sentence Mutual Reinforcement Based Automatic Summarization Algorithm

XIE Hao and SUN Wei   

  • Online:2018-11-16 Published:2018-11-16

摘要: 句子排序问题是文本自动摘要的核心问题,基于互增强关系(MRP)的基本思想,提出一种新的句子排序模型——段落-句子互增强模型。利用段落关系,通过段落句子的互增强,迭代计算出句子的显著度,抽取出文摘句。分析了模型中的内、外影响因子对算法效果的影响并对冗余处理进行了讨论。实验表明,将其运用在单文本自动摘要中,能取得高质量的文摘。

关键词: 句子排序,互增强关系,自动文摘

Abstract: Sentence ranking is the key issue of text automatic summarization.Based on mutual reinforcement principles,we proposed a new sentence ranking model——paragraph-sentence mutual reinforcement model.With the relation between paragraphs and the mutual reinforcement between paragraphs and sentences,it iteratively computes the salience of the sentences and extract the summary sentences.We analyzed the effect of the internal and external reinforced factor and discussed the problem of redundancy remove.Experiments show that it can extract high quality summary when it applies to the single-document summarization.

Key words: Sentence ranking,Mutual reinforcement principle,Automatic summarization

[1] Langville A N,Meyer C D.Deeper inside PageRank[J].Journal of Internet Mathematics,2004,1(3):335-380
[2] Kleinberg J M.Authoritative Sources in a Hyperlinked Environment[J].Journal of the ACM,1999,46(5):604-632
[3] Erkan G,Radev D R.LexRank:graph-based centrality as salience in text summarization[J].Journal of Aritficial Intelligence Research,2004,22:457-479
[4] Mihalcea R,Tarau.TextRank:Bringing Order into Text[C]∥Proceedings of EMNLP.Barcelona Spain,2004:404-411
[5] Zhu Xiao-jin,Goldberg A B,Van Gael J,et al.Improving Diversity in Ranking using Absorbing Random Walks[C]∥Procee-dings of NAACL HLT,2007.Rochester NY,2007:97-104
[6] Zha Hong-yuan.Generic Summarization and Key Phrase Extraction using Mutual Reinforcement Principle and Sentence Clustering[C]∥Proceedings of ACM SIGIR,2002.Tampere Finland,2002:113-120
[7] Wei Fu-ru,Li Wen-jie,Lu Qin,et al.Applying Two-Level Reinforcement Ranking in Query-Oriented Multidocument Summarization[J].Journal of the American Society for Information Science and Technology,2009,60(10):2119-2131
[8] Wei Fu-ru,Li Wen-jie,Lu Qin,et al.Query-Sensitive Mutual Reinforcement Chain and Its Application in Query-Oriented Multi-Document Summarization[C]∥SIGIR,2008.Singapore,2008
[9] Bollegala D,Matsuo Y,Ishizuka M.Measuring Semantic Similarity between Words using Web Search Engines[C]∥Procee-dings of WWW.2007:757-766
[10] Cilibrasi R L,Vitanyi P M B.The Google Similairity Distance[J].IEEE Transactions on Knowledge and Data Engineering,2007,19(3):370-383
[11] Sahami M,Heliman T D.A Web-based Kernel Function forMeasuring the Similarity of Short Text Snippets[C]∥Procee-dings of WWW.2006:377-386
[12] Che Wan-xiang,Li Zheng-hua,Liu Ting.LTP:A Chinese Language Technology Platform[C]∥Proceedings of the Coling 2010:Demonstrations.Beijing,China,2010:13-16
[13] Lin C-Y,Hovy E H.Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics [C]∥Proceeding of 2003Language Technology Conference(HLT-NAACL 2003).Canada,2003
[14] Fellbaum C.WordNet[Z].Theory and Application of Ontolo-gy:Computer Applications,2010:231-243
[15] Jones K S.A Statistical Interpretation of Term Specificity and Its Application in Retrieval[J].Journal of documentation,1972,28(1):11-21

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!