Computer Science ›› 2021, Vol. 48 ›› Issue (10): 59-66.doi: 10.11896/jsjkx.200900180

• Artificial Intelligence • Previous Articles     Next Articles

Scientific Paper Summarization Using Word-Section Association

FU Ying, WANG Hong-ling, WANG Zhong-qing   

  1. School of Computer Science and Technology,Soochow University,Suzhou,Jiangsu 215006,China
  • Received:2020-09-24 Revised:2021-01-04 Online:2021-10-15 Published:2021-10-18
  • About author:FU Ying,born in 1994,postgraduate,is a member of China Computer Federation.Her main research interests include natural language processing and so on.
    WANG Hong-ling,born in 1975,professor.Her main research interests include natural language processing and so on.
  • Supported by:
    National Natural Science Foundation of China(61976146).

Abstract: With the development of science and technology,people need to access a large number of scientific and technological information quickly,and scientific paper is one of the main ways to carry scientific and technological information.As an important part of scientific paper,abstract is an effective tool for readers to retrieve literature.Therefore,the quality of abstract affects the retrieval rate of paper directly.However,due to the lack of writing experience,the quality of abstracts written by many authors is not high.Automatic generation of summary for scientific paper can help the author grasp the important content of paper more effectively,so as to write high-quality abstract.At the same time,the automatically generated abstract can also control the number of words in the abstract,which can bring more content to readers and help them understand the paper better.Generating automa-tic summarization for scientific paper can help author write abstract faster,which is one of the research contents in automatic summarization.Compared with common news document,scientific paper has the characteristics of strong structure and clear logical relationship.As far as the mainstream abstractive summarization such as encoder-decoder model is concerned,it mainly consi-ders the serialized information in the document,and rarely explores the text structure information in the document.For this reason,according to the characteristics in scientific papers,this paper proposes an automatic summarization model based on the hie-rarchical structure of “word-section-document”,which uses the association between word and section to enhance the level of text structure and the interaction between levels,so as to screen out the key information in scientific paper.In addition,a context gate unit is extended to update the optimized context vector,thus capturing context information more comprehensively.The experimental results show that the proposed model can effectively improve the performance of the generated summarization in the ROUGE evaluation method.

Key words: Abstractive summarization, Automatic summarization, Hierarchical structure, Scientific paper summarization, Text structure

CLC Number: 

  • TP18
[1]YU H.Standard editing of “purpose” elements in abstracts of scientific papers[J].Journal of Liaoning Teachers College (Natu-ral Science Edition),2020,22,85(1):110-112.
[2]ZHANG Y,WANG Z Q,WANG H L.Research on single document extraction summarization method based on the relationship between primary and secondary text[J].Chinese Journal of information technology,2019,33(8):67-76.
[3]NALLAPATI R,ZHOU B,GULCEHRE C,et al.Abstractive text summarization using sequence-to-sequence rnns and beyond[J].arXiv:1602.06023,2016.
[4]XU Y,LAU J H,BALDWIN T,et al.Decoupling encoder and decoder networks for abstractive document summarization[C]//Proceedings of the MultiLing 2017 Workshop on Summarization and Summary Evaluation Across Source Types and Genres.2017:7-11.
[5]CHO K,VAN MERRIËNBOER B,GULCEHRE C,et al.Learning phrase representations using RNN encoder-decoder for statistical machine translation[J].arXiv:1406.1078,2014.
[6]XU H,HE Y,HAN K,et al.Learning Syntactic and Dynamic Selective Encoding for Document Summarization[C]//2019 International Joint Conference on Neural Networks (IJCNN).IEEE,2019:1-8.
[7]HERMANN K M,KOCISKY T,GREFENSTETTE E,et al.Teaching machines to read and comprehend[C]//Advances in Neural Information Processing Systems.2015:1693-1701.
[8]XU F,ZHU Q M,ZHOU G D.Review of text analysis technology[J].Chinese Journal of Information Technology,2013,27(3):20-33.
[9]TEUFEL S,MOENS M.Summarizing scientific articles:experiments with relevance and rhetorical status[J].Computational Linguistics,2002,28(4):409-445.
[10]COLLINS E,AUGENSTEIN I,RIEDEL S.A supervised ap-proach to extractive summarisation of scientific papers[J].ar-Xiv:1706.03946,2017.
[11]FORMAN G.BNS feature scaling:an improved representation over tf-idf for svm text classification[C]//Proceedings of the 17th ACM Conference on Information and Knowledge Management.2008:263-270.
[12]XIAO W,CARENINI G.Extractive summarization of long do-cuments by combining global and local context[J].arXiv:1909.08089,2019.
[13]KIM M,SINGH M D,LEE M.Towards abstraction from extraction:multiple timescale gated recurrent unit for summarization[J].arXiv:1607.00718,2016.
[14]COHAN A,DERNONCOURT F,KIM D S,et al.A discourse-aware attention model for abstractive summarization of long do-cuments[J].arXiv:1804.05685,2018.
[15]LIU K,WANG H L.Coherence of Automatic SummarizationBased on Discourse Rhetoric Structure[J].Chinese Journal of Information Technology,2019,33(1):77-84.
[16]WU R S,ZHANG Y F,WANG H L,et al.Generative Automa-tic Summarization Based on Hierarchical Structure[J].Chinese Journal of Information Technology,2019,33 (10):90-98.
[17]SEE A,LIU P J,MANNING C D.Get to the point:Summarization with pointer-generator networks[J].arXiv:1704.04368,2017.
[18]HOCHREITER S,SCHMIDHUBER J.Long short-term memory[J].Neural Computation,1997,9(8):1735-1780.
[19]ZHOU P,SHI W,TIAN J,et al.Attention-based bidirectional long short-term memory networks for relation classification[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics.2016:207-212.
[20]LIN J,SUN X,MA S,et al.Global encoding for abstractivesummarization[J].arXiv:1805.03989,2018.
[21]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.Imagenet classification with deep convolutional neural networks[C]//Advances in Neural Information Processing Systems.2012:1097-1105.
[22]NAIR V,HINTON G E.Rectified linear units improve restric-ted boltzmann machines[C]//ICML.2010.
[23]WANG K,QUAN X,WANG R.Biset:Bi-directional selective encoding with template for abstractive summarization[J].ar-Xiv:1906.05012,2019.
[24]LIN C Y,GAO J,CAO G,et al.Automatic evaluation of summaries:U.S.Patent 7,725,442[P].2010-5-25.
[25]COHAN A,GOHARIAN N.Scientific article summarizationusing citation-context and article's discourse structure[J].ar-Xiv:1704.06619,2017.
[1] GUO Yu-xin, CHEN Xiu-hong. Automatic Summarization Model Combining BERT Word Embedding Representation and Topic Information Enhancement [J]. Computer Science, 2022, 49(6): 313-318.
[2] LI Xin-chao, LI Pei-feng, ZHU Qiao-ming. Directed Network Representation Method Based on Hierarchical Structure Information [J]. Computer Science, 2021, 48(2): 100-104.
[3] ZHOU Wei, WANG Zhao-yu, WEI Bin. Abstractive Automatic Summarizing Model for Legal Judgment Documents [J]. Computer Science, 2021, 48(12): 331-336.
[4] ZHANG Yi-fei,WANG Zhong-qing,WANG Hong-ling. Product Review Summarization Using Discourse Hierarchical Structure [J]. Computer Science, 2020, 47(2): 195-200.
[5] QIAN Ling-long, WU Jiao, WANG Ren-feng, LU Hui-juan. Multi-document Automatic Summarization Based on Sparse Representation [J]. Computer Science, 2020, 47(11A): 97-105.
[6] WANG Yan-bin, WU You-xi, LIU Hong-pu. Research and Application of Ensemble Learning Using Gradient Optimization Decision Tree [J]. Computer Science, 2018, 45(11A): 121-125.
[7] YU Shan-shan, SU Jin-dian and LI Peng-fei. Improved TextRank-based Method for Automatic Summarization [J]. Computer Science, 2016, 43(6): 240-247.
[8] XIE Hao and SUN Wei. Paragraph-Sentence Mutual Reinforcement Based Automatic Summarization Algorithm [J]. Computer Science, 2013, 40(Z11): 246-250.
[9] WANG Nan,SUN Shan-wu,OUYANG Dan-tong. Hierarchical Abstraction Model Based on System-centered Ontology [J]. Computer Science, 2011, 38(8): 189-192.
[10] ZHAO An-ping,REN Ke,WANG Xiao-yong,QIU Yu-hui. Hierarchical Structure Based Service Semantic Relationship Discovery [J]. Computer Science, 2011, 38(2): 210-213.
[11] WU Jing,GU Guo-chang,HOU Guo-zhao,SUN Ting. Multi-layer Centerless Hierarchical Structure in Ad Hoc Networks [J]. Computer Science, 2010, 37(4): 110-.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!