Inicio  /  Information  /  Vol: 9 Par: 9 (2018)  /  Artículo
ARTÍCULO
TITULO

An Integrated Graph Model for Document Summarization

Kang Yang    
Kamal Al-Sabahi    
Yanmin Xiang and Zuping Zhang    

Resumen

Extractive summarization aims to produce a concise version of a document by extracting information-rich sentences from the original texts. The graph-based model is an effective and efficient approach to rank sentences since it is simple and easy to use. However, its performance depends heavily on good text representation. In this paper, an integrated graph model (iGraph) for extractive text summarization is proposed. An enhanced embedding model is used to detect the inherent semantic properties at the word level, bigram level and trigram level. Words with part-of-speech (POS) tags, bigrams and trigrams were extracted to train the embedding models. Based on the enhanced embedding vectors, the similarity values between the sentences were calculated in three perspectives. The sentences in the document were treated as vertexes and the similarity between them as edges. As a result, three different types of semantic graphs were obtained for every document, with the same nodes and different edges. These three graphs were integrated into one enriched semantic graph in a naive Bayesian fashion. After that, TextRank, which is a graph-based ranking algorithm, was applied to rank the sentences, before the top scored sentences were selected for the summary according to the compression rate. Evaluated on the DUC 2002 and DUC 2004 datasets, our proposed method shows competitive performance compared to the state-of-the-art methods.

 Artículos similares

       
 
Wen Tian, Yining Zhang, Ying Zhang, Haiyan Chen and Weidong Liu    
To fully leverage the spatiotemporal dynamic correlations in air traffic flow and enhance the accuracy of traffic flow prediction models, thereby providing a more precise basis for perceiving congestion situations in the air route network, a study was co... ver más
Revista: Aerospace

 
Bae-Seon Park and Hak-Tae Lee    
This paper demonstrates the effectiveness of the Extended First-Come, First-Served (EFCFS) scheduler for integrated arrival and departure scheduling by comparing the scheduling results with the recorded operational data at Incheon International Airport (... ver más
Revista: Aerospace

 
Sorin Zoican, Roxana Zoican, Dan Galatchi and Marius Vochin    
This paper illustrates a general framework in which a neural network application can be easily integrated and proposes a traffic forecasting approach that uses neural networks based on graphs. Neural networks based on graphs have the advantage of capturi... ver más
Revista: Applied Sciences

 
Weikun Xie, Wenjing Qi, Xiaohui Lin and Houjun Wang    
With the rapid development of integrated circuit production technology, the scale of FPGA circuits has expanded to billions of gates. The complexity of the internal resource structures in the FPGAs (field programmable gate arrays) is continually increasi... ver más
Revista: Applied Sciences

 
Bikram Pratim Bhuyan, Vaishnavi Jaiswal and Amar Ramdane Cherif    
Investors at well-known firms are increasingly becoming interested in stock forecasting as they seek more effective methods to predict market behavior using behavioral finance tools. Accordingly, studies aimed at predicting stock performance are gaining ... ver más
Revista: Computers