Inicio  /  Information  /  Vol: 12 Par: 1 (2021)  /  Artículo
ARTÍCULO
TITULO

A Framework for Generating Extractive Summary from Multiple Malayalam Documents

K. Manju    
S. David Peter and Sumam Mary Idicula    

Resumen

Automatic extractive text summarization retrieves a subset of data that represents most notable sentences in the entire document. In the era of digital explosion, which is mostly unstructured textual data, there is a demand for users to understand the huge amount of text in a short time; this demands the need for an automatic text summarizer. From summaries, the users get the idea of the entire content of the document and can decide whether to read the entire document or not. This work mainly focuses on generating a summary from multiple news documents. In this case, the summary helps to reduce the redundant news from the different newspapers. A multi-document summary is more challenging than a single-document summary since it has to solve the problem of overlapping information among sentences from different documents. Extractive text summarization yields the sensitive part of the document by neglecting the irrelevant and redundant sentences. In this paper, we propose a framework for extracting a summary from multiple documents in the Malayalam Language. Also, since the multi-document summarization data set is sparse, methods based on deep learning are difficult to apply. The proposed work discusses the performance of existing standard algorithms in multi-document summarization of the Malayalam Language. We propose a sentence extraction algorithm that selects the top ranked sentences with maximum diversity. The system is found to perform well in terms of precision, recall, and F-measure on multiple input documents.

 Artículos similares

       
 
Vinh Pham, Maxim Tyan, Tuan Anh Nguyen and Jae-Woo Lee    
Multi-fidelity surrogate modeling (MFSM) methods are gaining recognition for their effectiveness in addressing simulation-based design challenges. Prior approaches have typically relied on recursive techniques, combining a limited number of high-fidelity... ver más
Revista: Aerospace

 
Wajeeh Daher, Hussam Diab and Anwar Rayan    
In recent years, artificial intelligence (AI) has emerged as a valuable resource for teaching and learning, and it has also shown promise as a tool to help solve problems. A tool that has gained attention in education is ChatGPT, which supports teaching ... ver más
Revista: Information

 
Liming Lao, Dangkui Du and Pengzhan Chen    
This paper proposes a novel prediction model termed the social and spatial attentive generative adversarial network (SSA-GAN). The SSA-GAN framework utilizes a generative approach, where the generator employs social attention mechanisms to accurately mod... ver más
Revista: Algorithms

 
Valeria Mercuri, Martina Saletta and Claudio Ferretti    
As the prevalence and sophistication of cyber threats continue to increase, the development of robust vulnerability detection techniques becomes paramount in ensuring the security of computer systems. Neural models have demonstrated significant potential... ver más
Revista: Algorithms

 
Yunfei Gao, Guogui Huang, Yinxi Li, Junyuan Zhang, Zeng Yang and Meng Wang    
Homogenization methods can characterize the mechanical properties of these materials based on appropriate constitutive models and data. They are also applied to the characterization of mechanical parameters under complex geotechnical conditions in geotec... ver más
Revista: Applied Sciences