Inicio  /  Information  /  Vol: 14 Par: 2 (2023)  /  Artículo
ARTÍCULO
TITULO

EverAnalyzer: A Self-Adjustable Big Data Management Platform Exploiting the Hadoop Ecosystem

Panagiotis Karamolegkos    
Argyro Mavrogiorgou    
Athanasios Kiourtis and Dimosthenis Kyriazis    

Resumen

Big Data is a phenomenon that affects today?s world, with new data being generated every second. Today?s enterprises face major challenges from the increasingly diverse data, as well as from indexing, searching, and analyzing such enormous amounts of data. In this context, several frameworks and libraries for processing and analyzing Big Data exist. Among those frameworks Hadoop MapReduce, Mahout, Spark, and MLlib appear to be the most popular, although it is unclear which of them best suits and performs in various data processing and analysis scenarios. This paper proposes EverAnalyzer, a self-adjustable Big Data management platform built to fill this gap by exploiting all of these frameworks. The platform is able to collect data both in a streaming and in a batch manner, utilizing the metadata obtained from its users? processing and analytical processes applied to the collected data. Based on this metadata, the platform recommends the optimum framework for the data processing/analytical activities that the users aim to execute. To verify the platform?s efficiency, numerous experiments were carried out using 30 diverse datasets related to various diseases. The results revealed that EverAnalyzer correctly suggested the optimum framework in 80% of the cases, indicating that the platform made the best selections in the majority of the experiments.

 Artículos similares

       
 
Jairo Fuentes, Jose Aguilar, Edwin Montoya and Ángel Pinto    
In this paper, we propose autonomous cycles of data analysis tasks for the automation of the production chains aimed to improve the productivity of Micro, Small and Medium Enterprises (MSMEs) in the context of agroindustry. In the autonomous cycles of da... ver más
Revista: Information

 
Yohanes Yohanie Fridelin Panduman, Nobuo Funabiki, Evianita Dewi Fajrianti, Shihao Fang and Sritrusta Sukaridhoto    
In this paper, we have developed the SEMAR (Smart Environmental Monitoring and Analytics in Real-Time) IoT application server platform for fast deployments of IoT application systems. It provides various integration capabilities for the collection, displ... ver más
Revista: Information

 
Iman I. M. Abu Sulayman, Peter Voege and Abdelkader Ouda    
The increasing significance of data analytics in modern information analysis is underpinned by vast amounts of user data. However, it is only feasible to amass sufficient data for various tasks in specific data-gathering contexts that either have limited... ver más
Revista: Information

 
Mansour Bayazidy, Mohammad Maleki, Aras Khosravi, Amir Mohammad Shadjou, Junye Wang, Rabee Rustum and Reza Morovati    
River water is one of the most important natural resources for economic development and environmental sustainability. However, river water systems are vulnerable in some of the densely populated regions across the globe. Intense sand mining and waste dis... ver más
Revista: Water

 
Emre Ercan, Muhammed Serdar Avci, Mahmut Pekedis and Çaglayan Hizal    
Structural health monitoring (SHM) plays a crucial role in extending the service life of engineering structures. Effective monitoring not only provides insights into the health and functionality of a structure but also serves as an early warning system f... ver más
Revista: Applied Sciences