Inicio  /  Information  /  Vol: 14 Par: 2 (2023)  /  Artículo
ARTÍCULO
TITULO

EverAnalyzer: A Self-Adjustable Big Data Management Platform Exploiting the Hadoop Ecosystem

Panagiotis Karamolegkos    
Argyro Mavrogiorgou    
Athanasios Kiourtis and Dimosthenis Kyriazis    

Resumen

Big Data is a phenomenon that affects today?s world, with new data being generated every second. Today?s enterprises face major challenges from the increasingly diverse data, as well as from indexing, searching, and analyzing such enormous amounts of data. In this context, several frameworks and libraries for processing and analyzing Big Data exist. Among those frameworks Hadoop MapReduce, Mahout, Spark, and MLlib appear to be the most popular, although it is unclear which of them best suits and performs in various data processing and analysis scenarios. This paper proposes EverAnalyzer, a self-adjustable Big Data management platform built to fill this gap by exploiting all of these frameworks. The platform is able to collect data both in a streaming and in a batch manner, utilizing the metadata obtained from its users? processing and analytical processes applied to the collected data. Based on this metadata, the platform recommends the optimum framework for the data processing/analytical activities that the users aim to execute. To verify the platform?s efficiency, numerous experiments were carried out using 30 diverse datasets related to various diseases. The results revealed that EverAnalyzer correctly suggested the optimum framework in 80% of the cases, indicating that the platform made the best selections in the majority of the experiments.

 Artículos similares

       
 
Jairo Fuentes, Jose Aguilar, Edwin Montoya and Ángel Pinto    
In this paper, we propose autonomous cycles of data analysis tasks for the automation of the production chains aimed to improve the productivity of Micro, Small and Medium Enterprises (MSMEs) in the context of agroindustry. In the autonomous cycles of da... ver más
Revista: Information

 
Zhen Yang, Guozhang Fan, Wei Yan, Xuefeng Wang, Guoqing Zhang, Zhili Yang, Zuofei Zhu, Yuanze Zhang, Huai Cheng, Hongxun Tian, Li Li and Qiang Zhang    
During the Miocene, several reefs formed in the Beikang Basin, South China Sea, which may be potential targets for hydrocarbon exploration. This is due to the environment that developed as a result of the collision, splitting, and splicing of the Nansha ... ver más

 
Min Xu, Wenjie Tian and Xiangpeng Zhang    
The three-degrees-of-freedom (3-DOF) parallel robot is commonly employed as a shipborne stabilized platform for real-time compensation of ship disturbances. Pose accuracy is one of its most critical performance indicators. Currently, neural networks have... ver más

 
Ivica Skoko, Zvonimir Lu?ic, Zaloa Sanchez-Varela and Zlatko Boko    
Over the past century, with an accelerated increase in world energy demand, oil remained the most intriguing energy source, while oil exploration and production evolved from an onshore to an offshore exploration and production facility. In logistic and o... ver más

 
Mansour Bayazidy, Mohammad Maleki, Aras Khosravi, Amir Mohammad Shadjou, Junye Wang, Rabee Rustum and Reza Morovati    
River water is one of the most important natural resources for economic development and environmental sustainability. However, river water systems are vulnerable in some of the densely populated regions across the globe. Intense sand mining and waste dis... ver más
Revista: Water