ARTICLE
TITLE

A Holistic Scalability Strategy for Time Series Databases Following Cascading Polyglot Persistence

Carlos Garcia Calatrava    
Yolanda Becerra Fontal and Fernando M. Cucchietti    

Abstract

Time series databases aim to handle large volumes of data quickly, both when ingesting new data and when retrieving it later. However, depending on the scenario in which these databases operate, reducing the resources they require becomes a further requirement. NagareDB and its Cascading Polyglot Persistence approach were created with this goal in mind: they were intended not only to provide a fast time series solution, but also to strike a good balance between cost and efficiency. Although they provided outstanding results, they lacked a natural way of scaling out across a cluster. Consequently, monolithic deployments could extract the maximum value from the solution, whereas distributed ones had to rely on general-purpose scalability approaches. In this research, we propose a holistic approach specifically tailored to databases following Cascading Polyglot Persistence, in order to further maximize their inherent resource-saving goals. The proposed approach reduced the cluster size by 33% in a setup with just three ingestion nodes, and by up to 50% in a setup with 10 ingestion nodes. Moreover, the evaluation shows that our scaling method provides efficient cluster growth, achieving more than 85% of the speedup that theoretically perfect scaling would deliver, while also ensuring data safety via data replication.
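
One plausible reading of the 85% figure is as a scaling efficiency: the measured speedup divided by the speedup that perfectly linear scaling would give for the same increase in nodes. The sketch below assumes that interpretation. The function name `scaling_efficiency` and the throughput numbers are hypothetical, chosen only to illustrate the arithmetic, and are not taken from the paper.

```python
def scaling_efficiency(base_throughput, base_nodes, scaled_throughput, scaled_nodes):
    """Return the measured speedup as a fraction of ideal linear speedup."""
    if min(base_throughput, base_nodes, scaled_throughput, scaled_nodes) <= 0:
        raise ValueError("all inputs must be positive")
    # Speedup actually observed when growing the cluster.
    measured_speedup = scaled_throughput / base_throughput
    # Speedup a perfectly linear system would reach with the same node growth.
    ideal_speedup = scaled_nodes / base_nodes
    return measured_speedup / ideal_speedup


# Hypothetical figures (not from the paper): one node ingests 100,000 rows/s,
# three nodes together ingest 261,000 rows/s.
efficiency = scaling_efficiency(100_000, 1, 261_000, 3)
print(f"Scaling efficiency: {efficiency:.0%}")
```

Under this definition, a value of 1.0 corresponds to perfect scaling, so any result above 0.85 would match the kind of claim made in the abstract.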

Similar articles

       
 
Valeria Placidi, Michele Cenci, Francesco Castellani and Marta Falasca    
The increasingly complex dynamics of urban planning require an innovative approach to land use suitability analyses and environmental assessments. Traditionally, these disciplines have provided a critical foundation for sustainable urban development, but...
Journal: Urban Science

 
Liufeng Tao, Kai Ma, Miao Tian, Zhenyang Hui, Shuai Zheng, Junjie Liu, Zhong Xie and Qinjun Qiu    
The efficient and precise retrieval of desired information from extensive geological databases is a prominent and pivotal focus within the realm of geological information services. Conventional information retrieval methods primarily rely on keyword matc...

 
Daniela Castagna, Luzinete Scaunichi Barbosa, Charles Campoe Martim, Rhavel Salviano Dias Paulista, Nadja Gomes Machado, Marcelo Sacardi Biudes and Adilson Pacheco de Souza    
The Amazon biome plays a crucial role in the hydrological cycle, supplying water vapor for the atmosphere and contributing to evapotranspiration (ET) that influences regional humidity across Brazil and South America. Remote sensing (RS) has emerged as a ...
Journal: Hydrology

 
Sikha S. Bagui, Dustin Mink, Subhash C. Bagui, Michael Plain, Jadarius Hill and Marshall Elam    
There has been a great deal of research in the area of using graph engines and graph databases to model network traffic and network attacks, but the novelty of this research lies in visually or graphically representing the Reconnaissance Tactic (TA0043) ...
Journal: Future Internet

 
Wisal Khan, Teerath Kumar, Cheng Zhang, Kislay Raj, Arunabha M. Roy and Bin Luo    
The competent software architecture plays a crucial role in the difficult task of big data processing for SQL and NoSQL databases. SQL databases were created to organize data and allow for horizontal expansion. NoSQL databases, on the other hand, support...