Inicio  /  Future Internet  /  Vol: 14 Par: 10 (2022)  /  Artículo
ARTÍCULO
TITULO

Distributed Big Data Storage Infrastructure for Biomedical Research Featuring High-Performance and Rich-Features

Xingjian Xu    
Lijun Sun and Fanjun Meng    

Resumen

The biomedical field entered the era of ?big data? years ago, and a lot of software is being developed to tackle the analysis problems brought on by big data. However, very few programs focus on providing a solid foundation for file systems of biomedical big data. Since file systems are a key prerequisite for efficient big data utilization, the absence of specialized biomedical big data file systems makes it difficult to optimize storage, accelerate analysis, and enrich functionality, resulting in inefficiency. Here we present F3BFS, a functional, fundamental, and future-oriented distributed file system, specially designed for various kinds of biomedical data. F3BFS makes it possible to boost existing software?s performance without modifying its main algorithms by transmitting raw datasets from generic file systems. Further, F3BFS has various built-in features to help researchers manage biology datasets more efficiently and productively, including metadata management, fuzzy search, automatic backup, transparent compression, etc.

 Artículos similares

       
 
Zhixin Yao, Jianqin Zhang, Taizeng Li and Ying Ding    
Trajectory big data is suitable for distributed storage retrieval due to its fast update speed and huge data volume, but currently there are problems such as hot data writing, storage skew, high I/O overhead and slow retrieval speed. In order to solve th... ver más

 
Lili Liang, Yufeng Hu, Zhiwu Liu, Yuntao Ye, Kuang Li, Kexin Liu, Haiqing Xu and Xiquan Liu    
The lumped hydrological model and empirical model have the problems of low accuracy and short forecasting period in real-time flood forecasting of small- and medium-sized rivers in a mountainous watershed. The sharing of underlying surface data such as h... ver más
Revista: Water

 
Guijun Lai, Yuzhen Shang, Binbao He, Guanwei Zhao and Muzhuang Yang    
Characterizing the taxi travel network is of fundamental importance to our understanding of urban mobility, and could provide intellectual support for urban planning, traffic congestion, and even the spread of diseases. However, the research on the inter... ver más

 
Yan Yan, Zichao Sun, Adnan Mahmood, Fei Xu, Zhuoyue Dong and Quan Z. Sheng    
Statistical partitioning and publishing is commonly used in location-based big data services to address queries such as the number of points of interest, available vehicles, traffic flows, infected patients, etc., within a certain range. Adding noise per... ver más

 
Carlos Garcia Calatrava, Yolanda Becerra Fontal and Fernando M. Cucchietti    
Time series databases aim to handle big amounts of data in a fast way, both when introducing new data to the system, and when retrieving it later on. However, depending on the scenario in which these databases participate, reducing the number of requeste... ver más