ARTICLE
TITLE

Data Compression for the Exascale Computing Era - Survey

Seung Woo Son    
Zhengzhang Chen    
William Hendrix    
Ankit Agrawal    
Wei-keng Liao    
Alok Choudhary    

Abstract

Periodic checkpointing has long been an important mechanism for tolerating faults in high-performance computing (HPC) systems, but it becomes cost-prohibitive as HPC systems approach exascale. Compression is a common way to mitigate this burden by reducing data size, yet it is often less effective for scientific datasets. Traditional lossless compression techniques look for repeated patterns, and such patterns are rare in high-precision scientific data. In this paper, we compare several lossless and lossy data compression algorithms and discuss their methodology in the exascale environment. As data volumes increase, we observe a growing trend toward new domain-driven algorithms that exploit characteristics inherent in many scientific datasets, such as relatively small changes in data values from one simulation iteration to the next or among neighboring data points. In particular, significant data reduction has been observed with lossy compression. This paper also discusses how the errors introduced by lossy compression are controlled and how they trade off against the compression ratio.
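
The idea of exploiting small changes between simulation iterations under a controlled error can be illustrated with a minimal sketch. It is not taken from any algorithm in the survey: the synthetic data, the `ERROR_BOUND` value, the helper names, and the choice of uniform quantization followed by zlib are all assumptions made for illustration.

```python
import math
import random
import struct
import zlib

ERROR_BOUND = 1e-4  # hypothetical absolute error tolerance chosen for this sketch


def quantize_changes(prev, curr, error_bound):
    """Encode each change curr[i] - prev[i] as an integer bin index.

    Bins are 2 * error_bound wide, so rounding to the nearest bin moves a
    value by at most error_bound (up to floating-point rounding).
    """
    step = 2.0 * error_bound
    return [round((c - p) / step) for p, c in zip(prev, curr)]


def reconstruct(prev, codes, error_bound):
    """Rebuild an approximation of the current iteration from the bin indices."""
    step = 2.0 * error_bound
    return [p + q * step for p, q in zip(prev, codes)]


def compressed_size(fmt_char, values):
    """Pack values as little-endian binary and return the zlib-compressed size in bytes."""
    raw = struct.pack(f"<{len(values)}{fmt_char}", *values)
    return len(zlib.compress(raw, 9))


# Synthetic stand-in for two consecutive iterations (an assumption, not real
# simulation output): a noisy field, then a small change at every point.
rng = random.Random(0)
n = 10_000
prev = [math.sin(0.01 * i) + rng.gauss(0.0, 0.05) for i in range(n)]
curr = [p + rng.gauss(0.0, 1e-3) for p in prev]

codes = quantize_changes(prev, curr, ERROR_BOUND)
approx = reconstruct(prev, codes, ERROR_BOUND)

lossless_bytes = compressed_size("d", curr)  # full-precision doubles
lossy_bytes = compressed_size("i", codes)    # 32-bit bin indices
max_error = max(abs(a - c) for a, c in zip(approx, curr))

print("lossless zlib bytes:", lossless_bytes)
print("quantized-change zlib bytes:", lossy_bytes)
print("max absolute error:", max_error)
```

In this sketch, widening `ERROR_BOUND` produces fewer distinct bin indices and therefore a smaller compressed size, at the cost of a larger reconstruction error, which is the tradeoff the abstract refers to. Reconstruction also requires the previous iteration to be available.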
