Deep Analysis of Job State Statistics on Lomonosov-2 Supercomputer

Dmitry A. Nikitenko    
Vadim V. Voevodin    
Sergey A. Zhumatiy    

Abstract

It is common knowledge that the ever-growing capabilities of HPC systems are limited by a number of efficiency-related issues. The causes vary widely: hardware failures, incorrect job scheduling, peculiarities of the algorithm, specifics of the chosen programming technology, etc. Most of these issues can be detected through precise analysis, but studying every application run in this way is very resource-intensive. We therefore performed a less complicated analysis of the whole supercomputer job flow. In this paper we share our experience of analyzing the job states that the SLURM resource manager assigns to user applications on the Lomonosov-2 system at the Supercomputing Center of Lomonosov Moscow State University. The collected job state statistics revealed that the ratio of correctly finished jobs (those with the COMPLETED state) was rather low. Job owners were asked whether the distribution of their jobs' states was normal for their applications. This user feedback was processed, and as a result some new ways to improve efficiency were revealed.
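
The abstract does not say how the job state statistics were computed, so the following is only a minimal sketch of one way to obtain such a distribution. It assumes the accounting records have already been exported as pipe-separated `JobID|User|State` lines (for example with SLURM's `sacct` in parsable mode); the function name `state_ratios` and the sample records are hypothetical and are not taken from the paper.

```python
from collections import Counter


def state_ratios(lines):
    """Return {state: fraction of jobs} for 'JobID|User|State' records.

    Assumed input format: one job per line, fields separated by '|'.
    """
    counts = Counter()
    for line in lines:
        fields = line.strip().split("|")
        if len(fields) < 3 or not fields[2].strip():
            continue  # skip blank or malformed records
        # A state may carry a suffix such as "CANCELLED by 1001";
        # keep only the state name itself.
        state = fields[2].split()[0]
        counts[state] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {state: n / total for state, n in counts.items()}


# Hypothetical sample records, for illustration only.
sample = [
    "101|alice|COMPLETED",
    "102|alice|FAILED",
    "103|bob|CANCELLED by 1001",
    "104|bob|COMPLETED",
]

for state, share in sorted(state_ratios(sample).items()):
    print(f"{state}: {share:.0%}")
```

Grouping the same records by the user field before counting would give the per-owner distributions that the authors describe discussing with job owners.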
