
Supercomputer Lomonosov-2: Large Scale, Deep Monitoring and Fine Analytics for the User Community

Vladimir V. Voevodin    
Alexander S. Antonov    
Dmitry A. Nikitenko    
Pavel A. Shvets    
Sergey I. Sobolev    
Igor Yu. Sidorov    
Konstantin S. Stefanov    
Vadim V. Voevodin    
Sergey A. Zhumatiy    

Abstract

The huge number of hardware and software components in a large-scale supercomputer, together with the many parameters that affect the performance of each parallel application, makes ensuring its efficiency extremely difficult. In this situation, all basic parameters of the supercomputer should be monitored constantly, and many decisions about its operation should be made automatically by special software. In this paper we describe the tight connection between the complexity of modern large high-performance computing systems and the special techniques and tools required to ensure their efficiency in practice. The main subsystems of the developed complex (Octoshell, DiMMoN, Octotron, JobDigest, and an expert software system that brings fine analytics on parallel applications and on the entire supercomputer to users and sysadmins) are in active operation on the large supercomputer systems at Lomonosov Moscow State University. We briefly describe the architecture of the Lomonosov-2 supercomputer and discuss questions that show both the wide variety of complex issues that arise and the need for an integrated approach to effectively supporting large supercomputer systems.
