ARTICLE
TITLE

Online MPI Process Mapping for Coordinating Locality and Memory Congestion on NUMA Systems

Mulya Agung    
Muhammad Alfian Amrizal    
Ryusuke Egawa    
Hiroyuki Takizawa    

Abstract

Mapping MPI processes to processor cores, called process mapping, is crucial to achieving scalable performance on multi-core processors. By analyzing the communication behavior among MPI processes, process mapping can improve communication locality and thus reduce the overall communication cost. However, on modern non-uniform memory access (NUMA) systems, the memory congestion problem can degrade performance more severely than the locality problem, because heavy congestion on shared caches and memory controllers can cause long latencies. Most existing work either focuses only on improving locality or relies on offline profiling to analyze the communication behavior. We propose a process mapping method that dynamically remaps processes to adapt to changing communication behavior while coordinating locality and memory congestion. Our method works online during the execution of an MPI application. It does not require modifications to the application, prior knowledge of the communication behavior, or changes to the hardware or operating system. Experimental results show that our method achieves performance and energy efficiency close to those of the best static mapping method, with low overhead on application execution. In experiments with the NAS Parallel Benchmarks on a NUMA system, the performance and total energy improvements are up to 34% (18.5% on average) and 28.9% (13.6% on average), respectively. In experiments with two GROMACS applications on a larger NUMA system, the average improvements in performance and total energy consumption are 21.6% and 12.6%, respectively.
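
To make the trade-off between locality and memory congestion concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a symmetric communication matrix `comm` (traffic between each pair of ranks), a per-rank memory traffic estimate `mem_load`, and a weight `alpha` that balances the two goals. All of these names, the cost function, and the greedy heuristic are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def mapping_cost(comm: Sequence[Sequence[float]],
                 mem_load: Sequence[float],
                 mapping: Sequence[int],
                 n_nodes: int,
                 alpha: float = 1.0) -> float:
    """Illustrative cost: remote communication plus weighted peak node load."""
    n = len(mapping)
    # Locality term: traffic between ranks placed on different NUMA nodes.
    remote = sum(comm[i][j]
                 for i in range(n) for j in range(i + 1, n)
                 if mapping[i] != mapping[j])
    # Congestion term: memory traffic on the most heavily loaded node.
    node_load = [0.0] * n_nodes
    for rank, node in enumerate(mapping):
        node_load[node] += mem_load[rank]
    return remote + alpha * max(node_load)


def greedy_map(comm: Sequence[Sequence[float]],
               mem_load: Sequence[float],
               n_nodes: int,
               cores_per_node: int,
               alpha: float = 1.0) -> List[int]:
    """Assign each rank to a NUMA node, heaviest communicators first."""
    n = len(comm)
    if n > n_nodes * cores_per_node:
        raise ValueError("more ranks than available cores")
    order = sorted(range(n), key=lambda r: -sum(comm[r]))
    mapping = [-1] * n            # -1 means "not placed yet"
    used = [0] * n_nodes          # cores already occupied per node
    node_load = [0.0] * n_nodes   # accumulated memory traffic per node
    for r in order:
        best_node, best_score = -1, float("inf")
        for node in range(n_nodes):
            if used[node] >= cores_per_node:
                continue
            # Traffic to already-placed partners that would become remote.
            remote = sum(comm[r][q] for q in range(n)
                         if mapping[q] not in (-1, node))
            score = remote + alpha * (node_load[node] + mem_load[r])
            if score < best_score:
                best_node, best_score = node, score
        mapping[r] = best_node
        used[best_node] += 1
        node_load[best_node] += mem_load[r]
    return mapping
```

With a small `alpha`, ranks that communicate heavily end up on the same node; with a large `alpha`, memory-intensive ranks are spread across nodes even if that makes some communication remote. An online method such as the one described in the abstract would additionally have to observe communication at run time and migrate processes, which this sketch does not attempt.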
