Computation, Vol. 12, No. 3 (2024)
ARTICLE
TITLE

Exploring Numba and CuPy for GPU-Accelerated Monte Carlo Radiation Transport

Tair Askar, Argyn Yergaliyev, Bekdaulet Shukirgaliyev and Ernazar Abdikamalov

Abstract

This paper examines the performance of two popular GPU programming platforms, Numba and CuPy, for Monte Carlo radiation transport calculations. We conducted tests involving random number generation and one-dimensional Monte Carlo radiation transport in plane-parallel geometry on three GPU cards: NVIDIA Tesla A100, Tesla V100, and GeForce RTX 3080. We compared Numba and CuPy with each other and with our CUDA C implementation. As expected, CUDA C is the fastest and the most energy-efficient, while Numba offers comparable performance when data movement is minimal. CuPy is easier to implement but slower for compute-heavy tasks.
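To make the second test concrete, here is a minimal CPU reference sketch of one-dimensional Monte Carlo transport through a plane-parallel slab. It is not the authors' code: the function names (`transport_packet`, `transmitted_fraction`), the purely scattering slab, the flux-weighted injection at the lower boundary, and the escape conditions are all assumptions chosen for illustration. Each packet is followed independently, which is the property a GPU implementation would exploit by assigning packets to threads.

```python
import math
import random


def transport_packet(rng, tau_max, max_scatter=100_000):
    """Follow one packet through a purely scattering slab (assumed setup).

    Position z is measured in optical-depth units from the lower
    boundary (z = 0) to the upper boundary (z = tau_max).
    Returns True if the packet leaves through the top, False otherwise.
    """
    z = 0.0
    # Assumed injection: flux-weighted upward direction cosine in [0, 1).
    mu = math.sqrt(rng.random())
    for _ in range(max_scatter):
        # Sample the optical depth to the next interaction.
        tau = -math.log(1.0 - rng.random())
        z += tau * mu
        if z >= tau_max:
            return True   # escaped through the top
        if z < 0.0:
            return False  # escaped through the bottom
        # Isotropic scattering: new direction cosine uniform in [-1, 1).
        mu = 2.0 * rng.random() - 1.0
    # Illustrative safeguard: count packets that never escape as not transmitted.
    return False


def transmitted_fraction(n_packets, tau_max, seed=0):
    """Estimate the fraction of packets transmitted through the slab."""
    rng = random.Random(seed)
    n_top = sum(transport_packet(rng, tau_max) for _ in range(n_packets))
    return n_top / n_packets
```

For example, `transmitted_fraction(2000, 5.0)` estimates the transmission of a slab with total optical depth 5. The estimate is statistical, so its value depends on the seed and the number of packets.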

Keywords

GPU, CUDA, Numba, CuPy, performance

Similar articles

Aurelien Bloch, Simone Casale-Brunet and Marco Mattavelli
Developing and fine-tuning software programs for heterogeneous hardware such as CPU/GPU processing platforms comprise a highly complex endeavor that demands considerable time and effort of software engineers and requires evaluating various fundamental co...

 
Michael Knobloch, Bernd Mohr (pp. 91-111)
General purpose GPUs are now ubiquitous in high-end supercomputing. All but one (the Japanese Fugaku system, which is based on ARM processors) of the announced (pre-)exascale systems contain vast amounts of GPUs that deliver the majority of the performan...

 
Stefano Quer, Andrea Marcelli and Giovanni Squillero
The maximum common subgraph of two graphs is the largest possible common subgraph, i.e., the common subgraph with as many vertices as possible. Even if this problem is very challenging, as it has been long proven NP-hard, its countless practical applicat...
Journal: Computation

 
Yuzhu Wang, Yuan Zhao, Jinrong Jiang and He Zhang
Graphics processing unit (GPU)-based computing for climate system models is a longstanding research area of interest. The rapid radiative transfer model for general circulation models (RRTMG), a popular atmospheric radiative transfer model, can calculate...
Journal: Applied Sciences

 
Ilya S. Pershin, Vadim D. Levchenko, Anastasia Y. Perepelkina (pp. 86-101)
We study the performance limits of different algorithmic approaches to the implementation of a sample problem of wave equation solution with a cross stencil scheme. With this, we aim to find the highest limit of the achievable performance efficiency...