Redirigiendo al acceso original de articulo en 18 segundos...
Inicio  /  Computation  /  Vol: 9 Par: 2 (2021)  /  Artículo
ARTÍCULO
TITULO

High-Performance Computation in Residue Number System Using Floating-Point Arithmetic

Konstantin Isupov    

Resumen

Residue number system (RNS) is known for its parallel arithmetic and has been used in recent decades in various important applications, from digital signal processing and deep neural networks to cryptography and high-precision computation. However, comparison, sign identification, overflow detection, and division are still hard to implement in RNS. For such operations, most of the methods proposed in the literature only support small dynamic ranges (up to several tens of bits), so they are only suitable for low-precision applications. We recently proposed a method that supports arbitrary moduli sets with cryptographically sized dynamic ranges, up to several thousands of bits. The practical interest of our method compared to existing methods is that it relies only on very fast standard floating-point operations, so it is suitable for multiple-precision applications and can be efficiently implemented on many general-purpose platforms that support IEEE 754 arithmetic. In this paper, we make further improvements to this method and demonstrate that it can successfully be applied to implement efficient data-parallel primitives operating in the RNS domain, namely finding the maximum element of an array of RNS numbers on graphics processing units. Our experimental results on an NVIDIA RTX 2080 GPU show that for random residues and a 128-moduli set with 2048-bit dynamic range, the proposed implementation reduces the running time by a factor of 39 and the memory consumption by a factor of 13 compared to an implementation based on mixed-radix conversion.

 Artículos similares

       
 
Yang Wang, Jie Liu, Xiaoxiong Zhu, Qingyang Zhang, Shengguo Li and Qinglin Wang    
Structured grid-based sparse matrix-vector multiplication and Gauss?Seidel iterations are very important kernel functions in scientific and engineering computations, both of which are memory intensive and bandwidth-limited. GPDSP is a general purpose dig... ver más
Revista: Applied Sciences

 
Yukui Tian, Dongbao Yang, Xuhao Gang, Chaoge Yu, Shunying Ji and Qianjin Yue    
The determination of ice loads on polar vessels and offshore structures is important for ice-resistant design, safe operation, and management of structural integrity in ice-infested waters. Physical model testing carried out in an ice tank/basin is usual... ver más

 
Argyrios Sideris and Minas Dasygenis    
Information is transmitted between multiple insecure routing hops in text, image, video, and audio. Thus, this multi-hop digital data transfer makes secure transmission with confidentiality and integrity imperative. This protection of the transmitted dat... ver más
Revista: Computation

 
Danial Rooyani and Fantahun Defersha    
The work in this paper is motivated by a recently published article in which the authors developed an efficient two-stage genetic algorithm for a comprehensive model of a flexible job-shop scheduling problem (FJSP). In this paper, we extend the applicati... ver más
Revista: Algorithms

 
Ahmed Jawad A. AlBdairi, Zhu Xiao, Ahmed Alkhayyat, Amjad J. Humaidi, Mohammed A. Fadhel, Bahaa Hussein Taher, Laith Alzubaidi, José Santamaría and Omran Al-Shamma    
In the last decade, there has been a surge of interest in addressing complex Computer Vision (CV) problems in the field of face recognition (FR). In particular, one of the most difficult ones is based on the accurate determination of the ethnicity of man... ver más
Revista: Applied Sciences