Inicio  /  Applied Sciences  /  Vol: 14 Par: 1 (2024)  /  Artículo
ARTÍCULO
TITULO

Research on High-Performance Fourier Transform Algorithms Based on the NPU

Qing Li    
Decheng Zuo    
Yi Feng and Dongxin Wen    

Resumen

Backpack computers require powerful, intelligent computing capabilities for field wearables while taking energy consumption into careful consideration. A recommended solution for this demand is the CPU + NPU-based SoC. In many wearable intelligence applications, the Fourier Transform is an essential, computationally intensive preprocessing task. However, due to the unique structure of the NPU, the conventional Fourier Transform algorithms cannot be applied directly to it. This paper proposes two NPU-accelerated Fourier Transform algorithms that leverage the unique hardware structure of the NPU and provides three implementations of those algorithms, namely MM-2DFT, MV-2FFTm, and MV-2FFTv. Then, we benchmarked the speed and energy efficiency of our algorithms for the gray image edge filtering task on the Huawei Atlas200I-DK-A2 development kits against the Cooley-Tukey algorithm running on CPU and GPU platforms. The experiment results reveal MM-2DFT outperforms OpenCL-based FFT on NVIDIA Tegra X2 GPU for small input sizes, with a 4- to 8-time speedup. As the input image resolution exceeds 2048, MV-2FFTv approaches GPU computation speed. Additionally, two scenarios were tested and analyzed for energy efficiency, revealing that cube units of the NPU are more energy efficient. The vector and CPU units are better suited for sparse matrix multiplication and small-scale inputs, respectively.

Palabras claves

 Artículos similares

       
 
Yinglong Kang, Kemin Zhang and Xi Lin    
Whether it is fossil energy or renewable energy, the storage, efficient use, and multi-application of energy largely depend on the research and preparation of high-performance materials. The research and development of energy storage materials with a hig... ver más
Revista: Coatings

 
Mikel Labayen, Laura Medina, Fernando Eizaguirre, José Flich and Naiara Aginako    
The automation of railroad operations is a rapidly growing industry. In 2023, a new European standard for the automated Grade of Automation (GoA) 2 over European Train Control System (ETCS) driving is anticipated. Meanwhile, railway stakeholders are alre... ver más
Revista: Applied Sciences

 
Son Vu Hong Pham and Khoi Van Tien Nguyen    
Artificial intelligence models are currently being proposed for application in improving performance in addressing contemporary management and production issues. With the goal of automating the detection of road surface defects in transportation infrastr... ver más
Revista: Applied Sciences

 
Alexander Feoktistov, Alexei Edelev, Andrei Tchernykh, Sergey Gorsky, Olga Basharina and Evgeniy Fereferov    
Implementing high-performance computing (HPC) to solve problems in energy infrastructure resilience research in a heterogeneous environment based on an in-memory data grid (IMDG) presents a challenge to workflow management systems. Large-scale energy inf... ver más
Revista: Computation

 
Francisco-David Hernandez, Domingo Cortes, Marco Antonio Ramirez-Salinas and Luis Alfonso Villa-Vargas    
In control research and design it is frequently necessary to explore, evaluate, tune and compare many control strategies. These activities are assisted by software tools of increasing complexity; however, even with the existing high performance tools the... ver más
Revista: Algorithms