ARTICLE
TITLE

A Case for Embedded FPGA-based SoCs in Energy-Efficient Acceleration of Graph Problems

Nachiket Kapre    
Pradeep Moorthy    

Abstract

Sparse graph problems are notoriously hard to accelerate on conventional platforms because their irregular memory access patterns underutilize memory bandwidth. These bottlenecks on traditional x86-based systems mean that sparse graph problems scale very poorly, both in performance and in power efficiency. A cluster of embedded SoCs (systems-on-chip) with closely coupled FPGA accelerators can support distributed memory accesses with better-matched low-power processing. We first conduct preliminary experiments across a range of COTS (commercial off-the-shelf) embedded SoCs to establish their promise for energy-efficient acceleration of sparse problems. We select the Xilinx Zynq SoC with FPGA accelerators to construct a prototype 32-node Beowulf cluster. We develop specialized MPI routines and memory DMA offload engines to support irregular communication efficiently. In this setup, we use the ARM processor as a data marshaller for local DMA traffic as well as remote MPI traffic, while the FPGA may be used as a programmable accelerator. Across a set of benchmark graphs, we show that a 32-node embedded SoC cluster can exceed the energy efficiency of an Intel E5-2407 by as much as 1.7× at a total graph processing capacity of 91–95 MTEPS for graphs as large as 32 million nodes and edges.
