Many-core architectures structured as fabrics of tightly-coupled clusters have shown promising results on embedded computer vision benchmarks, providing state-of-art performance with a reduced power budget. We propose PULP (Parallel processing Ultra-Low Power platform), an architecture built on clusters of tightly-coupled OpenRISC ISA cores, with advanced techniques for fast performance and energy scalability that exploit the capabilities of the STMicroelectronics UTB FD-SOI 28nm technology. As a use case for PULP, we show that a computationally demanding vision kernel based on Convolutional Neural Networks can be quickly and efficiently switched from a low power, low frame-rate operating point to a high frame-rate one when a detection is performed. Our results show that PULP performance can be scaled over a 1x-354x range, with a peak performance/power efficiency of 211 GOPS/W.
Titolo: | Energy-efficient vision on the PULP platform for ultra-low power parallel computing | |
Autore/i: | CONTI, FRANCESCO; ROSSI, DAVIDE; Pullini, Antonio; LOI, IGOR; BENINI, LUCA | |
Autore/i Unibo: | ||
Anno: | 2014 | |
Titolo del libro: | IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation | |
Pagina iniziale: | 1 | |
Pagina finale: | 6 | |
Digital Object Identifier (DOI): | http://dx.doi.org/10.1109/SiPS.2014.6986099 | |
Abstract: | Many-core architectures structured as fabrics of tightly-coupled clusters have shown promising results on embedded computer vision benchmarks, providing state-of-art performance with a reduced power budget. We propose PULP (Parallel processing Ultra-Low Power platform), an architecture built on clusters of tightly-coupled OpenRISC ISA cores, with advanced techniques for fast performance and energy scalability that exploit the capabilities of the STMicroelectronics UTB FD-SOI 28nm technology. As a use case for PULP, we show that a computationally demanding vision kernel based on Convolutional Neural Networks can be quickly and efficiently switched from a low power, low frame-rate operating point to a high frame-rate one when a detection is performed. Our results show that PULP performance can be scaled over a 1x-354x range, with a peak performance/power efficiency of 211 GOPS/W. | |
Data stato definitivo: | 18-dic-2015 | |
Appare nelle tipologie: | 4.01 Contributo in Atti di convegno |