Barrier synchronization is a key programming primitive for shared memory embedded MPSoCs. As the core count increases, software implementations cannot provide the needed performance and scalability, thus making hardware acceleration critical. In this paper we describe an interconnect extension implemented with standard cells and with a mainstream industrial toolflow. We show that the area overhead is marginal with respect to the performance improvements of the resulting hardware-accelerated barriers.We integrate our HW barrier into the OpenMP programming model and discuss synchronization efficiency compared with traditional software implementations.
Abellan J.L., Fernandez J. , Acacio M.E. , Bertozzi D. , Bortolotti D. , Marongiu A. , et al. (2012). Design of a collective communication infrastructure for barrier synchronization in cluster-based nanoscale MPSoCs. NEW YORK : IEEE Press [10.1109/DATE.2012.6176519].
Design of a collective communication infrastructure for barrier synchronization in cluster-based nanoscale MPSoCs
BERTOZZI, DAVIDE;BORTOLOTTI, DANIELE;MARONGIU, ANDREA;BENINI, LUCA
2012
Abstract
Barrier synchronization is a key programming primitive for shared memory embedded MPSoCs. As the core count increases, software implementations cannot provide the needed performance and scalability, thus making hardware acceleration critical. In this paper we describe an interconnect extension implemented with standard cells and with a mainstream industrial toolflow. We show that the area overhead is marginal with respect to the performance improvements of the resulting hardware-accelerated barriers.We integrate our HW barrier into the OpenMP programming model and discuss synchronization efficiency compared with traditional software implementations.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.