In this paper we address the issue of efficient do-all workload distribution on an embedded 3D MPSoC. 3D stacking technology enables low-latency, high-bandwidth access to multiple large memory banks in close spatial proximity. In our implementation, one silicon layer contains multiple processors, whereas one or more DRAM layers on top host a NUMA memory subsystem. To obtain high locality and a balanced workload, we consider a two-step approach. First, a compiler pass analyzes memory references in a loop and schedules each iteration to the processor owning the most frequently accessed data. Second, if locality-aware loop parallelization has generated an unbalanced workload, we allow idle processors to execute part of the remaining work from neighbors by implementing runtime support for work stealing.
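The two-step approach described above can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' compiler pass or runtime: the function names `assign_iterations` and `run_with_stealing`, the `owner_of` mapping from data index to processor, the round-based execution model, and the choice of "neighbors" as the adjacent processor ids are all hypothetical and chosen for illustration.

```python
from collections import Counter, deque


def assign_iterations(accesses, owner_of, num_procs):
    """Step 1 (sketch): give each iteration to the processor that owns
    the data it references most often.

    accesses[i] is the list of data indices touched by iteration i.
    owner_of(idx) returns the id of the processor whose local memory
    bank holds idx (a hypothetical data distribution).
    """
    queues = [deque() for _ in range(num_procs)]
    for iteration, refs in enumerate(accesses):
        counts = Counter(owner_of(r) for r in refs)
        # Most references wins; ties go to the lowest processor id.
        best = min(counts, key=lambda p: (-counts[p], p))
        queues[best].append(iteration)
    return queues


def run_with_stealing(queues):
    """Step 2 (sketch): round-based execution in which an idle processor
    steals one iteration from the tail of its most loaded neighbor.

    Neighbors are assumed to be processors p-1 and p+1. The queues are
    consumed (mutated) by this function.
    """
    n = len(queues)
    executed = [[] for _ in range(n)]
    while any(queues):
        for p in range(n):
            if not queues[p]:
                neighbors = [q for q in (p - 1, p + 1) if 0 <= q < n]
                if neighbors:
                    victim = max(neighbors, key=lambda q: len(queues[q]))
                    if queues[victim]:
                        queues[p].append(queues[victim].pop())
            if queues[p]:
                executed[p].append(queues[p].popleft())
    return executed


if __name__ == "__main__":
    # Eight data elements split in two blocks: indices 0-3 belong to
    # processor 0, indices 4-7 to processor 1 (an assumed layout).
    accesses = [[0, 1, 5], [1, 2, 6], [2, 3, 7], [3, 0, 4], [4, 5, 0], [1, 1, 6]]
    queues = assign_iterations(accesses, lambda idx: idx // 4, num_procs=2)
    print("locality-aware assignment:", [list(q) for q in queues])
    print("execution with stealing:  ", run_with_stealing(queues))
```

In this example most iterations mainly touch data owned by processor 0, so the locality-aware assignment alone leaves processor 1 nearly idle; the stealing step lets it take iterations from the tail of processor 0's queue once its own queue is empty.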

Vertical stealing: robust, locality-aware do-all workload distribution for 3D MPSoCs / A. Marongiu; P. Burgio; L. Benini. - Print. - (2010), pp. 207-216. (Paper presented at CASES '10, the 2010 international conference on Compilers, architectures and synthesis for embedded systems, held in Scottsdale, AZ (USA), Oct 24-29, 2010) [10.1145/1878921.1878952].

Vertical stealing: robust, locality-aware do-all workload distribution for 3D MPSoCs

MARONGIU, ANDREA;BURGIO, PAOLO;BENINI, LUCA
2010


Use this identifier to cite or link to this document: https://hdl.handle.net/11585/96002