Heprem: Enabling predictable GPU execution on heterogeneous soc

Forsberg, Bjorn; Benini, Luca; Marongiu, Andrea

doi:10.23919/DATE.2018.8342066

Heterogeneous systems-on-A-chip are increasingly embracing shared memory designs, in which a single DRAM is used for both the main CPU and an integrated GPU. This architectural paradigm reduces the overheads associated with data movements and simplifies programmability. However, the deployment of real-Time workloads on such architectures is troublesome, as memory contention significantly increases execution time of tasks and the pessimism in worst-case execution time (WCET) estimates. The Predictable Execution Model (PREM) separates memory and computation phases in real-Time codes, then arbitrates memory phases from different tasks such that only one core at a time can access the DRAM. This paper revisits the original PREM proposal in the context of heterogeneous SoCs, proposing a compiler-based approach to make GPU codes PREM-compliant. Starting from high-level specifications of computation offloading, suitable program regions are selected and separated into memory and compute phases. Our experimental results show that the proposed technique is able to reduce the sensitivity of GPU kernels to memory interference to near zero, and achieves up to a 20 χ reduction in the measured WCET.

Forsberg, B., Benini, L., Marongiu, A. (2018). Heprem: Enabling predictable GPU execution on heterogeneous soc. Institute of Electrical and Electronics Engineers Inc. [10.23919/DATE.2018.8342066].

Heprem: Enabling predictable GPU execution on heterogeneous soc

Forsberg, Bjorn;Benini, Luca;Marongiu, Andrea

2018

Abstract

Heterogeneous systems-on-A-chip are increasingly embracing shared memory designs, in which a single DRAM is used for both the main CPU and an integrated GPU. This architectural paradigm reduces the overheads associated with data movements and simplifies programmability. However, the deployment of real-Time workloads on such architectures is troublesome, as memory contention significantly increases execution time of tasks and the pessimism in worst-case execution time (WCET) estimates. The Predictable Execution Model (PREM) separates memory and computation phases in real-Time codes, then arbitrates memory phases from different tasks such that only one core at a time can access the DRAM. This paper revisits the original PREM proposal in the context of heterogeneous SoCs, proposing a compiler-based approach to make GPU codes PREM-compliant. Starting from high-level specifications of computation offloading, suitable program regions are selected and separated into memory and compute phases. Our experimental results show that the proposed technique is able to reduce the sensitivity of GPU kernels to memory interference to near zero, and achieves up to a 20 χ reduction in the measured WCET.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2018
			
	Titolo del volume
	
				Proceedings of the 2018 Design, Automation and Test in Europe Conference and Exhibition, DATE 2018
			
	Pagina iniziale
	
				539
			
	Pagina finale
	
				544
			
	Codice DOI
	
				https://dx.doi.org/10.23919/DATE.2018.8342066
			
	Citazione
	
				Forsberg, B., Benini, L., Marongiu, A. (2018). Heprem: Enabling predictable GPU execution on heterogeneous soc. Institute of Electrical and Electronics Engineers Inc. [10.23919/DATE.2018.8342066].
			
	Tutti gli autori
	
						Forsberg, Bjorn; Benini, Luca; Marongiu, Andrea
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/677163

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

22

21

ND

CRIS Current Research Information System