A key enabler for the ever-increasing adoption of FPGA accelerators is the availability of frameworks that couple them seamlessly to general-purpose host processors. Embedded FPGA+CPU systems still rely heavily on copy-based host-to-accelerator communication, which complicates application development. In this paper, we present a hardware/software framework that enables transparent shared virtual memory for FPGA accelerators in embedded SoCs. It can use a hard-macro IOMMU if available, or a configurable soft-core IOMMU that we provide. We explore different TLB configurations and compare our framework with other shared virtual memory designs to gain insight into performance-critical IOMMU components. Experimental results using pointer-rich benchmarks show that our framework not only simplifies FPGA-accelerated application development but also achieves up to 13x speedup compared to traditional copy-based offloading.

Vogel, P., Marongiu, A., Benini, L. (2019). Exploring Shared Virtual Memory for FPGA Accelerators with a Configurable IOMMU. IEEE Transactions on Computers, 68(4), 510-525. DOI: 10.1109/TC.2018.2879080.

Exploring Shared Virtual Memory for FPGA Accelerators with a Configurable IOMMU

Vogel, Pirmin; Marongiu, Andrea; Benini, Luca
2019

Files in this record:
File: Exploring 2018.pdf
Open access since 02/05/2019
Description: Postprint article
Type: Postprint
License: Free open-access license
Size: 1.02 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/651834