Symmetric Multi-Processing (SMP) based on cache coherency is crucial for high-end embedded systems like automotive applications. RISC-V is gaining traction, and open-source hardware (OSH) platforms offer solutions to issues such as IP costs and vendor dependency. Existing multi-core cache-coherent RISC-V platforms are complex and not efficient for small embedded core clusters. We propose an open-source SystemVerilog implementation of a lightweight snoop-based cache-coherent cluster of Linux-capable CVA6 cores. Our design uses the MOESI protocol via the Arm’s AMBA ACE protocol. Evaluated with Splash-3 benchmarks, our solution shows up to 32.87% faster performance in a dual-core setup and an average improvement of 15.8% over OpenPiton. Synthesized using GF 22nm FDSOI technology, the Cache Coherency Unit occupies only 1.6% of the system area.

Tedeschi, R., Valente, L., Ottavi, G., Zelioli, E., Wistoff, N., Giacometti, M., et al. (2024). Culsans: An Efficient Snoop-based Coherency Unit for the CVA6 Open Source RISC-V application processor. WORKS IN PROGRESS IN EMBEDDED COMPUTING JOURNAL, 10(2), 1-5 [10.3929/ethz-b-000712186].

Culsans: An Efficient Snoop-based Coherency Unit for the CVA6 Open Source RISC-V application processor

Luca Valente;Gianmarco Ottavi;Luca Benini;Davide Rossi
2024

Abstract

Symmetric Multi-Processing (SMP) based on cache coherency is crucial for high-end embedded systems like automotive applications. RISC-V is gaining traction, and open-source hardware (OSH) platforms offer solutions to issues such as IP costs and vendor dependency. Existing multi-core cache-coherent RISC-V platforms are complex and not efficient for small embedded core clusters. We propose an open-source SystemVerilog implementation of a lightweight snoop-based cache-coherent cluster of Linux-capable CVA6 cores. Our design uses the MOESI protocol via the Arm’s AMBA ACE protocol. Evaluated with Splash-3 benchmarks, our solution shows up to 32.87% faster performance in a dual-core setup and an average improvement of 15.8% over OpenPiton. Synthesized using GF 22nm FDSOI technology, the Cache Coherency Unit occupies only 1.6% of the system area.
2024
Tedeschi, R., Valente, L., Ottavi, G., Zelioli, E., Wistoff, N., Giacometti, M., et al. (2024). Culsans: An Efficient Snoop-based Coherency Unit for the CVA6 Open Source RISC-V application processor. WORKS IN PROGRESS IN EMBEDDED COMPUTING JOURNAL, 10(2), 1-5 [10.3929/ethz-b-000712186].
Tedeschi, Riccardo; Valente, Luca; Ottavi, Gianmarco; Zelioli, Enrico; Wistoff, Nils; Giacometti, Massimiliano; Basit Sajjad, Abdul; Benini, Luca; Ros...espandi
File in questo prodotto:
File Dimensione Formato  
Culsans_ An Efficient Snoop-based Coherency Unit for the CVA6 Open Source RISC-V application processor.pdf

accesso aperto

Descrizione: versione editoriale
Tipo: Versione (PDF) editoriale
Licenza: Creative commons
Dimensione 1.51 MB
Formato Adobe PDF
1.51 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1004728
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact