One of the challenges for Tiny Machine Learning (tinyML) is keeping up with the evolution of Machine Learning models from Convolutional Neural Networks to Transformers. We address this by leveraging a heterogeneous architectural template coupling RISC-V processors with hardwired accelerators supported by an automated deployment flow. We demonstrate Attention-based models in a tinyML power envelope with an octacore cluster coupled with an accelerator for quantized Attention. Our deployment flow enables end-to-end 8-bit Transformer inference, achieving leading-edge energy efficiency and throughput of 2960 GOp/J and 154GOp/s (0.65 V, 22nm FD-SOI technology).

Wiese, P., İslamoğlu, G., Scherer, M., Macan, L., Jung, V.J.B., Burrello, A., et al. (2025). Toward Attention-based TinyML: A Heterogeneous Accelerated Architecture and Automated Deployment Flow. IEEE DESIGN & TEST, 42(5), 63-72 [10.1109/mdat.2025.3527371].

Toward Attention-based TinyML: A Heterogeneous Accelerated Architecture and Automated Deployment Flow

Macan, Luka;Burrello, Alessio;Conti, Francesco;Benini, Luca
2025

Abstract

One of the challenges for Tiny Machine Learning (tinyML) is keeping up with the evolution of Machine Learning models from Convolutional Neural Networks to Transformers. We address this by leveraging a heterogeneous architectural template coupling RISC-V processors with hardwired accelerators supported by an automated deployment flow. We demonstrate Attention-based models in a tinyML power envelope with an octacore cluster coupled with an accelerator for quantized Attention. Our deployment flow enables end-to-end 8-bit Transformer inference, achieving leading-edge energy efficiency and throughput of 2960 GOp/J and 154GOp/s (0.65 V, 22nm FD-SOI technology).
2025
Wiese, P., İslamoğlu, G., Scherer, M., Macan, L., Jung, V.J.B., Burrello, A., et al. (2025). Toward Attention-based TinyML: A Heterogeneous Accelerated Architecture and Automated Deployment Flow. IEEE DESIGN & TEST, 42(5), 63-72 [10.1109/mdat.2025.3527371].
Wiese, Philip; İslamoğlu, Gamze; Scherer, Moritz; Macan, Luka; Jung, Victor J. B.; Burrello, Alessio; Conti, Francesco; Benini, Luca...espandi
File in questo prodotto:
File Dimensione Formato  
2408.02473v2.pdf

accesso aperto

Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review
Licenza: Licenza per accesso libero gratuito
Dimensione 442.23 kB
Formato Adobe PDF
442.23 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1000948
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 3
social impact