Balancing energy efficiency and high performance in embedded systems requires fine-tuning hardware and software components to co-optimize their interaction. In this work, we address the automated optimization of memory usage through a compiler toolchain that leverages DMA-aware precision tuning and mathematical function memorization. The proposed solution extends the llvm infrastructure, employing the taffo plugins for precision tuning, with the SeTHet extension for DMA-aware precision tuning and luTHet for automated, DMA-aware mathematical function memorization. We performed an experimental assessment on hero, a heterogeneous platform employing risc-v cores as a parallel accelerator. Our solution enables speedups ranging from 1.5× to 51.1× on AxBench benchmarks that employ trigonometrical functions and 4.23-48.4× on Polybench benchmarks over the baseline hero platform.
Magnani, G., Cattaneo, D., Denisov, L., Tagliavini, G., Agosta, G., Cherubin, S. (2025). Synergistic Memory Optimisations: Precision Tuning in Heterogeneous Memory Hierarchies. IEEE TRANSACTIONS ON COMPUTERS, 74(9), 3168-3180 [10.1109/tc.2025.3586025].
Synergistic Memory Optimisations: Precision Tuning in Heterogeneous Memory Hierarchies
Tagliavini, Giuseppe;
2025
Abstract
Balancing energy efficiency and high performance in embedded systems requires fine-tuning hardware and software components to co-optimize their interaction. In this work, we address the automated optimization of memory usage through a compiler toolchain that leverages DMA-aware precision tuning and mathematical function memorization. The proposed solution extends the llvm infrastructure, employing the taffo plugins for precision tuning, with the SeTHet extension for DMA-aware precision tuning and luTHet for automated, DMA-aware mathematical function memorization. We performed an experimental assessment on hero, a heterogeneous platform employing risc-v cores as a parallel accelerator. Our solution enables speedups ranging from 1.5× to 51.1× on AxBench benchmarks that employ trigonometrical functions and 4.23-48.4× on Polybench benchmarks over the baseline hero platform.| File | Dimensione | Formato | |
|---|---|---|---|
|
Sethet_Transactions_Computers_accepted.pdf
accesso aperto
Tipo:
Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review
Licenza:
Licenza per accesso libero gratuito
Dimensione
8.36 MB
Formato
Adobe PDF
|
8.36 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


