The UN Agenda 2030 motivates the development of scalable methods and tools for assessing the contribution of research and higher education institutions to sustainable development. Current research efforts suffer from the lack of rigorous evaluation benchmarks. This paper describes a collaborative work with sustainability experts aimed at constructing AlmaSDG, the first pilot dataset of scientific articles labeled by multiple annotators along all SDGs. AlmaSDG is validated using inter-annotator agreement and inference of pre-trained models and tools.
Bolognini, L., Palmieri, E., Donati, N., Grundler, G., Pappacoda, G., Ruggeri, F., et al. (2026). AlmaSDG: A Dataset of Scientific Articles' Contributions to the UN Sustainable Development Goals. SCIENTIFIC DATA, 0, 1-17 [10.1038/s41597-026-07452-4].
AlmaSDG: A Dataset of Scientific Articles' Contributions to the UN Sustainable Development Goals
Bolognini, Luca;Palmieri, Elena;Donati, Nicolò;Grundler, Giulia;Pappacoda, Gianmarco;Ruggeri, Federico;Galassi, Andrea
;Torroni, Paolo
2026
Abstract
The UN Agenda 2030 motivates the development of scalable methods and tools for assessing the contribution of research and higher education institutions to sustainable development. Current research efforts suffer from the lack of rigorous evaluation benchmarks. This paper describes a collaborative work with sustainability experts aimed at constructing AlmaSDG, the first pilot dataset of scientific articles labeled by multiple annotators along all SDGs. AlmaSDG is validated using inter-annotator agreement and inference of pre-trained models and tools.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



