Bertolotti, F., Aguzzi, G., Cazzola, W., Viroli, M. (2025). SHAC++: A Neural Network to Rule All Differentiable Simulators. DOI: 10.3233/faia251138.
SHAC++: A Neural Network to Rule All Differentiable Simulators
Aguzzi, Gianluca; Viroli, Mirko
2025
Abstract
Reinforcement learning (RL) algorithms show promise in robotics and multi-agent systems but often suffer from low sample efficiency. While methods like SHAC leverage differentiable simulators to improve efficiency, they are limited to specific settings: they require fully differentiable environments, including transition and reward functions, and have primarily been demonstrated in single-agent scenarios. To overcome these limitations, we introduce SHAC++, a novel framework inspired by SHAC. SHAC++ removes the need for differentiable simulator components by using neural networks to approximate the required gradients, training these networks alongside the standard policy and value networks. This enables the core SHAC approach to be applied in both non-differentiable and multi-agent environments. We evaluate SHAC++ on challenging multi-agent tasks from the VMAS suite, comparing it against SHAC (where applicable) and PPO, a standard algorithm for non-differentiable settings. Our results demonstrate that SHAC++ significantly outperforms PPO in both single- and multi-agent scenarios. Furthermore, in differentiable environments where SHAC operates, SHAC++ achieves comparable performance despite lacking direct access to simulator gradients, thus successfully extending SHAC's benefits to a broader class of problems. The full implementation is openly available at https://github.com/f14-bertolotti/shacpp.
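The abstract's key mechanism, replacing simulator gradients with learned surrogates that are trained alongside the actor and critic, can be illustrated with a minimal sketch. The snippet below is a hypothetical PyTorch illustration under assumed observation/action dimensions; the names (`policy`, `dynamics`, `reward`), network shapes, and hyperparameters are not taken from the released implementation.

```python
# Minimal sketch of the SHAC++ idea: backpropagate a short-horizon return
# through learned reward/transition surrogates instead of a differentiable
# simulator.  All names, shapes, and hyperparameters here are illustrative.
import torch
import torch.nn as nn

OBS_DIM, ACT_DIM, HORIZON, GAMMA = 8, 2, 16, 0.99

def mlp(inp, out):
    return nn.Sequential(nn.Linear(inp, 64), nn.ELU(), nn.Linear(64, out))

policy   = mlp(OBS_DIM, ACT_DIM)            # actor
value    = mlp(OBS_DIM, 1)                  # critic
dynamics = mlp(OBS_DIM + ACT_DIM, OBS_DIM)  # surrogate transition model
reward   = mlp(OBS_DIM + ACT_DIM, 1)        # surrogate reward model

def surrogate_policy_loss(obs):
    """Short-horizon return computed through the learned models, so the
    policy gradient flows through `dynamics` and `reward` rather than
    through a differentiable simulator."""
    ret = torch.zeros(obs.shape[0])
    for t in range(HORIZON):
        act = torch.tanh(policy(obs))
        sa = torch.cat([obs, act], dim=-1)
        ret = ret + (GAMMA ** t) * reward(sa).squeeze(-1)
        obs = dynamics(sa)                   # differentiable "step"
    ret = ret + (GAMMA ** HORIZON) * value(obs).squeeze(-1)
    return -ret.mean()                       # maximize the surrogate return

def model_loss(obs, act, next_obs, rew):
    """Regression on transitions from the real (possibly non-differentiable)
    environment, used to fit the surrogate models."""
    sa = torch.cat([obs, act], dim=-1)
    return (nn.functional.mse_loss(dynamics(sa), next_obs)
            + nn.functional.mse_loss(reward(sa).squeeze(-1), rew))
```

In a full SHAC++-style training loop, transitions collected with the current policy would fit `dynamics` and `reward` via `model_loss`, the critic would be trained on value targets as in SHAC, and the actor would be updated by differentiating `surrogate_policy_loss` through the learned models.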
| File | Access | Type | License | Size | Format |
|---|---|---|---|---|---|
| FAIA-413-FAIA251138.pdf | Open access | Version of Record (publisher PDF) | Creative Commons Attribution-NonCommercial (CC BY-NC) | 826.44 kB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.


