CRIS Current Research Information System

One of the major challenges in Deep Reinforcement Learning for control is the need for extensive training to learn a policy. Motivated by this, we present the design of the Control-Tutored Deep QNetworks (CT-DQN) algorithm, a Deep Reinforcement Learning algorithm that leverages a control tutor, i.e., an exogenous control law, to reduce learning time. The tutor can be designed using an approximate model of the system, without any assumption about the knowledge of the system dynamics. There is no expectation that it will be able to achieve the control objective if used stand-alone. During learning, the tutor occasionally suggests an action, thus partially guiding exploration. We validate our approach on three scenarios from OpenAI Gym: the inverted pendulum, lunar lander, and car racing. We demonstrate that CT-DQN is able to achieve better or equivalent data efficiency with respect to the classic function approximation solutions.

Francesco De Lellis, M.C. (2023). CT-DQN: Control-Tutored Deep Reinforcement Learning.

CT-DQN: Control-Tutored Deep Reinforcement Learning

Francesco De Lellis;Marco Coraggio;Giovanni Russo;Mirco Musolesi;Mario di Bernardo

2023

Abstract

One of the major challenges in Deep Reinforcement Learning for control is the need for extensive training to learn a policy. Motivated by this, we present the design of the Control-Tutored Deep QNetworks (CT-DQN) algorithm, a Deep Reinforcement Learning algorithm that leverages a control tutor, i.e., an exogenous control law, to reduce learning time. The tutor can be designed using an approximate model of the system, without any assumption about the knowledge of the system dynamics. There is no expectation that it will be able to achieve the control objective if used stand-alone. During learning, the tutor occasionally suggests an action, thus partially guiding exploration. We validate our approach on three scenarios from OpenAI Gym: the inverted pendulum, lunar lander, and car racing. We demonstrate that CT-DQN is able to achieve better or equivalent data efficiency with respect to the classic function approximation solutions.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Titolo del volume
	
				Proceedings of The 5th Annual Learning for Dynamics and Control Conference
			
	Pagina iniziale
	
				941
			
	Pagina finale
	
				953
			
	Collana/Serie
	
				PROCEEDINGS OF MACHINE LEARNING RESEARCH
			
	Citazione
	
				Francesco De Lellis, M.C. (2023). CT-DQN: Control-Tutored Deep Reinforcement Learning.
			
	Tutti gli autori
	
						Francesco De Lellis, Marco Coraggio, Giovanni Russo, Mirco Musolesi, Mario di Bernardo
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
l4dc23 (1).pdf accesso aperto Tipo: Versione (PDF) editoriale Licenza: Licenza per accesso libero gratuito Dimensione 393.26 kB Formato Adobe PDF Visualizza/Apri	393.26 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/960139

Citazioni

ND

ND

ND

social impact