One of the major challenges in Deep Reinforcement Learning for control is the need for extensive training to learn a policy. Motivated by this, we present the design of the Control-Tutored Deep QNetworks (CT-DQN) algorithm, a Deep Reinforcement Learning algorithm that leverages a control tutor, i.e., an exogenous control law, to reduce learning time. The tutor can be designed using an approximate model of the system, without any assumption about the knowledge of the system dynamics. There is no expectation that it will be able to achieve the control objective if used stand-alone. During learning, the tutor occasionally suggests an action, thus partially guiding exploration. We validate our approach on three scenarios from OpenAI Gym: the inverted pendulum, lunar lander, and car racing. We demonstrate that CT-DQN is able to achieve better or equivalent data efficiency with respect to the classic function approximation solutions.

Francesco De Lellis, M.C. (2023). CT-DQN: Control-Tutored Deep Reinforcement Learning.

CT-DQN: Control-Tutored Deep Reinforcement Learning

Mirco Musolesi;
2023

Abstract

One of the major challenges in Deep Reinforcement Learning for control is the need for extensive training to learn a policy. Motivated by this, we present the design of the Control-Tutored Deep QNetworks (CT-DQN) algorithm, a Deep Reinforcement Learning algorithm that leverages a control tutor, i.e., an exogenous control law, to reduce learning time. The tutor can be designed using an approximate model of the system, without any assumption about the knowledge of the system dynamics. There is no expectation that it will be able to achieve the control objective if used stand-alone. During learning, the tutor occasionally suggests an action, thus partially guiding exploration. We validate our approach on three scenarios from OpenAI Gym: the inverted pendulum, lunar lander, and car racing. We demonstrate that CT-DQN is able to achieve better or equivalent data efficiency with respect to the classic function approximation solutions.
2023
Proceedings of The 5th Annual Learning for Dynamics and Control Conference
941
953
Francesco De Lellis, M.C. (2023). CT-DQN: Control-Tutored Deep Reinforcement Learning.
Francesco De Lellis, Marco Coraggio, Giovanni Russo, Mirco Musolesi, Mario di Bernardo
File in questo prodotto:
File Dimensione Formato  
l4dc23 (1).pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per accesso libero gratuito
Dimensione 393.26 kB
Formato Adobe PDF
393.26 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/960139
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact