CRIS Current Research Information System

In this paper, we investigate a data-driven framework to solve Linear Quadratic Regulator (LQR) problems when the dynamics is unknown, with the additional challenge of providing stability certificates for the overall learning and control scheme. Specifically, in the proposed on-policy learning framework, the control input is applied to the actual (unknown) linear system while iteratively optimized. We propose a learning and control procedure, termed Relearn LQR, that combines a recursive least squares method with a direct policy search based on the gradient method. The resulting scheme is analyzed by modeling it as a feedback-interconnected nonlinear dynamical system. A Lyapunov-based approach, exploiting averaging and timescale separation theories for nonlinear systems, allows us to provide formal stability guarantees for the whole interconnected scheme. The effectiveness of the proposed strategy is corroborated by numerical simulations, where Relearn LQR is deployed on an aircraft control problem, with both static and drifting parameters.

Sforni, L., Carnevale, G., Notarnicola, I., Notarstefano, G. (2026). Stability-certified on-policy data-driven LQR via recursive learning and policy gradient. AUTOMATICA, 191, 1-12 [10.1016/j.automatica.2026.113101].

Stability-certified on-policy data-driven LQR via recursive learning and policy gradient

Sforni, Lorenzo;Carnevale, Guido;Notarnicola, Ivano;Notarstefano, Giuseppe

2026

Abstract

In this paper, we investigate a data-driven framework to solve Linear Quadratic Regulator (LQR) problems when the dynamics is unknown, with the additional challenge of providing stability certificates for the overall learning and control scheme. Specifically, in the proposed on-policy learning framework, the control input is applied to the actual (unknown) linear system while iteratively optimized. We propose a learning and control procedure, termed Relearn LQR, that combines a recursive least squares method with a direct policy search based on the gradient method. The resulting scheme is analyzed by modeling it as a feedback-interconnected nonlinear dynamical system. A Lyapunov-based approach, exploiting averaging and timescale separation theories for nonlinear systems, allows us to provide formal stability guarantees for the whole interconnected scheme. The effectiveness of the proposed strategy is corroborated by numerical simulations, where Relearn LQR is deployed on an aircraft control problem, with both static and drifting parameters.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2026
			
	Rivista
	
				AUTOMATICA
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.automatica.2026.113101
			
	Citazione
	
				Sforni, L., Carnevale, G., Notarnicola, I., Notarstefano, G. (2026). Stability-certified on-policy data-driven LQR via recursive learning and policy gradient. AUTOMATICA, 191, 1-12 [10.1016/j.automatica.2026.113101].
			
	Tutti gli autori
	
						Sforni, Lorenzo; Carnevale, Guido; Notarnicola, Ivano; Notarstefano, Giuseppe
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S0005109826002852-main.pdf accesso aperto Tipo: Versione (PDF) editoriale / Version Of Record Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY) Dimensione 1.88 MB Formato Adobe PDF Visualizza/Apri	1.88 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1067631

Citazioni

ND

ND

ND

1

social impact