CRIS Current Research Information System

Attitude control of a novel regional truss-braced wing (TBW) aircraft with low stability characteristics is addressed in this paper using Reinforcement Learning (RL). In recent years, RL has been increasingly employed in challenging applications, particularly, autonomous flight control. However, a significant predicament confronting discrete RL algorithms is the dimension limitation of the state-action table and difficulties in defining the elements of the RL environment. To address these issues, in this paper, a detailed mathematical model of the mentioned aircraft is first developed to shape an RL environment. Subsequently, Q-learning, the most prevalent discrete RL algorithm, will be implemented in both the Markov Decision Process (MDP) and Partially Observable Markov Decision Process (POMDP) frameworks to control the longitudinal mode of the proposed aircraft. In order to eliminate residual fluctuations that are a consequence of discrete action selection, and simultaneously track variable pitch angles, a Fuzzy Action Assignment (FAA) method is proposed to generate continuous control commands using the trained optimal Q-table. Accordingly, it will be proved that by defining a comprehensive reward function based on dynamic behavior considerations, along with observing all crucial states (equivalent to satisfying the Markov Property), the air vehicle would be capable of tracking the desired attitude in the presence of different uncertain dynamics including measurement noises, atmospheric disturbances, actuator faults, and model uncertainties where the performance of the introduced control system surpasses a well-tuned Proportional–Integral–Derivative (PID) controller.

Robust Attitude Control of an Agile Aircraft Using Improved Q-Learning / Zahmatkesh M.; Emami S.A.; Banazadeh A.; Castaldi P.. - In: ACTUATORS. - ISSN 2076-0825. - ELETTRONICO. - 11:12(2022), pp. 374.1-374.17. [10.3390/act11120374]

Robust Attitude Control of an Agile Aircraft Using Improved Q-Learning

Zahmatkesh M.^Software;Emami S. A.^Software;Castaldi P.^{Ultimo

Conceptualization}

2022

Abstract

Attitude control of a novel regional truss-braced wing (TBW) aircraft with low stability characteristics is addressed in this paper using Reinforcement Learning (RL). In recent years, RL has been increasingly employed in challenging applications, particularly, autonomous flight control. However, a significant predicament confronting discrete RL algorithms is the dimension limitation of the state-action table and difficulties in defining the elements of the RL environment. To address these issues, in this paper, a detailed mathematical model of the mentioned aircraft is first developed to shape an RL environment. Subsequently, Q-learning, the most prevalent discrete RL algorithm, will be implemented in both the Markov Decision Process (MDP) and Partially Observable Markov Decision Process (POMDP) frameworks to control the longitudinal mode of the proposed aircraft. In order to eliminate residual fluctuations that are a consequence of discrete action selection, and simultaneously track variable pitch angles, a Fuzzy Action Assignment (FAA) method is proposed to generate continuous control commands using the trained optimal Q-table. Accordingly, it will be proved that by defining a comprehensive reward function based on dynamic behavior considerations, along with observing all crucial states (equivalent to satisfying the Markov Property), the air vehicle would be capable of tracking the desired attitude in the presence of different uncertain dynamics including measurement noises, atmospheric disturbances, actuator faults, and model uncertainties where the performance of the introduced control system surpasses a well-tuned Proportional–Integral–Derivative (PID) controller.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
			2022
		
	Rivista
	
			ACTUATORS
		
	Codice DOI
	
			https://dx.doi.org/10.3390/act11120374
		
	Citazione
	
			Robust Attitude Control of an Agile Aircraft Using Improved Q-Learning / Zahmatkesh M.; Emami S.A.; Banazadeh A.; Castaldi P.. - In: ACTUATORS. - ISSN 2076-0825. - ELETTRONICO. - 11:12(2022), pp. 374.1-374.17. [10.3390/act11120374]
		
	Tutti gli autori
	
			Zahmatkesh M.; Emami S.A.; Banazadeh A.; Castaldi P.
		
	Appare nelle tipologie:
	
			1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
CASTALDI_actuators-11-00374-with-cover.pdf accesso aperto Descrizione: Actuators Q Reinforcement Learning Tipo: Versione (PDF) editoriale Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY) Dimensione 1.46 MB Formato Adobe PDF Visualizza/Apri	1.46 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/916158

Citazioni

ND

2

2

social impact