CRIS Current Research Information System

Cooperative Multi-Agent Deep Reinforcement Learning for Resource Management in Full Flexible VHTS Systems

Ortiz-Gomez, Flor G.; Tarchi, Daniele; Martinez, Ramon; Vanelli-Coralli, Alessandro; Salas-Natera, Miguel A.; Landeros-Ayala, Salvador

2022

Abstract

Very high throughput satellite (VHTS) systems are expected to see a sharp increase in traffic demand in the near future. This increase will not be uniform over the service area, because users are unevenly distributed and traffic demand changes during the day. Flexible payload architectures address this problem by allowing payload resources to be allocated according to the traffic demand of each beam, which leads to dynamic resource management (DRM) approaches. However, DRM adds significant complexity to VHTS systems. In this paper we therefore discuss the use of one reinforcement learning (RL) algorithm and two deep reinforcement learning (DRL) algorithms to manage the resources available in flexible payload architectures for DRM. These algorithms are Q-Learning (QL), Deep Q-Learning (DQL) and Double Deep Q-Learning (DDQL), which are compared in terms of performance, complexity and added latency. In addition, this work demonstrates the superiority of a cooperative multi-agent (CMA) decentralized distribution over a single agent (SA).
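
As a reading aid, the three algorithms named in the abstract differ mainly in how they form the learning target. The sketch below is a minimal illustration of the textbook update rules, not the authors' implementation: the function names, the hyperparameters `alpha` and `gamma`, and the toy array shapes are all assumptions introduced here. How states, actions and rewards map to beams and payload resources is defined in the paper and is not reproduced.

```python
import numpy as np

def q_learning_update(q_table, state, action, reward, next_state,
                      alpha=0.1, gamma=0.9):
    """One tabular Q-Learning (QL) step on a 2-D table indexed [state, action]."""
    # Target: immediate reward plus discounted best value of the next state.
    td_target = reward + gamma * np.max(q_table[next_state])
    q_table[state, action] += alpha * (td_target - q_table[state, action])
    return q_table

def dql_target(reward, q_target_next, gamma=0.9):
    """DQL target: the target network both selects and evaluates the next action."""
    return reward + gamma * np.max(q_target_next)

def ddql_target(reward, q_online_next, q_target_next, gamma=0.9):
    """DDQL target: the online network selects the action, the target network evaluates it."""
    best_action = int(np.argmax(q_online_next))
    return reward + gamma * q_target_next[best_action]
```

In this sketch `q_online_next` and `q_target_next` stand for the per-action value estimates that two neural networks would output for the next state. Separating action selection from evaluation, as in `ddql_target`, is the standard way Double Deep Q-Learning reduces the overestimation of action values.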

Year	2022
Journal	IEEE Transactions on Cognitive Communications and Networking
DOI	https://dx.doi.org/10.1109/TCCN.2021.3087586
Citation	Ortiz-Gomez, F.G., Tarchi, D., Martinez, R., Vanelli-Coralli, A., Salas-Natera, M.A., Landeros-Ayala, S. (2022). Cooperative Multi-Agent Deep Reinforcement Learning for Resource Management in Full Flexible VHTS Systems. IEEE Transactions on Cognitive Communications and Networking, 8(1), 335-349 [10.1109/TCCN.2021.3087586].
All authors	Ortiz-Gomez, Flor G.; Tarchi, Daniele; Martinez, Ramon; Vanelli-Coralli, Alessandro; Salas-Natera, Miguel A.; Landeros-Ayala, Salvador
Appears in types	1.01 Journal article

Files in this item:

File	Size	Format
Cooperative_Multi-Agent_Deep_Reinforcement_Learning_for_Resource_Management_in_Full_Flexible_VHTS_Systems.pdf (open access; type: publisher's PDF / Version of Record; license: Open Access License, Creative Commons Attribution (CC BY))	2.84 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/821718
