CRIS Current Research Information System

We consider Inverse Reinforcement Learning (IRL) about multiple intentions, i.e., the problem of estimating the unknown reward functions optimized by a group of experts that demonstrate optimal behaviors. Most of the existing algorithms either require access to a model of the environment or need to repeatedly compute the optimal policies for the hypothesized rewards. However, these requirements are rarely met in real-world applications, in which interacting with the environment can be expensive or even dangerous. In this paper, we address the IRL about multiple intentions in a fully model-free and batch setting. We first cast the single IRL problem as a constrained likelihood maximization and then we use this formulation to cluster agents based on the likelihood of the assignment. In this way, we can efficiently solve, without interactions with the environment, both the IRL and the clustering problem. Finally, we evaluate the proposed methodology on simulated domains and on a real-world social-network application.

Ramponi, G., Likmeta, A., Metelli, A.M., Tirinzoni, A., Restelli, M. (2020). Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions. 75 ARLINGTON ST, STE 300, BOSTON, MA 02116-3936 USA : ADDISON-WESLEY PUBL CO.

Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions

Ramponi, G;Likmeta, A;Metelli, AM;Tirinzoni, A;Restelli, M

2020

Abstract

We consider Inverse Reinforcement Learning (IRL) about multiple intentions, i.e., the problem of estimating the unknown reward functions optimized by a group of experts that demonstrate optimal behaviors. Most of the existing algorithms either require access to a model of the environment or need to repeatedly compute the optimal policies for the hypothesized rewards. However, these requirements are rarely met in real-world applications, in which interacting with the environment can be expensive or even dangerous. In this paper, we address the IRL about multiple intentions in a fully model-free and batch setting. We first cast the single IRL problem as a constrained likelihood maximization and then we use this formulation to cluster agents based on the likelihood of the assignment. In this way, we can efficiently solve, without interactions with the environment, both the IRL and the clustering problem. Finally, we evaluate the proposed methodology on simulated domains and on a real-world social-network application.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2020
			
	Titolo del volume
	
				Proceedings of Machine Learning Research
			
	Pagina iniziale
	
				2359
			
	Pagina finale
	
				2368
			
	Collana/Serie
	
				PROCEEDINGS OF MACHINE LEARNING RESEARCH
			
	Citazione
	
				Ramponi, G., Likmeta, A., Metelli, A.M., Tirinzoni, A., Restelli, M. (2020). Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions. 75 ARLINGTON ST, STE 300, BOSTON, MA 02116-3936 USA : ADDISON-WESLEY PUBL CO.
			
	Tutti gli autori
	
						Ramponi, G; Likmeta, A; Metelli, AM; Tirinzoni, A; Restelli, M
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/801940

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

ND

16

social impact