We tackle the problem of goal-directed graph construction: given a starting graph, finding a set of edges whose addition maximally improves a global objective function. This problem emerges in many transportation and infrastructure networks that are of critical importance to society. We identify two significant shortcomings of present reinforcement learning methods: their exclusive focus on topology to the detriment of spatial characteristics (which are known to influence the growth and density of links), as well as the rapid growth in the action spaces and costs of model training. Our formulation as a deterministic Markov decision process allows us to adopt the Monte Carlo tree search framework, an artificial intelligence decision-time planning method. We propose improvements over the standard upper confidence bounds for trees (UCT) algorithm for this family of problems that addresses their single-agent nature, the trade-off between the cost of edges and their contribution to the objective, and an action space linear in the number of nodes. Our approach yields substantial improvements over UCT for increasing the efficiency and attack resilience of synthetic networks and real-world Internet backbone and metro systems, while using a wall clock time budget similar to other search-based algorithms. We also demonstrate that our approach scales to significantly larger networks than previous reinforcement learning methods, since it does not require training a model.

Planning spatial networks with Monte Carlo tree search / Darvariu, Victor-Alexandru; Hailes, Stephen; Musolesi, Mirco. - In: PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON. SERIES A. - ISSN 1364-5021. - ELETTRONICO. - 479:2269(2023), pp. 1-21. [10.1098/rspa.2022.0383]

Planning spatial networks with Monte Carlo tree search

Musolesi, Mirco
2023

Abstract

We tackle the problem of goal-directed graph construction: given a starting graph, finding a set of edges whose addition maximally improves a global objective function. This problem emerges in many transportation and infrastructure networks that are of critical importance to society. We identify two significant shortcomings of present reinforcement learning methods: their exclusive focus on topology to the detriment of spatial characteristics (which are known to influence the growth and density of links), as well as the rapid growth in the action spaces and costs of model training. Our formulation as a deterministic Markov decision process allows us to adopt the Monte Carlo tree search framework, an artificial intelligence decision-time planning method. We propose improvements over the standard upper confidence bounds for trees (UCT) algorithm for this family of problems that addresses their single-agent nature, the trade-off between the cost of edges and their contribution to the objective, and an action space linear in the number of nodes. Our approach yields substantial improvements over UCT for increasing the efficiency and attack resilience of synthetic networks and real-world Internet backbone and metro systems, while using a wall clock time budget similar to other search-based algorithms. We also demonstrate that our approach scales to significantly larger networks than previous reinforcement learning methods, since it does not require training a model.
2023
Planning spatial networks with Monte Carlo tree search / Darvariu, Victor-Alexandru; Hailes, Stephen; Musolesi, Mirco. - In: PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON. SERIES A. - ISSN 1364-5021. - ELETTRONICO. - 479:2269(2023), pp. 1-21. [10.1098/rspa.2022.0383]
Darvariu, Victor-Alexandru; Hailes, Stephen; Musolesi, Mirco
File in questo prodotto:
File Dimensione Formato  
darvariu-et-al-2023-planning-spatial-networks-with-monte-carlo-tree-search.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 947.02 kB
Formato Adobe PDF
947.02 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/960137
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
social impact