Multistores are data management systems that facilitate query processing across databases based on different data models; in addition to distributing data, integration and data fusion activities are necessary to address complexities such as schema heterogeneity and data replication. Our multistore solution relies on a dataspace to provide the user with an integrated view of the available data and enables the formulation and execution of GPSJ queries. In this paper, we outline a technique to optimize the execution of GPSJ queries by formulating and evaluating different execution plans on the multistore. In particular, we identify different strategies to carry out joins and data fusion by relying on different schema representations; then, a self-learning black-box cost model is used to estimate execution times and select the most efficient plan. The experiments assess the effectiveness of the cost model in choosing the best execution plan.

On the Optimization of Query Plans in Multistores (Discussion Paper) / Forresi, C.; Francia, M.; Gallinucci, E.; Golfarelli, M.. - ELETTRONICO. - 3478:(2023), pp. 391-400. (Intervento presentato al convegno 31st Symposium of Advanced Database Systems 2023 (SEBD 2023) tenutosi a Galzingano Terme, Italy nel July 2nd to 5th, 2023).

On the Optimization of Query Plans in Multistores (Discussion Paper)

Forresi, C.
;
Francia, M.;Gallinucci, E.;Golfarelli, M.
2023

Abstract

Multistores are data management systems that facilitate query processing across databases based on different data models; in addition to distributing data, integration and data fusion activities are necessary to address complexities such as schema heterogeneity and data replication. Our multistore solution relies on a dataspace to provide the user with an integrated view of the available data and enables the formulation and execution of GPSJ queries. In this paper, we outline a technique to optimize the execution of GPSJ queries by formulating and evaluating different execution plans on the multistore. In particular, we identify different strategies to carry out joins and data fusion by relying on different schema representations; then, a self-learning black-box cost model is used to estimate execution times and select the most efficient plan. The experiments assess the effectiveness of the cost model in choosing the best execution plan.
2023
Proceedings of the 31st Symposium of Advanced Database Systems
391
400
On the Optimization of Query Plans in Multistores (Discussion Paper) / Forresi, C.; Francia, M.; Gallinucci, E.; Golfarelli, M.. - ELETTRONICO. - 3478:(2023), pp. 391-400. (Intervento presentato al convegno 31st Symposium of Advanced Database Systems 2023 (SEBD 2023) tenutosi a Galzingano Terme, Italy nel July 2nd to 5th, 2023).
Forresi, C.; Francia, M.; Gallinucci, E.; Golfarelli, M.
File in questo prodotto:
File Dimensione Formato  
paper05.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 1.45 MB
Formato Adobe PDF
1.45 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/946057
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact