The data vault model natively supports data and schema evolution, so it is often adopted to create operational data stores. However, it is poorly suited to direct OLAP querying. In this paper we propose an approach called Starry Vault for finding a multidimensional structure in data vaults. Starry Vault builds on the specific features of the data vault model to automate multidimensional modeling, and uses approximate functional dependencies to discover from the data the information necessary to infer the structure of multidimensional hierarchies. Manual intervention by the user is limited to some editing of the resulting multidimensional schemata, which makes the overall process simple and quick enough to be compatible with the situational analysis needs of a data scientist.

Starry Vault: Automating Multidimensional Modeling from Data Vaults / Golfarelli, Matteo; Graziani, Simone; Rizzi, Stefano. - PRINT. - 9809:(2016), pp. 137-151. (Paper presented at the 20th East-European Conference on Advances in Databases and Information Systems (ADBIS 2016), held in Prague, Czech Republic, August 28 – 31, 2016) [10.1007/978-3-319-44039-2_10].

Starry Vault: Automating Multidimensional Modeling from Data Vaults

GOLFARELLI, MATTEO;GRAZIANI, SIMONE;RIZZI, STEFANO
2016

In: Proceedings 20th East-European Conference on Advances in Databases and Information Systems, 2016, pp. 137-151

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/561471