The data vault model natively supports data and schema evolution, so it is often adopted to create operational data stores. However, it can hardly be directly used for OLAP querying. In this paper we propose an approach called Starry Vault for finding a multidimensional structure in data vaults. Starry Vault builds on the specific features of the data vault model to automate multidimensional modeling, and uses approximate functional dependencies to discover out of data the information necessary to infer the structure of multidimensional hierarchies. The manual intervention by the user is limited to some editing of the resulting multidimensional schemata, which makes the overall process simple and quick enough to be compatible with the situational analysis needs of a data scientist.
Golfarelli, M., Graziani, S., Rizzi, S. (2016). Starry Vault: Automating Multidimensional Modeling from Data Vaults. Springer [10.1007/978-3-319-44039-2_10].
Starry Vault: Automating Multidimensional Modeling from Data Vaults
GOLFARELLI, MATTEO;GRAZIANI, SIMONE;RIZZI, STEFANO
2016
Abstract
The data vault model natively supports data and schema evolution, so it is often adopted to create operational data stores. However, it can hardly be directly used for OLAP querying. In this paper we propose an approach called Starry Vault for finding a multidimensional structure in data vaults. Starry Vault builds on the specific features of the data vault model to automate multidimensional modeling, and uses approximate functional dependencies to discover out of data the information necessary to infer the structure of multidimensional hierarchies. The manual intervention by the user is limited to some editing of the resulting multidimensional schemata, which makes the overall process simple and quick enough to be compatible with the situational analysis needs of a data scientist.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.