Golfarelli, M., Rizzi, S. (2017). From Star Schemas to Big Data: 20+ Years of Data Warehouse Research. Heidelberg: Springer.
From Star Schemas to Big Data: 20+ Years of Data Warehouse Research
Golfarelli, Matteo; Rizzi, Stefano
2017
Abstract
Data warehouses are the core of modern decision-making systems. They store integrated information extracted from various heterogeneous data sources and make it available in multidimensional form for analyses aimed at improving users' knowledge of their business. Although the term was first used in the 1980s, data warehousing emerged as a research area in its own right only in the late 1990s, in close connection with several other research topics such as database integration, view materialization, and data visualization. This paper surveys more than 20 years of research on data warehouse systems, from their early relational implementations (still widely adopted in corporate environments), to the new architectures prompted by Business Intelligence 2.0 scenarios during the last decade, and up to the exciting challenges now posed by integration with big data settings. The timeline of research is organized into three interrelated tracks: techniques, architectures, and methodologies.
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.