Categorical data as a stone guest in a data science project for
  predicting defective water meters

Delnevo, Giovanni; Roccetti, Marco; Casini, Luca

After a one-year long effort of research on the field, we developed a machine learning-based classifier, tailored to predict whether a mechanical water meter would fail with passage of time and intensive use as well. A recurrent deep neural network (RNN) was trained with data extrapolated from 15 million readings of water consumption, gathered from 1 million meters. The data we used for training were essentially of two types: continuous vs categorical. Categorical being a type of data that can take on one of a limited and fixed number of possible values, on the basis of some qualitative property; while continuous, in this case, are the values of the measurements. taken at the meters, of the quantity of consumed water (cubic meters). In this paper, we want to discuss the fact that while the prediction accuracy of our RNN has exceeded the 80% on average, based on the use of continuous data, those performances did not improve, significantly, with the introduction of categorical information during the training phase. From a specific viewpoint, this remains an unsolved and critical problem of our research. Yet, if we reason about this controversial case from a data science perspective, we realize that we have had a confirmation that accurate machine learning solutions cannot be built without the participation of domain experts, who can differentiate on the importance of (the relation between) different types of data, each with its own sense, validity, and implications. Past all the original hype, the science of data is thus evolving towards a multifaceted discipline, where the designitations of data scientist/machine learning expert and domain expert are symbiotic

Giovanni Delnevo, Marco Roccetti, Luca Casini (2020). Categorical data as a stone guest in a data science project for predicting defective water meters. Ghent : Eurosis.

Categorical data as a stone guest in a data science project for predicting defective water meters

Giovanni Delnevo;Marco Roccetti;Luca Casini

2020

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2020
			
	Titolo del volume
	
				Proceedings SCIFI-IT'2020 - 4th Annual Science Fiction Prototyping Conference
			
	Pagina iniziale
	
				24
			
	Pagina finale
	
				26
			
	Citazione
	
				Giovanni Delnevo,  Marco Roccetti,  Luca Casini (2020). Categorical data as a stone guest in a data science project for
  predicting defective water meters. Ghent : Eurosis.
			
	Tutti gli autori
	
						Giovanni Delnevo; Marco Roccetti; Luca Casini
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/815074

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

CRIS Current Research Information System

Categorical data as a stone guest in a data science project for predicting defective water meters

Giovanni Delnevo;Marco Roccetti;Luca Casini

2020

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Attenzione

Citazioni

social impact

CRIS Current Research Information System

Categorical data as a stone guest in a data science project for predicting defective water meters

Giovanni Delnevo;Marco Roccetti;Luca Casini

2020

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Attenzione

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)