We conduct a large-scale benchmark experiment aiming to advance the use of machinelearning quantile regression algorithms for probabilistic hydrological post-processing "at scale" within operational contexts. The experiment is set up using 34-year-long daily time series of precipitation, temperature, evapotranspiration and streamflow for 511 catchments over the contiguous United States. Point hydrological predictions are obtained using the Génie Rural à 4 paramètres Journalier (GR4J) hydrological model and exploited as predictor variables within quantile regression settings. Six machine-learning quantile regression algorithms and their equal-weight combiner are applied to predict conditional quantiles of the hydrological model errors. The individual algorithms are quantile regression, generalized random forests for quantile regression, generalized random forests for quantile regression emulating quantile regression forests, gradient boosting machine, model-based boosting with linear models as base learners and quantile regression neural networks. The conditional quantiles of the hydrological model errors are transformed to conditional quantiles of daily streamflow, which are finally assessed using proper performance scores and benchmarking. The assessment concerns various levels of predictive quantiles and central prediction intervals, while it is made both independently of the flow magnitude andconditional upon thismagnitude. Key aspects of thedevelopedmethodological framework are highlighted, and practical recommendations are formulated. In technical hydro-meteorological applications, the algorithms should be applied preferably in a way that maximizes the benefits and reduces the risks fromtheir use. This can be achieved by (i) combining algorithms (e.g., by averaging their predictions) and (ii) integrating algorithms within systematic frameworks (i.e., by using the algorithms according to their identified skills), as our large-scale results point out.

Probabilistic hydrological post-processing at scale: Why and how to apply machine-learning quantile regression algorithms / Papacharalampous G.; Tyralis H.; Langousis A.; Jayawardena A.W.; Sivakumar B.; Mamassis N.; Montanari A.; Koutsoyiannis D.. - In: WATER. - ISSN 2073-4441. - ELETTRONICO. - 11:10(2019), pp. 2126.1-2126.43. [10.3390/w11102126]

Probabilistic hydrological post-processing at scale: Why and how to apply machine-learning quantile regression algorithms

Montanari A.;
2019

Abstract

We conduct a large-scale benchmark experiment aiming to advance the use of machinelearning quantile regression algorithms for probabilistic hydrological post-processing "at scale" within operational contexts. The experiment is set up using 34-year-long daily time series of precipitation, temperature, evapotranspiration and streamflow for 511 catchments over the contiguous United States. Point hydrological predictions are obtained using the Génie Rural à 4 paramètres Journalier (GR4J) hydrological model and exploited as predictor variables within quantile regression settings. Six machine-learning quantile regression algorithms and their equal-weight combiner are applied to predict conditional quantiles of the hydrological model errors. The individual algorithms are quantile regression, generalized random forests for quantile regression, generalized random forests for quantile regression emulating quantile regression forests, gradient boosting machine, model-based boosting with linear models as base learners and quantile regression neural networks. The conditional quantiles of the hydrological model errors are transformed to conditional quantiles of daily streamflow, which are finally assessed using proper performance scores and benchmarking. The assessment concerns various levels of predictive quantiles and central prediction intervals, while it is made both independently of the flow magnitude andconditional upon thismagnitude. Key aspects of thedevelopedmethodological framework are highlighted, and practical recommendations are formulated. In technical hydro-meteorological applications, the algorithms should be applied preferably in a way that maximizes the benefits and reduces the risks fromtheir use. This can be achieved by (i) combining algorithms (e.g., by averaging their predictions) and (ii) integrating algorithms within systematic frameworks (i.e., by using the algorithms according to their identified skills), as our large-scale results point out.
2019
Probabilistic hydrological post-processing at scale: Why and how to apply machine-learning quantile regression algorithms / Papacharalampous G.; Tyralis H.; Langousis A.; Jayawardena A.W.; Sivakumar B.; Mamassis N.; Montanari A.; Koutsoyiannis D.. - In: WATER. - ISSN 2073-4441. - ELETTRONICO. - 11:10(2019), pp. 2126.1-2126.43. [10.3390/w11102126]
Papacharalampous G.; Tyralis H.; Langousis A.; Jayawardena A.W.; Sivakumar B.; Mamassis N.; Montanari A.; Koutsoyiannis D.
File in questo prodotto:
File Dimensione Formato  
water-11-02126.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 6.46 MB
Formato Adobe PDF
6.46 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/718403
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 48
  • ???jsp.display-item.citation.isi??? 40
social impact