Accurate forecasting of crop yield anomalies is essential for global food security and effective agricultural policy-making, as it enables the separation of climate-driven impacts from baseline regional productivity. This study proposes a transparent and lightweight ensemble forecasting pipeline, termed Lightweight Exponential-Weighted Regression (LEWR), for predicting such yield deviations. To account for time-invariant country-specific characteristics, a cross-sectional demeaning procedure is first applied to the panel dataset. The procedure then integrates heterogeneous regression models—including Random Forest, XGBoost, and Support Vector Regressors—through exponentially decaying weights derived from validation errors, emphasizing higher-performing predictors without employing a second-layer meta-learner. The performance of the proposed method is evaluated on a hold-out test set (2009–2013) drawn from a panel dataset covering multiple countries and years (1990–2013) and benchmarked not only against individual base models but also against established ensemble methods. Results show that the procedure achieves competitive accuracy in forecasting yield deviations compared to more complex methods, while offering notable advantages in simplicity and computational efficiency. Furthermore, it supports incremental updates and provides a straightforward mechanism for quantifying predictive uncertainty—a critical factor in modern environmental risk assessment.
Stracqualursi, L. (2026). Forecasting Crop Yield Anomalies on Panel Data via Spatially Demeaned Ensembles. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 5, 1-23 [10.1007/s13253-026-00743-8].
Forecasting Crop Yield Anomalies on Panel Data via Spatially Demeaned Ensembles
Stracqualursi, Luisa
Primo
2026
Abstract
Accurate forecasting of crop yield anomalies is essential for global food security and effective agricultural policy-making, as it enables the separation of climate-driven impacts from baseline regional productivity. This study proposes a transparent and lightweight ensemble forecasting pipeline, termed Lightweight Exponential-Weighted Regression (LEWR), for predicting such yield deviations. To account for time-invariant country-specific characteristics, a cross-sectional demeaning procedure is first applied to the panel dataset. The procedure then integrates heterogeneous regression models—including Random Forest, XGBoost, and Support Vector Regressors—through exponentially decaying weights derived from validation errors, emphasizing higher-performing predictors without employing a second-layer meta-learner. The performance of the proposed method is evaluated on a hold-out test set (2009–2013) drawn from a panel dataset covering multiple countries and years (1990–2013) and benchmarked not only against individual base models but also against established ensemble methods. Results show that the procedure achieves competitive accuracy in forecasting yield deviations compared to more complex methods, while offering notable advantages in simplicity and computational efficiency. Furthermore, it supports incremental updates and provides a straightforward mechanism for quantifying predictive uncertainty—a critical factor in modern environmental risk assessment.| File | Dimensione | Formato | |
|---|---|---|---|
|
JABE2026 ARTICLE.pdf
accesso aperto
Tipo:
Versione (PDF) editoriale / Version Of Record
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione
1.11 MB
Formato
Adobe PDF
|
1.11 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



