Crop recommendation is a key challenge in precision agriculture, aiming to match soil and environmental parameters with the most suitable crop variety. In this work, I present a lightweight exponential-weighted ensemble framework that combines multiple base classifiers—K-Nearest Neighbor (KNN), Decision Tree (DT), Random Forest (RF), Support Vector Machine (SVM), Naive Bayes (NB), Logistic Regression (LR), Extreme Gradient Boosting (XGBoost)—through a simple yet effective reweighting scheme applied offline. This method assigns exponential penalties based on each model’s validation error and then aggregates their probability outputs in the test phase. Unlike more complex stacking or multilayer ensembles, the proposed approach does not require a separate meta-learner, enabling straightforward implementation and reduced computational overhead. The system is evaluated on a well-known dataset available on Kaggle, achieving a final accuracy of 99.8%, which meets or exceeds the performance of more elaborate ensembles. Although the current study focuses on an offline learning setting, the exponential weighting mechanism naturally extends to online or incremental learning scenarios, wherein model weights can be easily updated as new data arrive. This adaptability, coupled with excellent predictive accuracy, makes the approach highly suitable for real-world smart farming applications where conditions and data streams are subject to continuous change.
Stracqualursi, L. (2025). A Lightweight Exponential-Weighted Ensemble for Crop Recommendation. JOURNAL OF AGRICULTURAL, BIOLOGICAL, AND ENVIRONMENTAL STATISTICS, Online first, 1-17 [10.1007/s13253-025-00694-6].
A Lightweight Exponential-Weighted Ensemble for Crop Recommendation
Stracqualursi Luisa
Primo
2025
Abstract
Crop recommendation is a key challenge in precision agriculture, aiming to match soil and environmental parameters with the most suitable crop variety. In this work, I present a lightweight exponential-weighted ensemble framework that combines multiple base classifiers—K-Nearest Neighbor (KNN), Decision Tree (DT), Random Forest (RF), Support Vector Machine (SVM), Naive Bayes (NB), Logistic Regression (LR), Extreme Gradient Boosting (XGBoost)—through a simple yet effective reweighting scheme applied offline. This method assigns exponential penalties based on each model’s validation error and then aggregates their probability outputs in the test phase. Unlike more complex stacking or multilayer ensembles, the proposed approach does not require a separate meta-learner, enabling straightforward implementation and reduced computational overhead. The system is evaluated on a well-known dataset available on Kaggle, achieving a final accuracy of 99.8%, which meets or exceeds the performance of more elaborate ensembles. Although the current study focuses on an offline learning setting, the exponential weighting mechanism naturally extends to online or incremental learning scenarios, wherein model weights can be easily updated as new data arrive. This adaptability, coupled with excellent predictive accuracy, makes the approach highly suitable for real-world smart farming applications where conditions and data streams are subject to continuous change.| File | Dimensione | Formato | |
|---|---|---|---|
|
Stracqualursi-2025-Journal_of_Agricultural,_Biological_and_Environmental_Statistics.pdf
accesso aperto
Tipo:
Versione (PDF) editoriale / Version Of Record
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione
578.53 kB
Formato
Adobe PDF
|
578.53 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


