Predicting fishing activity from vessel tracking data is crucial for quantifying fishing effort. This study addresses this challenge by classifying the fishing versus non-fishing activity status of small-scale vessels using passive gears, with a suite of different algorithms, ranging from basic statistics (Logistic Regression - LoRe) to Machine Learning (Decision Trees - Dtree, Random Forests - RaFo, and Extreme Gradient Boosting - XGBo). Results demonstrate that the Machine Learning (ML) ensemble significantly outperformed LoRe, especially with XGBo and Dtree achieving comparable high accuracy and robustness across training, validation, and test sets. By employing SHAP (SHapley Additive exPlanations), we demonstrate that the vessel speed (SPEED) and course variations (course_diff), the hour of the day (hours), and the distance from the coast (distance_from_coast) or the bathymetric depth (depth), are the primary mechanistic drivers for discerning fishing operations in passive-gear small-scale fisheries (SSF). We provide a fully reproducible workflow and a unique, high-resolution dataset of manually labelled tracking data to address the critical scarcity of validated resources in this field. This framework provides a timely, scalable solution for high-resolution tracking analysis, directly addressing the technical needs arising from upcoming EU mandates (Control Regulation 2023/2842) for small-scale vessel monitoring. The shared code and data enable researchers to evaluate model transferability and generalisation, providing a standardised approach to harmonise fishing effort estimation across diverse geographic contexts. Finally, the provided code is structured as an accessible framework for fisheries scientists with limited ML experience, offering a practical foundation for implementing automated activity classification.

Lattanzi, P., Vasapollo, C., Samarão, J., Galdelli, A., Mendo, T., Rufino, M., et al. (2026). Predicting fishing vs. not-fishing in small-scale fisheries: A sample vessel tracking dataset and a reproducible machine learning approach. SOFTWAREX, 34, 1-8 [10.1016/j.softx.2026.102579].

Predicting fishing vs. not-fishing in small-scale fisheries: A sample vessel tracking dataset and a reproducible machine learning approach

Pamela Lattanzi
Primo
;
Luca Bolognini;
2026

Abstract

Predicting fishing activity from vessel tracking data is crucial for quantifying fishing effort. This study addresses this challenge by classifying the fishing versus non-fishing activity status of small-scale vessels using passive gears, with a suite of different algorithms, ranging from basic statistics (Logistic Regression - LoRe) to Machine Learning (Decision Trees - Dtree, Random Forests - RaFo, and Extreme Gradient Boosting - XGBo). Results demonstrate that the Machine Learning (ML) ensemble significantly outperformed LoRe, especially with XGBo and Dtree achieving comparable high accuracy and robustness across training, validation, and test sets. By employing SHAP (SHapley Additive exPlanations), we demonstrate that the vessel speed (SPEED) and course variations (course_diff), the hour of the day (hours), and the distance from the coast (distance_from_coast) or the bathymetric depth (depth), are the primary mechanistic drivers for discerning fishing operations in passive-gear small-scale fisheries (SSF). We provide a fully reproducible workflow and a unique, high-resolution dataset of manually labelled tracking data to address the critical scarcity of validated resources in this field. This framework provides a timely, scalable solution for high-resolution tracking analysis, directly addressing the technical needs arising from upcoming EU mandates (Control Regulation 2023/2842) for small-scale vessel monitoring. The shared code and data enable researchers to evaluate model transferability and generalisation, providing a standardised approach to harmonise fishing effort estimation across diverse geographic contexts. Finally, the provided code is structured as an accessible framework for fisheries scientists with limited ML experience, offering a practical foundation for implementing automated activity classification.
2026
Lattanzi, P., Vasapollo, C., Samarão, J., Galdelli, A., Mendo, T., Rufino, M., et al. (2026). Predicting fishing vs. not-fishing in small-scale fisheries: A sample vessel tracking dataset and a reproducible machine learning approach. SOFTWAREX, 34, 1-8 [10.1016/j.softx.2026.102579].
Lattanzi, Pamela; Vasapollo, Claudio; Samarão, Joao; Galdelli, Alessandro; Mendo, Tania; Rufino, Marta; Bolognini, Luca; Nora Tassetti, Anna...espandi
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S2352711026000737-main.pdf

accesso aperto

Tipo: Versione (PDF) editoriale / Version Of Record
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 2.48 MB
Formato Adobe PDF
2.48 MB Adobe PDF Visualizza/Apri
ScienceDirect_files_14May2026_11-44-36.521.zip

accesso aperto

Tipo: File Supplementare
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 2.8 MB
Formato Zip File
2.8 MB Zip File Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1052672
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact