Skip to main navigation Skip to search Skip to main content

Predicting fishing vs. not-fishing in small-scale fisheries: a sample vessel tracking dataset and a reproducible machine learning approach

Pamela Lattanzi*, Claudio Vasapollo, Joao Samarão, Alessandro Galdelli, Tania Mendo, Marta Rufino, Luca Bolognini, Anna Nora Tassetti

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Predicting fishing activity from vessel tracking data is crucial for quantifying fishing effort. This study addresses this challenge by classifying the fishing versus non-fishing activity status of small-scale vessels using passive gears, with a suite of different algorithms, ranging from basic statistics (Logistic Regression - LoRe) to Machine Learning (Decision Trees - Dtree, Random Forests - RaFo, and Extreme Gradient Boosting - XGBo). Results demonstrate that the Machine Learning (ML) ensemble significantly outperformed LoRe, especially with XGBo and Dtree achieving comparable high accuracy and robustness across training, validation, and test sets. By employing SHAP (SHapley Additive exPlanations), we demonstrate that the vessel speed (SPEED) and course variations (course_diff), the hour of the day (hours), and the distance from the coast (distance_from_coast) or the bathymetric depth (depth), are the primary mechanistic drivers for discerning fishing operations in passive-gear small-scale fisheries (SSF). We provide a fully reproducible workflow and a unique, high-resolution dataset of manually labelled tracking data to address the critical scarcity of validated resources in this field. This framework provides a timely, scalable solution for high-resolution tracking analysis, directly addressing the technical needs arising from upcoming EU mandates (Control Regulation 2023/2842) for small-scale vessel monitoring. The shared code and data enable researchers to evaluate model transferability and generalisation, providing a standardised approach to harmonise fishing effort estimation across diverse geographic contexts. Finally, the provided code is structured as an accessible framework for fisheries scientists with limited ML experience, offering a practical foundation for implementing automated activity classification.
Original languageEnglish
Article number102579
Number of pages8
JournalSoftwareX
Volume34
Early online date28 Feb 2026
DOIs
Publication statusE-pub ahead of print - 28 Feb 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 14 - Life Below Water
    SDG 14 Life Below Water

Keywords

  • Activity status
  • High-resolution tracking data
  • Machine learning classification
  • Passive gears
  • Prediction
  • Small-scale fisheries

Fingerprint

Dive into the research topics of 'Predicting fishing vs. not-fishing in small-scale fisheries: a sample vessel tracking dataset and a reproducible machine learning approach'. Together they form a unique fingerprint.

Cite this