Abstract
Predicting fishing activity from vessel tracking data is crucial for quantifying fishing effort. This study addresses this challenge by classifying the fishing versus non-fishing activity status of small-scale vessels using passive gears, with a suite of different algorithms, ranging from basic statistics (Logistic Regression - LoRe) to Machine Learning (Decision Trees - Dtree, Random Forests - RaFo, and Extreme Gradient Boosting - XGBo). Results demonstrate that the Machine Learning (ML) ensemble significantly outperformed LoRe, especially with XGBo and Dtree achieving comparable high accuracy and robustness across training, validation, and test sets. By employing SHAP (SHapley Additive exPlanations), we demonstrate that the vessel speed (SPEED) and course variations (course_diff), the hour of the day (hours), and the distance from the coast (distance_from_coast) or the bathymetric depth (depth), are the primary mechanistic drivers for discerning fishing operations in passive-gear small-scale fisheries (SSF). We provide a fully reproducible workflow and a unique, high-resolution dataset of manually labelled tracking data to address the critical scarcity of validated resources in this field. This framework provides a timely, scalable solution for high-resolution tracking analysis, directly addressing the technical needs arising from upcoming EU mandates (Control Regulation 2023/2842) for small-scale vessel monitoring. The shared code and data enable researchers to evaluate model transferability and generalisation, providing a standardised approach to harmonise fishing effort estimation across diverse geographic contexts. Finally, the provided code is structured as an accessible framework for fisheries scientists with limited ML experience, offering a practical foundation for implementing automated activity classification.
| Original language | English |
|---|---|
| Article number | 102579 |
| Number of pages | 8 |
| Journal | SoftwareX |
| Volume | 34 |
| Early online date | 28 Feb 2026 |
| DOIs | |
| Publication status | E-pub ahead of print - 28 Feb 2026 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 14 Life Below Water
Keywords
- Activity status
- High-resolution tracking data
- Machine learning classification
- Passive gears
- Prediction
- Small-scale fisheries
Fingerprint
Dive into the research topics of 'Predicting fishing vs. not-fishing in small-scale fisheries: a sample vessel tracking dataset and a reproducible machine learning approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver