In this paper, we present a study on plankton classification for automated underwater ecosystems monitoring. The study considers the creation of ensembles combining different Convolutional Neural Network (CNN) models and transformer architectures to understand whether different optimization algorithms can result in more robust and efficient classification across various plankton datasets. Tests involved different variants of the Adam optimizer and multiple learning rate variation strategies applied to several CNN architectures, building an ensemble of classifiers. Such ensembles were tested together with transformer-based models in a detailed comparative analysis considering feature extraction efficiency, computational cost, and robustness to species imbalances. The study highlights the performance of individual nets and ensembles on multiple plankton datasets, and discusses the potential for generalizing this approach to broader aquatic ecosystems. Experiments demonstrate that combining diverse neural network models in a heterogeneous ensemble significantly improves performance with respect to other state-of-the-art approaches across all the problems considered. Final results show that the ensemble-based approach achieves a remarkable accuracy improvement over individual CNN models and over standalone Vision Transformers.
Nanni, L., Lumini, A., Barcellona, L., Ghidoni, S. (2025). Convolutional neural networks and vision transformers for Plankton Classification. ECOLOGICAL INFORMATICS, 90, 1-19 [10.1016/j.ecoinf.2025.103272].
Convolutional neural networks and vision transformers for Plankton Classification
Lumini A.
;
2025
Abstract
In this paper, we present a study on plankton classification for automated underwater ecosystems monitoring. The study considers the creation of ensembles combining different Convolutional Neural Network (CNN) models and transformer architectures to understand whether different optimization algorithms can result in more robust and efficient classification across various plankton datasets. Tests involved different variants of the Adam optimizer and multiple learning rate variation strategies applied to several CNN architectures, building an ensemble of classifiers. Such ensembles were tested together with transformer-based models in a detailed comparative analysis considering feature extraction efficiency, computational cost, and robustness to species imbalances. The study highlights the performance of individual nets and ensembles on multiple plankton datasets, and discusses the potential for generalizing this approach to broader aquatic ecosystems. Experiments demonstrate that combining diverse neural network models in a heterogeneous ensemble significantly improves performance with respect to other state-of-the-art approaches across all the problems considered. Final results show that the ensemble-based approach achieves a remarkable accuracy improvement over individual CNN models and over standalone Vision Transformers.| File | Dimensione | Formato | |
|---|---|---|---|
|
main.pdf
accesso aperto
Tipo:
Versione (PDF) editoriale / Version Of Record
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale - Non opere derivate (CCBYNCND)
Dimensione
2.96 MB
Formato
Adobe PDF
|
2.96 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


