In this paper, we present a study on plankton classification for automated underwater ecosystems monitoring. The study considers the creation of ensembles combining different Convolutional Neural Network (CNN) models and transformer architectures to understand whether different optimization algorithms can result in more robust and efficient classification across various plankton datasets. Tests involved different variants of the Adam optimizer and multiple learning rate variation strategies applied to several CNN architectures, building an ensemble of classifiers. Such ensembles were tested together with transformer-based models in a detailed comparative analysis considering feature extraction efficiency, computational cost, and robustness to species imbalances. The study highlights the performance of individual nets and ensembles on multiple plankton datasets, and discusses the potential for generalizing this approach to broader aquatic ecosystems. Experiments demonstrate that combining diverse neural network models in a heterogeneous ensemble significantly improves performance with respect to other state-of-the-art approaches across all the problems considered. Final results show that the ensemble-based approach achieves a remarkable accuracy improvement over individual CNN models and over standalone Vision Transformers.

Nanni, L., Lumini, A., Barcellona, L., Ghidoni, S. (2025). Convolutional neural networks and vision transformers for Plankton Classification. ECOLOGICAL INFORMATICS, 90, 1-19 [10.1016/j.ecoinf.2025.103272].

Convolutional neural networks and vision transformers for Plankton Classification

Lumini A.
;
2025

Abstract

In this paper, we present a study on plankton classification for automated underwater ecosystems monitoring. The study considers the creation of ensembles combining different Convolutional Neural Network (CNN) models and transformer architectures to understand whether different optimization algorithms can result in more robust and efficient classification across various plankton datasets. Tests involved different variants of the Adam optimizer and multiple learning rate variation strategies applied to several CNN architectures, building an ensemble of classifiers. Such ensembles were tested together with transformer-based models in a detailed comparative analysis considering feature extraction efficiency, computational cost, and robustness to species imbalances. The study highlights the performance of individual nets and ensembles on multiple plankton datasets, and discusses the potential for generalizing this approach to broader aquatic ecosystems. Experiments demonstrate that combining diverse neural network models in a heterogeneous ensemble significantly improves performance with respect to other state-of-the-art approaches across all the problems considered. Final results show that the ensemble-based approach achieves a remarkable accuracy improvement over individual CNN models and over standalone Vision Transformers.
2025
Nanni, L., Lumini, A., Barcellona, L., Ghidoni, S. (2025). Convolutional neural networks and vision transformers for Plankton Classification. ECOLOGICAL INFORMATICS, 90, 1-19 [10.1016/j.ecoinf.2025.103272].
Nanni, L.; Lumini, A.; Barcellona, L.; Ghidoni, S.
File in questo prodotto:
File Dimensione Formato  
main.pdf

accesso aperto

Tipo: Versione (PDF) editoriale / Version Of Record
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale - Non opere derivate (CCBYNCND)
Dimensione 2.96 MB
Formato Adobe PDF
2.96 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1045777
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 1
social impact