In microbial community studies, analyzing diversity is crucial for uncovering ecological complexity. However, the intrinsic characteristics of Next-gen sequencing data challenge the use of Euclidean metrics for estimating proximity and correlation. Consequently, a variety of distance measures have been developed within ecological frameworks. In this study, we compare several of these metrics—including Bray–Curtis, Canberra, Jensen–Shannon, Hellinger, Euclidean, and Aitchison distances—demonstrating how the choice of metric can significantly influence the interpretation of microbial community structures. Among these, Aitchison distance specifically defined for compositional data shows markedly different behavior from the others, highlighting different features related to the data. We consider two real-world examples: the human gut microbiome sampled using 16S rRNA sequencing with multiple measurements for different patients (G-HMP2) and urban sewage environmental metagenomes collected over time at different sites through shotgun sequencing (E-WADES). We show that, for the same dataset—independently on the sequencing technique or on the sampling context—the community structure depends strongly on the choice of specific metrics. This can be explained by the mathematical properties of the chosen metrics and the specific characteristics of microbiome data, namely their high heterogeneity in species abundance. This provides clear insights into how distance metrics influence interpretation and assists in choosing the most appropriate one for the study objectives.

Fuschi, A., Merlotti, A., Remondini, D. (2025). Microbiome data: tell me which metrics and I will tell you which communities. ISME COMMUNICATIONS, 5(1), 1-4 [10.1093/ismeco/ycaf125].

Microbiome data: tell me which metrics and I will tell you which communities

Fuschi A.;Merlotti A.;Remondini D.
2025

Abstract

In microbial community studies, analyzing diversity is crucial for uncovering ecological complexity. However, the intrinsic characteristics of Next-gen sequencing data challenge the use of Euclidean metrics for estimating proximity and correlation. Consequently, a variety of distance measures have been developed within ecological frameworks. In this study, we compare several of these metrics—including Bray–Curtis, Canberra, Jensen–Shannon, Hellinger, Euclidean, and Aitchison distances—demonstrating how the choice of metric can significantly influence the interpretation of microbial community structures. Among these, Aitchison distance specifically defined for compositional data shows markedly different behavior from the others, highlighting different features related to the data. We consider two real-world examples: the human gut microbiome sampled using 16S rRNA sequencing with multiple measurements for different patients (G-HMP2) and urban sewage environmental metagenomes collected over time at different sites through shotgun sequencing (E-WADES). We show that, for the same dataset—independently on the sequencing technique or on the sampling context—the community structure depends strongly on the choice of specific metrics. This can be explained by the mathematical properties of the chosen metrics and the specific characteristics of microbiome data, namely their high heterogeneity in species abundance. This provides clear insights into how distance metrics influence interpretation and assists in choosing the most appropriate one for the study objectives.
2025
Fuschi, A., Merlotti, A., Remondini, D. (2025). Microbiome data: tell me which metrics and I will tell you which communities. ISME COMMUNICATIONS, 5(1), 1-4 [10.1093/ismeco/ycaf125].
Fuschi, A.; Merlotti, A.; Remondini, D.
File in questo prodotto:
File Dimensione Formato  
microbiome metrics_Fuschi Merlotti Remondini_ISMEcomm25.pdf

accesso aperto

Tipo: Versione (PDF) editoriale / Version Of Record
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 797.43 kB
Formato Adobe PDF
797.43 kB Adobe PDF Visualizza/Apri
microbiome_metrics_supplementary_ycaf125.pdf

accesso aperto

Tipo: File Supplementare
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 1.08 MB
Formato Adobe PDF
1.08 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1044642
Citazioni
  • ???jsp.display-item.citation.pmc??? 4
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact