The ability to detect and characterize bacteria within a biological sample is crucial for the monitoring of infections and epidemics, as well as for the study of human health and its relationship with commensal microorganisms. To this aim, a commonly used technique is the 16S rRNA gene targeted sequencing. PCR-amplified 16S sequences derived from the sample of interest are usually clustered into the so-called Operational Taxonomic Units (OTUs) based on pairwise similarities. Then, representative OTU sequences are compared with reference (human-made) databases to derive their phylogeny and taxonomic classification. Here, we propose a new reference-free approach to define the phylogenetic distance between bacteria based on protein domains, which are the evolving units of proteins. We extract the protein domain profiles of 3368 bacterial genomes and we use an ecological approach to model their Relative Species Abundance distribution. Based on the model parameters, we then derive a new measurement of phylogenetic distance. Finally, we show that such model-based distance is capable of detecting differences between bacteria in cases in which the 16S rRNA-based method fails, providing a possibly complementary approach , which is particularly promising for the analysis of bacterial populations measured by shotgun sequencing.

Intraspecies characterization of bacteria via evolutionary modeling of protein domains / Budimir, Iva; Giampieri, Enrico; Saccenti, Edoardo; Suarez-Diez, Maria; Tarozzi, Martina; Dall'Olio, Daniele; Merlotti, Alessandra; Curti, Nico; Remondini, Daniel; Castellani, Gastone; Sala, Claudia. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - ELETTRONICO. - 12:1(2022), pp. 16595.1-16595.12. [10.1038/s41598-022-21036-3]

Intraspecies characterization of bacteria via evolutionary modeling of protein domains

Budimir, Iva
Primo
;
Giampieri, Enrico;Tarozzi, Martina;Dall'Olio, Daniele;Merlotti, Alessandra;Curti, Nico;Remondini, Daniel;Castellani, Gastone
;
Sala, Claudia
Ultimo
2022

Abstract

The ability to detect and characterize bacteria within a biological sample is crucial for the monitoring of infections and epidemics, as well as for the study of human health and its relationship with commensal microorganisms. To this aim, a commonly used technique is the 16S rRNA gene targeted sequencing. PCR-amplified 16S sequences derived from the sample of interest are usually clustered into the so-called Operational Taxonomic Units (OTUs) based on pairwise similarities. Then, representative OTU sequences are compared with reference (human-made) databases to derive their phylogeny and taxonomic classification. Here, we propose a new reference-free approach to define the phylogenetic distance between bacteria based on protein domains, which are the evolving units of proteins. We extract the protein domain profiles of 3368 bacterial genomes and we use an ecological approach to model their Relative Species Abundance distribution. Based on the model parameters, we then derive a new measurement of phylogenetic distance. Finally, we show that such model-based distance is capable of detecting differences between bacteria in cases in which the 16S rRNA-based method fails, providing a possibly complementary approach , which is particularly promising for the analysis of bacterial populations measured by shotgun sequencing.
2022
Intraspecies characterization of bacteria via evolutionary modeling of protein domains / Budimir, Iva; Giampieri, Enrico; Saccenti, Edoardo; Suarez-Diez, Maria; Tarozzi, Martina; Dall'Olio, Daniele; Merlotti, Alessandra; Curti, Nico; Remondini, Daniel; Castellani, Gastone; Sala, Claudia. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - ELETTRONICO. - 12:1(2022), pp. 16595.1-16595.12. [10.1038/s41598-022-21036-3]
Budimir, Iva; Giampieri, Enrico; Saccenti, Edoardo; Suarez-Diez, Maria; Tarozzi, Martina; Dall'Olio, Daniele; Merlotti, Alessandra; Curti, Nico; Remondini, Daniel; Castellani, Gastone; Sala, Claudia
File in questo prodotto:
File Dimensione Formato  
s41598-022-21036-3.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 2.7 MB
Formato Adobe PDF
2.7 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/903909
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact