Faunal remains from archaeological sites allow us to identify the animal species present and understand the relationships between humans and animals, not only from their morphological information, but also from the ancient biomolecules (lipids, proteins, and DNA) preserved in these remains for thousands and even millions of years. However, due to the costs and efforts required for ancient biomolecular analysis, there has been considerable research into development of accurate and efficient screening approaches for archaeological biomolecular analysis. FTIR spectroscopy is one such approach that has been considered for screening of proteins, but its widespread use has been hindered by the fact that its predictive accuracy can vary widely depending on the extent of sample preservation and the instrument used. Further, screening methods for ancient DNA (aDNA) analysis are scarce. Here we present a new approach to vastly improve upon FTIR-based screening methods prior to ZooMS and aDNA analysis through the use of random forest-based machine learning. To do so, we use ATR-FTIR to examine three sets of archaeological bone assemblages and and analyse them by ZooMS (Zooarchaeology by Mass Spectrometry; allows us to identify the species of the bones). Two of these are from Palaeolithic contexts, dominated by terrestrial fauna and include specimens with a variety of preservational conditions. The third set consists of Holocene faunal remains, with variable levels of preservation and is dominated by cetaceans. Using the Holocene faunal remains, we were able to more consistently evaluate ATR-FTIR based screening for mtDNA as well as ZooMS success. We report the first successful use of machine learning in ATR-FTIR-based screening technique for ancient mtDNA analysis, and our machine learning models conclusively improve the accuracy prior to usage of ATR-FTIR-based screening for ZooMS by 20-40%. The results also suggest this approach potentially allows for a universal screening system, applicable across multiple sites and largely independent of the spectrometers used.

Faunal remains from archaeological sites allow for the identification of animal species that enables the better understanding of the relationships between humans and animals, not only from their morphological information, but also from the ancient biomolecules (lipids, proteins, and DNA) preserved in these remains for thousands and even millions of years. However, due to the costs and efforts required for ancient biomolecular analysis, there has been considerable research into development of accurate and efficient screening approaches for archaeological remains. FTIR spectroscopy is one such approach that has been considered for screening of proteins, but its widespread use has been hindered by the fact that its predictive accuracy can vary widely depending on the extent of sample preservation and the instrument used. Further, screening methods for ancient DNA (aDNA) analysis are scarce. Here we present a new approach to vastly improve upon FTIR-based screening methods prior to ZooMS (Zooarchaeology by Mass Spectrometry) and aDNA analysis through the use of random forest-based machine learning. To do so, we use ATR-FTIR to examine three sets of archaeological bone assemblages and analyse them by ZooMS (for taxonomic identification). Two of these are from Palaeolithic contexts, dominated by terrestrial fauna and include specimens with a variety of preservational conditions. The third set consists of Holocene faunal remains, with variable levels of preservation and is dominated by cetaceans. Using the Holocene faunal remains, we were able to more consistently evaluate ATR-FTIR-based screening for mtDNA as well as ZooMS success. We report on the potential of machine learning in ATR-FTIR-based screening for ancient mtDNA analysis, and our machine learning models conclusively improve the accuracy prior to usage of ATR-FTIR-based screening for ZooMS by 20–40%. The results also suggest this approach potentially allows for a universal screening system, applicable across multiple sites and largely independent of the spectrometers used.

Pal Chowdhury, M., Choudhury, K.D., Bouchard, G.P., Riel-Salvatore, J., Negrino, F., Benazzi, S., et al. (2021). Machine learning ATR-FTIR spectroscopy data for the screening of collagen for ZooMS analysis and mtDNA in archaeological bone. JOURNAL OF ARCHAEOLOGICAL SCIENCE, 126, 1-13 [10.1016/j.jas.2020.105311].

Machine learning ATR-FTIR spectroscopy data for the screening of collagen for ZooMS analysis and mtDNA in archaeological bone

Benazzi, Stefano;
2021

Abstract

Faunal remains from archaeological sites allow for the identification of animal species that enables the better understanding of the relationships between humans and animals, not only from their morphological information, but also from the ancient biomolecules (lipids, proteins, and DNA) preserved in these remains for thousands and even millions of years. However, due to the costs and efforts required for ancient biomolecular analysis, there has been considerable research into development of accurate and efficient screening approaches for archaeological remains. FTIR spectroscopy is one such approach that has been considered for screening of proteins, but its widespread use has been hindered by the fact that its predictive accuracy can vary widely depending on the extent of sample preservation and the instrument used. Further, screening methods for ancient DNA (aDNA) analysis are scarce. Here we present a new approach to vastly improve upon FTIR-based screening methods prior to ZooMS (Zooarchaeology by Mass Spectrometry) and aDNA analysis through the use of random forest-based machine learning. To do so, we use ATR-FTIR to examine three sets of archaeological bone assemblages and analyse them by ZooMS (for taxonomic identification). Two of these are from Palaeolithic contexts, dominated by terrestrial fauna and include specimens with a variety of preservational conditions. The third set consists of Holocene faunal remains, with variable levels of preservation and is dominated by cetaceans. Using the Holocene faunal remains, we were able to more consistently evaluate ATR-FTIR-based screening for mtDNA as well as ZooMS success. We report on the potential of machine learning in ATR-FTIR-based screening for ancient mtDNA analysis, and our machine learning models conclusively improve the accuracy prior to usage of ATR-FTIR-based screening for ZooMS by 20–40%. The results also suggest this approach potentially allows for a universal screening system, applicable across multiple sites and largely independent of the spectrometers used.
2021
Pal Chowdhury, M., Choudhury, K.D., Bouchard, G.P., Riel-Salvatore, J., Negrino, F., Benazzi, S., et al. (2021). Machine learning ATR-FTIR spectroscopy data for the screening of collagen for ZooMS analysis and mtDNA in archaeological bone. JOURNAL OF ARCHAEOLOGICAL SCIENCE, 126, 1-13 [10.1016/j.jas.2020.105311].
Pal Chowdhury, Manasij; Choudhury, Kaustabh Datta; Bouchard, Geneviève Pothier; Riel-Salvatore, Julien; Negrino, Fabio; Benazzi, Stefano; Slimak, Ludo...espandi
File in questo prodotto:
File Dimensione Formato  
Chowdhury_et_al_2021_accepted_manuscript.pdf

accesso aperto

Descrizione: Peer-reviewed accepted manuscript
Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale - Non opere derivate (CCBYNCND)
Dimensione 666.61 kB
Formato Adobe PDF
666.61 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/820667
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 13
  • ???jsp.display-item.citation.isi??? 12
social impact