Non-destructive, fast, and accurate methods of dating are highly desirable for many heritage objects. Here, we present and critically evaluate the use of near-infrared (NIR) spectroscopic data combined with three supervised machine learning methods to predict the publication year of paper books dated between 1851 and 2000. These methods provide different accuracies; however, we demonstrate that the underlying processes refer to common spectral features. Regardless of the machine learning method used, the most informative wavelength ranges can be associated with C-H and O-H stretching first overtone, typical of the cellulose structure, and N-H stretching first overtone from amide/protein structures. We find that the expected influence of degradation on the accuracy of prediction is not meaningful. The variance-bias decomposition of the reducible error reveals some differences among the three machine learning methods. Our results show that two out of the three methods allow predictions of publication dates in the period 1851-2000 from NIR spectroscopic data with an unprecedented accuracy of up to 2 years, better than any other non-destructive method applied to a real heritage collection.
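The variance-bias decomposition mentioned in the abstract can be illustrated with a small numerical sketch. Everything below is an assumption made for illustration: the data are synthetic, the single feature `x` is a hypothetical stand-in for a spectral predictor, and the two polynomial regressors are placeholders, since the abstract does not name the three machine learning methods. The sketch estimates squared bias and variance by bootstrap resampling of the training set and checks that they sum to the reducible error.

```python
import numpy as np

rng = np.random.default_rng(0)

def true_year(x):
    # Hypothetical smooth relation between one spectral feature and year
    # (an assumption; not taken from the paper).
    return 1851.0 + 149.0 * x ** 2

# Synthetic training data: 60 "books" with 5 years of noise on the label.
x_train = rng.uniform(0.0, 1.0, size=60)
y_train = true_year(x_train) + rng.normal(0.0, 5.0, size=60)

# Noise-free test targets, so only the reducible error is measured.
x_test = np.linspace(0.0, 1.0, 50)
f_test = true_year(x_test)

def poly_model(degree):
    """Return a fit-and-predict function for a polynomial of given degree."""
    def fit_predict(x, y, x_new):
        coeffs = np.polyfit(x, y, degree)
        return np.polyval(coeffs, x_new)
    return fit_predict

def decompose(fit_predict, n_rounds=200, seed=1):
    """Bootstrap estimate of (squared bias, variance, reducible error)."""
    boot = np.random.default_rng(seed)
    n = len(x_train)
    preds = np.empty((n_rounds, len(x_test)))
    for r in range(n_rounds):
        idx = boot.integers(0, n, size=n)  # resample with replacement
        preds[r] = fit_predict(x_train[idx], y_train[idx], x_test)
    mean_pred = preds.mean(axis=0)
    bias_sq = np.mean((mean_pred - f_test) ** 2)   # systematic error
    variance = np.mean(preds.var(axis=0))          # spread across resamples
    reducible = np.mean((preds - f_test) ** 2)     # total reducible error
    return bias_sq, variance, reducible

rigid = decompose(poly_model(1))     # straight line: cannot follow curvature
flexible = decompose(poly_model(5))  # degree-5 polynomial: more flexible

for name, (b, v, r) in [("degree 1", rigid), ("degree 5", flexible)]:
    print(f"{name}: bias^2={b:.2f}  variance={v:.2f}  reducible={r:.2f}")
```

In this synthetic setup the straight-line model should show the larger squared bias, because it cannot follow the curved relation. Comparing such terms across methods is one way to see where each method's error comes from, which is the kind of comparison the abstract describes.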

Coppola, F., Frigau, L., Markelj, J., Malešič, J., Conversano, C., Strlič, M. (2023). Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 145(22), 12305-12314 [10.1021/jacs.3c02835].

Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books

Coppola, Floriana (first author); 2023

Coppola, Floriana; Frigau, Luca; Markelj, Jernej; Malešič, Jasna; Conversano, Claudio; Strlič, Matija

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/1008144
