Cargando…

Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books

[Image: see text] Non-destructive, fast, and accurate methods of dating are highly desirable for many heritage objects. Here, we present and critically evaluate the use of near-infrared (NIR) spectroscopic data combined with three supervised machine learning methods to predict the publication year o...

Descripción completa

Detalles Bibliográficos
Autores principales: Coppola, Floriana, Frigau, Luca, Markelj, Jernej, Malešič, Jasna, Conversano, Claudio, Strlič, Matija
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10251519/
https://www.ncbi.nlm.nih.gov/pubmed/37216468
http://dx.doi.org/10.1021/jacs.3c02835
_version_ 1785055964579233792
author Coppola, Floriana
Frigau, Luca
Markelj, Jernej
Malešič, Jasna
Conversano, Claudio
Strlič, Matija
author_facet Coppola, Floriana
Frigau, Luca
Markelj, Jernej
Malešič, Jasna
Conversano, Claudio
Strlič, Matija
author_sort Coppola, Floriana
collection PubMed
description [Image: see text] Non-destructive, fast, and accurate methods of dating are highly desirable for many heritage objects. Here, we present and critically evaluate the use of near-infrared (NIR) spectroscopic data combined with three supervised machine learning methods to predict the publication year of paper books dated between 1851 and 2000. These methods provide different accuracies; however, we demonstrate that the underlying processes refer to common spectral features. Regardless of the machine learning method used, the most informative wavelength ranges can be associated with C–H and O–H stretching first overtone, typical of the cellulose structure, and N–H stretching first overtone from amide/protein structures. We find that the expected influence of degradation on the accuracy of prediction is not meaningful. The variance-bias decomposition of the reducible error reveals some differences among the three machine learning methods. Our results show that two out of the three methods allow predictions of publication dates in the period 1851–2000 from NIR spectroscopic data with an unprecedented accuracy of up to 2 years, better than any other non-destructive method applied to a real heritage collection.
format Online
Article
Text
id pubmed-10251519
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-102515192023-06-10 Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books Coppola, Floriana Frigau, Luca Markelj, Jernej Malešič, Jasna Conversano, Claudio Strlič, Matija J Am Chem Soc [Image: see text] Non-destructive, fast, and accurate methods of dating are highly desirable for many heritage objects. Here, we present and critically evaluate the use of near-infrared (NIR) spectroscopic data combined with three supervised machine learning methods to predict the publication year of paper books dated between 1851 and 2000. These methods provide different accuracies; however, we demonstrate that the underlying processes refer to common spectral features. Regardless of the machine learning method used, the most informative wavelength ranges can be associated with C–H and O–H stretching first overtone, typical of the cellulose structure, and N–H stretching first overtone from amide/protein structures. We find that the expected influence of degradation on the accuracy of prediction is not meaningful. The variance-bias decomposition of the reducible error reveals some differences among the three machine learning methods. Our results show that two out of the three methods allow predictions of publication dates in the period 1851–2000 from NIR spectroscopic data with an unprecedented accuracy of up to 2 years, better than any other non-destructive method applied to a real heritage collection. American Chemical Society 2023-05-22 /pmc/articles/PMC10251519/ /pubmed/37216468 http://dx.doi.org/10.1021/jacs.3c02835 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Coppola, Floriana
Frigau, Luca
Markelj, Jernej
Malešič, Jasna
Conversano, Claudio
Strlič, Matija
Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books
title Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books
title_full Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books
title_fullStr Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books
title_full_unstemmed Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books
title_short Near-Infrared Spectroscopy and Machine Learning for Accurate Dating of Historical Books
title_sort near-infrared spectroscopy and machine learning for accurate dating of historical books
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10251519/
https://www.ncbi.nlm.nih.gov/pubmed/37216468
http://dx.doi.org/10.1021/jacs.3c02835
work_keys_str_mv AT coppolafloriana nearinfraredspectroscopyandmachinelearningforaccuratedatingofhistoricalbooks
AT frigauluca nearinfraredspectroscopyandmachinelearningforaccuratedatingofhistoricalbooks
AT markeljjernej nearinfraredspectroscopyandmachinelearningforaccuratedatingofhistoricalbooks
AT malesicjasna nearinfraredspectroscopyandmachinelearningforaccuratedatingofhistoricalbooks
AT conversanoclaudio nearinfraredspectroscopyandmachinelearningforaccuratedatingofhistoricalbooks
AT strlicmatija nearinfraredspectroscopyandmachinelearningforaccuratedatingofhistoricalbooks