Cargando…

Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data

Co-authored by a Computer Scientist and a Digital Humanist, this article examines the challenges faced by cultural heritage institutions in the digital age, which have led to the closure of the vast majority of born-digital archival collections. It focuses particularly on cultural organizations such...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jaillant, Lise, Caputo, Annalina
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer London 2022
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8754190/ https://www.ncbi.nlm.nih.gov/pubmed/35039719 http://dx.doi.org/10.1007/s00146-021-01367-x

_version_	1784632223071207424
author	Jaillant, Lise Caputo, Annalina
author_facet	Jaillant, Lise Caputo, Annalina
author_sort	Jaillant, Lise
collection	PubMed
description	Co-authored by a Computer Scientist and a Digital Humanist, this article examines the challenges faced by cultural heritage institutions in the digital age, which have led to the closure of the vast majority of born-digital archival collections. It focuses particularly on cultural organizations such as libraries, museums and archives, used by historians, literary scholars and other Humanities scholars. Most born-digital records held by cultural organizations are inaccessible due to privacy, copyright, commercial and technical issues. Even when born-digital data are publicly available (as in the case of web archives), users often need to physically travel to repositories such as the British Library or the Bibliothèque Nationale de France to consult web pages. Provided with enough sample data from which to learn and train their models, AI, and more specifically machine learning algorithms, offer the opportunity to improve and ease the access to digital archives by learning to perform complex human tasks. These vary from providing intelligent support for searching the archives to automate tedious and time-consuming tasks. In this article, we focus on sensitivity review as a practical solution to unlock digital archives that would allow archival institutions to make non-sensitive information available. This promise to make archives more accessible does not come free of warnings for potential pitfalls and risks: inherent errors, "black box" approaches that make the algorithm inscrutable, and risks related to bias, fake, or partial information. Our central argument is that AI can deliver its promise to make digital archival collections more accessible, but it also creates new challenges - particularly in terms of ethics. In the conclusion, we insist on the importance of fairness, accountability and transparency in the process of making digital archives more accessible.
format	Online Article Text
id	pubmed-8754190
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer London
record_format	MEDLINE/PubMed
spelling	pubmed-87541902022-01-13 Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data Jaillant, Lise Caputo, Annalina AI Soc Original Article Co-authored by a Computer Scientist and a Digital Humanist, this article examines the challenges faced by cultural heritage institutions in the digital age, which have led to the closure of the vast majority of born-digital archival collections. It focuses particularly on cultural organizations such as libraries, museums and archives, used by historians, literary scholars and other Humanities scholars. Most born-digital records held by cultural organizations are inaccessible due to privacy, copyright, commercial and technical issues. Even when born-digital data are publicly available (as in the case of web archives), users often need to physically travel to repositories such as the British Library or the Bibliothèque Nationale de France to consult web pages. Provided with enough sample data from which to learn and train their models, AI, and more specifically machine learning algorithms, offer the opportunity to improve and ease the access to digital archives by learning to perform complex human tasks. These vary from providing intelligent support for searching the archives to automate tedious and time-consuming tasks. In this article, we focus on sensitivity review as a practical solution to unlock digital archives that would allow archival institutions to make non-sensitive information available. This promise to make archives more accessible does not come free of warnings for potential pitfalls and risks: inherent errors, "black box" approaches that make the algorithm inscrutable, and risks related to bias, fake, or partial information. Our central argument is that AI can deliver its promise to make digital archival collections more accessible, but it also creates new challenges - particularly in terms of ethics. In the conclusion, we insist on the importance of fairness, accountability and transparency in the process of making digital archives more accessible. Springer London 2022-01-12 2022 /pmc/articles/PMC8754190/ /pubmed/35039719 http://dx.doi.org/10.1007/s00146-021-01367-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Original Article Jaillant, Lise Caputo, Annalina Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data
title	Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data
title_full	Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data
title_fullStr	Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data
title_full_unstemmed	Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data
title_short	Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data
title_sort	unlocking digital archives: cross-disciplinary perspectives on ai and born-digital data
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8754190/ https://www.ncbi.nlm.nih.gov/pubmed/35039719 http://dx.doi.org/10.1007/s00146-021-01367-x
work_keys_str_mv	AT jaillantlise unlockingdigitalarchivescrossdisciplinaryperspectivesonaiandborndigitaldata AT caputoannalina unlockingdigitalarchivescrossdisciplinaryperspectivesonaiandborndigitaldata

Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data

Ejemplares similares