Cargando…

Decoding the Cauzin Softstrip: a case study in extracting information from old media

Having content in an archive is of limited value if it cannot be read and used. As a case study of extricating information from obsolete media, making it readable once again through deep learning techniques, we examine the Cauzin Softstrip: one of the first two-dimensional bar codes, released in 198...

Descripción completa

Detalles Bibliográficos
Autores principales: Reimsbach, Michael, Aycock, John
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8591774/
https://www.ncbi.nlm.nih.gov/pubmed/34803479
http://dx.doi.org/10.1007/s10502-021-09358-z
_version_ 1784599321928269824
author Reimsbach, Michael
Aycock, John
author_facet Reimsbach, Michael
Aycock, John
author_sort Reimsbach, Michael
collection PubMed
description Having content in an archive is of limited value if it cannot be read and used. As a case study of extricating information from obsolete media, making it readable once again through deep learning techniques, we examine the Cauzin Softstrip: one of the first two-dimensional bar codes, released in 1985 by Cauzin Systems, which could be used for encoding all manner of digital data. Softstrips occupy a curious middle ground, as they were both physical and digital. The bar codes were printed on paper, and in that sense are no different in an archival way than any printed material. Softstrips can be found in old computer magazines, computer books, and booklets of software Cauzin produced. However, managing the digital nature of these physical artifacts falls within the scope of digital curation. To make the information on them readable and useful, the digital information needs to be extracted, which originally would have occurred using a physical Cauzin Softstrip reader. Obtaining a working Softstrip reader is already extremely difficult and will most likely be impossible in the coming years. In order to extract the encoded data, we created a digital Softstrip reader, making Softstrip data accessible without needing a physical reader. Our decoding strategy is able to decode over 91% of the 1229 Softstrips in our Softstrip corpus; this rises to 99% if we only consider Softstrip images produced under controlled conditions. Furthermore, we later acquired another set of 117 Softstrips and we were able to decode nearly 95% of them with no adjustments to the decoder. These excellent results underscore the fact that technology like deep learning is readily accessible to non-experts; we obtained these results using a convolutional neural network, even though neither of the authors are expert in the area.
format Online
Article
Text
id pubmed-8591774
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-85917742021-11-19 Decoding the Cauzin Softstrip: a case study in extracting information from old media Reimsbach, Michael Aycock, John Arch Sci (Dordr) Original Paper Having content in an archive is of limited value if it cannot be read and used. As a case study of extricating information from obsolete media, making it readable once again through deep learning techniques, we examine the Cauzin Softstrip: one of the first two-dimensional bar codes, released in 1985 by Cauzin Systems, which could be used for encoding all manner of digital data. Softstrips occupy a curious middle ground, as they were both physical and digital. The bar codes were printed on paper, and in that sense are no different in an archival way than any printed material. Softstrips can be found in old computer magazines, computer books, and booklets of software Cauzin produced. However, managing the digital nature of these physical artifacts falls within the scope of digital curation. To make the information on them readable and useful, the digital information needs to be extracted, which originally would have occurred using a physical Cauzin Softstrip reader. Obtaining a working Softstrip reader is already extremely difficult and will most likely be impossible in the coming years. In order to extract the encoded data, we created a digital Softstrip reader, making Softstrip data accessible without needing a physical reader. Our decoding strategy is able to decode over 91% of the 1229 Softstrips in our Softstrip corpus; this rises to 99% if we only consider Softstrip images produced under controlled conditions. Furthermore, we later acquired another set of 117 Softstrips and we were able to decode nearly 95% of them with no adjustments to the decoder. These excellent results underscore the fact that technology like deep learning is readily accessible to non-experts; we obtained these results using a convolutional neural network, even though neither of the authors are expert in the area. Springer Netherlands 2021-02-25 2021 /pmc/articles/PMC8591774/ /pubmed/34803479 http://dx.doi.org/10.1007/s10502-021-09358-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Paper
Reimsbach, Michael
Aycock, John
Decoding the Cauzin Softstrip: a case study in extracting information from old media
title Decoding the Cauzin Softstrip: a case study in extracting information from old media
title_full Decoding the Cauzin Softstrip: a case study in extracting information from old media
title_fullStr Decoding the Cauzin Softstrip: a case study in extracting information from old media
title_full_unstemmed Decoding the Cauzin Softstrip: a case study in extracting information from old media
title_short Decoding the Cauzin Softstrip: a case study in extracting information from old media
title_sort decoding the cauzin softstrip: a case study in extracting information from old media
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8591774/
https://www.ncbi.nlm.nih.gov/pubmed/34803479
http://dx.doi.org/10.1007/s10502-021-09358-z
work_keys_str_mv AT reimsbachmichael decodingthecauzinsoftstripacasestudyinextractinginformationfromoldmedia
AT aycockjohn decodingthecauzinsoftstripacasestudyinextractinginformationfromoldmedia