Cargando…
Decoding the Cauzin Softstrip: a case study in extracting information from old media
Having content in an archive is of limited value if it cannot be read and used. As a case study of extricating information from obsolete media, making it readable once again through deep learning techniques, we examine the Cauzin Softstrip: one of the first two-dimensional bar codes, released in 198...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8591774/ https://www.ncbi.nlm.nih.gov/pubmed/34803479 http://dx.doi.org/10.1007/s10502-021-09358-z |
_version_ | 1784599321928269824 |
---|---|
author | Reimsbach, Michael Aycock, John |
author_facet | Reimsbach, Michael Aycock, John |
author_sort | Reimsbach, Michael |
collection | PubMed |
description | Having content in an archive is of limited value if it cannot be read and used. As a case study of extricating information from obsolete media, making it readable once again through deep learning techniques, we examine the Cauzin Softstrip: one of the first two-dimensional bar codes, released in 1985 by Cauzin Systems, which could be used for encoding all manner of digital data. Softstrips occupy a curious middle ground, as they were both physical and digital. The bar codes were printed on paper, and in that sense are no different in an archival way than any printed material. Softstrips can be found in old computer magazines, computer books, and booklets of software Cauzin produced. However, managing the digital nature of these physical artifacts falls within the scope of digital curation. To make the information on them readable and useful, the digital information needs to be extracted, which originally would have occurred using a physical Cauzin Softstrip reader. Obtaining a working Softstrip reader is already extremely difficult and will most likely be impossible in the coming years. In order to extract the encoded data, we created a digital Softstrip reader, making Softstrip data accessible without needing a physical reader. Our decoding strategy is able to decode over 91% of the 1229 Softstrips in our Softstrip corpus; this rises to 99% if we only consider Softstrip images produced under controlled conditions. Furthermore, we later acquired another set of 117 Softstrips and we were able to decode nearly 95% of them with no adjustments to the decoder. These excellent results underscore the fact that technology like deep learning is readily accessible to non-experts; we obtained these results using a convolutional neural network, even though neither of the authors are expert in the area. |
format | Online Article Text |
id | pubmed-8591774 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-85917742021-11-19 Decoding the Cauzin Softstrip: a case study in extracting information from old media Reimsbach, Michael Aycock, John Arch Sci (Dordr) Original Paper Having content in an archive is of limited value if it cannot be read and used. As a case study of extricating information from obsolete media, making it readable once again through deep learning techniques, we examine the Cauzin Softstrip: one of the first two-dimensional bar codes, released in 1985 by Cauzin Systems, which could be used for encoding all manner of digital data. Softstrips occupy a curious middle ground, as they were both physical and digital. The bar codes were printed on paper, and in that sense are no different in an archival way than any printed material. Softstrips can be found in old computer magazines, computer books, and booklets of software Cauzin produced. However, managing the digital nature of these physical artifacts falls within the scope of digital curation. To make the information on them readable and useful, the digital information needs to be extracted, which originally would have occurred using a physical Cauzin Softstrip reader. Obtaining a working Softstrip reader is already extremely difficult and will most likely be impossible in the coming years. In order to extract the encoded data, we created a digital Softstrip reader, making Softstrip data accessible without needing a physical reader. Our decoding strategy is able to decode over 91% of the 1229 Softstrips in our Softstrip corpus; this rises to 99% if we only consider Softstrip images produced under controlled conditions. Furthermore, we later acquired another set of 117 Softstrips and we were able to decode nearly 95% of them with no adjustments to the decoder. These excellent results underscore the fact that technology like deep learning is readily accessible to non-experts; we obtained these results using a convolutional neural network, even though neither of the authors are expert in the area. Springer Netherlands 2021-02-25 2021 /pmc/articles/PMC8591774/ /pubmed/34803479 http://dx.doi.org/10.1007/s10502-021-09358-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Paper Reimsbach, Michael Aycock, John Decoding the Cauzin Softstrip: a case study in extracting information from old media |
title | Decoding the Cauzin Softstrip: a case study in extracting information from old media |
title_full | Decoding the Cauzin Softstrip: a case study in extracting information from old media |
title_fullStr | Decoding the Cauzin Softstrip: a case study in extracting information from old media |
title_full_unstemmed | Decoding the Cauzin Softstrip: a case study in extracting information from old media |
title_short | Decoding the Cauzin Softstrip: a case study in extracting information from old media |
title_sort | decoding the cauzin softstrip: a case study in extracting information from old media |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8591774/ https://www.ncbi.nlm.nih.gov/pubmed/34803479 http://dx.doi.org/10.1007/s10502-021-09358-z |
work_keys_str_mv | AT reimsbachmichael decodingthecauzinsoftstripacasestudyinextractinginformationfromoldmedia AT aycockjohn decodingthecauzinsoftstripacasestudyinextractinginformationfromoldmedia |