Cargando…
A Characterization of the DNA Data Storage Channel
Owing to its longevity and enormous information density, DNA, the molecule encoding biological information, has emerged as a promising archival storage medium. However, due to technological constraints, data can only be written onto many short DNA molecules that are stored in an unordered way, and c...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6609604/ https://www.ncbi.nlm.nih.gov/pubmed/31273225 http://dx.doi.org/10.1038/s41598-019-45832-6 |
_version_ | 1783432342069575680 |
---|---|
author | Heckel, Reinhard Mikutis, Gediminas Grass, Robert N. |
author_facet | Heckel, Reinhard Mikutis, Gediminas Grass, Robert N. |
author_sort | Heckel, Reinhard |
collection | PubMed |
description | Owing to its longevity and enormous information density, DNA, the molecule encoding biological information, has emerged as a promising archival storage medium. However, due to technological constraints, data can only be written onto many short DNA molecules that are stored in an unordered way, and can only be read by sampling from this DNA pool. Moreover, imperfections in writing (synthesis), reading (sequencing), storage, and handling of the DNA, in particular amplification via PCR, lead to a loss of DNA molecules and induce errors within the molecules. In order to design DNA storage systems, a qualitative and quantitative understanding of the errors and the loss of molecules is crucial. In this paper, we characterize those error probabilities by analyzing data from our own experiments as well as from experiments of two different groups. We find that errors within molecules are mainly due to synthesis and sequencing, while imperfections in handling and storage lead to a significant loss of sequences. The aim of our study is to help guide the design of future DNA data storage systems by providing a quantitative and qualitative understanding of the DNA data storage channel. |
format | Online Article Text |
id | pubmed-6609604 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-66096042019-07-14 A Characterization of the DNA Data Storage Channel Heckel, Reinhard Mikutis, Gediminas Grass, Robert N. Sci Rep Article Owing to its longevity and enormous information density, DNA, the molecule encoding biological information, has emerged as a promising archival storage medium. However, due to technological constraints, data can only be written onto many short DNA molecules that are stored in an unordered way, and can only be read by sampling from this DNA pool. Moreover, imperfections in writing (synthesis), reading (sequencing), storage, and handling of the DNA, in particular amplification via PCR, lead to a loss of DNA molecules and induce errors within the molecules. In order to design DNA storage systems, a qualitative and quantitative understanding of the errors and the loss of molecules is crucial. In this paper, we characterize those error probabilities by analyzing data from our own experiments as well as from experiments of two different groups. We find that errors within molecules are mainly due to synthesis and sequencing, while imperfections in handling and storage lead to a significant loss of sequences. The aim of our study is to help guide the design of future DNA data storage systems by providing a quantitative and qualitative understanding of the DNA data storage channel. Nature Publishing Group UK 2019-07-04 /pmc/articles/PMC6609604/ /pubmed/31273225 http://dx.doi.org/10.1038/s41598-019-45832-6 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Heckel, Reinhard Mikutis, Gediminas Grass, Robert N. A Characterization of the DNA Data Storage Channel |
title | A Characterization of the DNA Data Storage Channel |
title_full | A Characterization of the DNA Data Storage Channel |
title_fullStr | A Characterization of the DNA Data Storage Channel |
title_full_unstemmed | A Characterization of the DNA Data Storage Channel |
title_short | A Characterization of the DNA Data Storage Channel |
title_sort | characterization of the dna data storage channel |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6609604/ https://www.ncbi.nlm.nih.gov/pubmed/31273225 http://dx.doi.org/10.1038/s41598-019-45832-6 |
work_keys_str_mv | AT heckelreinhard acharacterizationofthednadatastoragechannel AT mikutisgediminas acharacterizationofthednadatastoragechannel AT grassrobertn acharacterizationofthednadatastoragechannel AT heckelreinhard characterizationofthednadatastoragechannel AT mikutisgediminas characterizationofthednadatastoragechannel AT grassrobertn characterizationofthednadatastoragechannel |