Cargando…

Probing the physical limits of reliable DNA data retrieval

Synthetic DNA is gaining momentum as a potential storage medium for archival data storage. In this process, digital information is translated into sequences of nucleotides and the resulting synthetic DNA strands are then stored for later retrieval. Here, we demonstrate reliable file recovery with PC...

Descripción completa

Detalles Bibliográficos
Autores principales: Organick, Lee, Chen, Yuan-Jyue, Dumas Ang, Siena, Lopez, Randolph, Liu, Xiaomeng, Strauss, Karin, Ceze, Luis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992699/
https://www.ncbi.nlm.nih.gov/pubmed/32001691
http://dx.doi.org/10.1038/s41467-020-14319-8
_version_ 1783492886061383680
author Organick, Lee
Chen, Yuan-Jyue
Dumas Ang, Siena
Lopez, Randolph
Liu, Xiaomeng
Strauss, Karin
Ceze, Luis
author_facet Organick, Lee
Chen, Yuan-Jyue
Dumas Ang, Siena
Lopez, Randolph
Liu, Xiaomeng
Strauss, Karin
Ceze, Luis
author_sort Organick, Lee
collection PubMed
description Synthetic DNA is gaining momentum as a potential storage medium for archival data storage. In this process, digital information is translated into sequences of nucleotides and the resulting synthetic DNA strands are then stored for later retrieval. Here, we demonstrate reliable file recovery with PCR-based random access when as few as ten copies per sequence are stored, on average. This results in density of about 17 exabytes/gram, nearly two orders of magnitude greater than prior work has shown. We successfully retrieve the same data in a complex pool of over 10(10) unique sequences per microliter with no evidence that we have begun to approach complexity limits. Finally, we also investigate the effects of file size and sequencing coverage on successful file retrieval and look for systematic DNA strand drop out. These findings substantiate the robustness and high data density of the process examined here.
format Online
Article
Text
id pubmed-6992699
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-69926992020-02-03 Probing the physical limits of reliable DNA data retrieval Organick, Lee Chen, Yuan-Jyue Dumas Ang, Siena Lopez, Randolph Liu, Xiaomeng Strauss, Karin Ceze, Luis Nat Commun Article Synthetic DNA is gaining momentum as a potential storage medium for archival data storage. In this process, digital information is translated into sequences of nucleotides and the resulting synthetic DNA strands are then stored for later retrieval. Here, we demonstrate reliable file recovery with PCR-based random access when as few as ten copies per sequence are stored, on average. This results in density of about 17 exabytes/gram, nearly two orders of magnitude greater than prior work has shown. We successfully retrieve the same data in a complex pool of over 10(10) unique sequences per microliter with no evidence that we have begun to approach complexity limits. Finally, we also investigate the effects of file size and sequencing coverage on successful file retrieval and look for systematic DNA strand drop out. These findings substantiate the robustness and high data density of the process examined here. Nature Publishing Group UK 2020-01-30 /pmc/articles/PMC6992699/ /pubmed/32001691 http://dx.doi.org/10.1038/s41467-020-14319-8 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Organick, Lee
Chen, Yuan-Jyue
Dumas Ang, Siena
Lopez, Randolph
Liu, Xiaomeng
Strauss, Karin
Ceze, Luis
Probing the physical limits of reliable DNA data retrieval
title Probing the physical limits of reliable DNA data retrieval
title_full Probing the physical limits of reliable DNA data retrieval
title_fullStr Probing the physical limits of reliable DNA data retrieval
title_full_unstemmed Probing the physical limits of reliable DNA data retrieval
title_short Probing the physical limits of reliable DNA data retrieval
title_sort probing the physical limits of reliable dna data retrieval
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992699/
https://www.ncbi.nlm.nih.gov/pubmed/32001691
http://dx.doi.org/10.1038/s41467-020-14319-8
work_keys_str_mv AT organicklee probingthephysicallimitsofreliablednadataretrieval
AT chenyuanjyue probingthephysicallimitsofreliablednadataretrieval
AT dumasangsiena probingthephysicallimitsofreliablednadataretrieval
AT lopezrandolph probingthephysicallimitsofreliablednadataretrieval
AT liuxiaomeng probingthephysicallimitsofreliablednadataretrieval
AT strausskarin probingthephysicallimitsofreliablednadataretrieval
AT cezeluis probingthephysicallimitsofreliablednadataretrieval