Cargando…

Rewritable two-dimensional DNA-based data storage with machine learning reconstruction

DNA-based data storage platforms traditionally encode information only in the nucleotide sequence of the molecule. Here we report on a two-dimensional molecular data storage system that records information in both the sequence and the backbone structure of DNA and performs nontrivial joint data enco...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Chao, Tabatabaei, S. Kasra, Tabatabaei Yazdi, S. M. Hossein, Hernandez, Alvaro G., Schroeder, Charles M., Milenkovic, Olgica
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9142566/
https://www.ncbi.nlm.nih.gov/pubmed/35624096
http://dx.doi.org/10.1038/s41467-022-30140-x
_version_ 1784715601035395072
author Pan, Chao
Tabatabaei, S. Kasra
Tabatabaei Yazdi, S. M. Hossein
Hernandez, Alvaro G.
Schroeder, Charles M.
Milenkovic, Olgica
author_facet Pan, Chao
Tabatabaei, S. Kasra
Tabatabaei Yazdi, S. M. Hossein
Hernandez, Alvaro G.
Schroeder, Charles M.
Milenkovic, Olgica
author_sort Pan, Chao
collection PubMed
description DNA-based data storage platforms traditionally encode information only in the nucleotide sequence of the molecule. Here we report on a two-dimensional molecular data storage system that records information in both the sequence and the backbone structure of DNA and performs nontrivial joint data encoding, decoding and processing. Our 2DDNA method efficiently stores images in synthetic DNA and embeds pertinent metadata as nicks in the DNA backbone. To avoid costly worst-case redundancy for correcting sequencing/rewriting errors and to mitigate issues associated with mismatched decoding parameters, we develop machine learning techniques for automatic discoloration detection and image inpainting. The 2DDNA platform is experimentally tested by reconstructing a library of images with undetectable or small visual degradation after readout processing, and by erasing and rewriting copyright metadata encoded in nicks. Our results demonstrate that DNA can serve both as a write-once and rewritable memory for heterogenous data and that data can be erased in a permanent, privacy-preserving manner. Moreover, the storage system can be made robust to degrading channel qualities while avoiding global error-correction redundancy.
format Online
Article
Text
id pubmed-9142566
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-91425662022-05-29 Rewritable two-dimensional DNA-based data storage with machine learning reconstruction Pan, Chao Tabatabaei, S. Kasra Tabatabaei Yazdi, S. M. Hossein Hernandez, Alvaro G. Schroeder, Charles M. Milenkovic, Olgica Nat Commun Article DNA-based data storage platforms traditionally encode information only in the nucleotide sequence of the molecule. Here we report on a two-dimensional molecular data storage system that records information in both the sequence and the backbone structure of DNA and performs nontrivial joint data encoding, decoding and processing. Our 2DDNA method efficiently stores images in synthetic DNA and embeds pertinent metadata as nicks in the DNA backbone. To avoid costly worst-case redundancy for correcting sequencing/rewriting errors and to mitigate issues associated with mismatched decoding parameters, we develop machine learning techniques for automatic discoloration detection and image inpainting. The 2DDNA platform is experimentally tested by reconstructing a library of images with undetectable or small visual degradation after readout processing, and by erasing and rewriting copyright metadata encoded in nicks. Our results demonstrate that DNA can serve both as a write-once and rewritable memory for heterogenous data and that data can be erased in a permanent, privacy-preserving manner. Moreover, the storage system can be made robust to degrading channel qualities while avoiding global error-correction redundancy. Nature Publishing Group UK 2022-05-27 /pmc/articles/PMC9142566/ /pubmed/35624096 http://dx.doi.org/10.1038/s41467-022-30140-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Pan, Chao
Tabatabaei, S. Kasra
Tabatabaei Yazdi, S. M. Hossein
Hernandez, Alvaro G.
Schroeder, Charles M.
Milenkovic, Olgica
Rewritable two-dimensional DNA-based data storage with machine learning reconstruction
title Rewritable two-dimensional DNA-based data storage with machine learning reconstruction
title_full Rewritable two-dimensional DNA-based data storage with machine learning reconstruction
title_fullStr Rewritable two-dimensional DNA-based data storage with machine learning reconstruction
title_full_unstemmed Rewritable two-dimensional DNA-based data storage with machine learning reconstruction
title_short Rewritable two-dimensional DNA-based data storage with machine learning reconstruction
title_sort rewritable two-dimensional dna-based data storage with machine learning reconstruction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9142566/
https://www.ncbi.nlm.nih.gov/pubmed/35624096
http://dx.doi.org/10.1038/s41467-022-30140-x
work_keys_str_mv AT panchao rewritabletwodimensionaldnabaseddatastoragewithmachinelearningreconstruction
AT tabatabaeiskasra rewritabletwodimensionaldnabaseddatastoragewithmachinelearningreconstruction
AT tabatabaeiyazdismhossein rewritabletwodimensionaldnabaseddatastoragewithmachinelearningreconstruction
AT hernandezalvarog rewritabletwodimensionaldnabaseddatastoragewithmachinelearningreconstruction
AT schroedercharlesm rewritabletwodimensionaldnabaseddatastoragewithmachinelearningreconstruction
AT milenkovicolgica rewritabletwodimensionaldnabaseddatastoragewithmachinelearningreconstruction