Cargando…

Towards long double-stranded chains and robust DNA-based data storage using the random code system

DNA has become a popular choice for next-generation storage media due to its high storage density and stability. As the storage medium of life’s information, DNA has significant storage capacity and low-cost, low-power replication and transcription capabilities. However, utilizing long double-strand...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Xu, Shi, Xiaolong, Lai, Langwen, Chen, Congzhou, Xu, Huaisheng, Deng, Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10294226/
https://www.ncbi.nlm.nih.gov/pubmed/37384333
http://dx.doi.org/10.3389/fgene.2023.1179867
_version_ 1785063150105657344
author Yang, Xu
Shi, Xiaolong
Lai, Langwen
Chen, Congzhou
Xu, Huaisheng
Deng, Ming
author_facet Yang, Xu
Shi, Xiaolong
Lai, Langwen
Chen, Congzhou
Xu, Huaisheng
Deng, Ming
author_sort Yang, Xu
collection PubMed
description DNA has become a popular choice for next-generation storage media due to its high storage density and stability. As the storage medium of life’s information, DNA has significant storage capacity and low-cost, low-power replication and transcription capabilities. However, utilizing long double-stranded DNA for storage can introduce unstable factors that make it difficult to meet the constraints of biological systems. To address this challenge, we have designed a highly robust coding scheme called the “random code system,” inspired by the idea of fountain codes. The random code system includes the establishment of a random matrix, Gaussian preprocessing, and random equilibrium. Compared to Luby transform codes (LT codes), random code (RC) has better robustness and recovery ability of lost information. In biological experiments, we successfully stored 29,390 bits of data in 25,700 bp chains, achieving a storage density of 1.78 bits per nucleotide. These results demonstrate the potential for using long double-stranded DNA and the random code system for robust DNA-based data storage.
format Online
Article
Text
id pubmed-10294226
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-102942262023-06-28 Towards long double-stranded chains and robust DNA-based data storage using the random code system Yang, Xu Shi, Xiaolong Lai, Langwen Chen, Congzhou Xu, Huaisheng Deng, Ming Front Genet Genetics DNA has become a popular choice for next-generation storage media due to its high storage density and stability. As the storage medium of life’s information, DNA has significant storage capacity and low-cost, low-power replication and transcription capabilities. However, utilizing long double-stranded DNA for storage can introduce unstable factors that make it difficult to meet the constraints of biological systems. To address this challenge, we have designed a highly robust coding scheme called the “random code system,” inspired by the idea of fountain codes. The random code system includes the establishment of a random matrix, Gaussian preprocessing, and random equilibrium. Compared to Luby transform codes (LT codes), random code (RC) has better robustness and recovery ability of lost information. In biological experiments, we successfully stored 29,390 bits of data in 25,700 bp chains, achieving a storage density of 1.78 bits per nucleotide. These results demonstrate the potential for using long double-stranded DNA and the random code system for robust DNA-based data storage. Frontiers Media S.A. 2023-06-13 /pmc/articles/PMC10294226/ /pubmed/37384333 http://dx.doi.org/10.3389/fgene.2023.1179867 Text en Copyright © 2023 Yang, Shi, Lai, Chen, Xu and Deng. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Yang, Xu
Shi, Xiaolong
Lai, Langwen
Chen, Congzhou
Xu, Huaisheng
Deng, Ming
Towards long double-stranded chains and robust DNA-based data storage using the random code system
title Towards long double-stranded chains and robust DNA-based data storage using the random code system
title_full Towards long double-stranded chains and robust DNA-based data storage using the random code system
title_fullStr Towards long double-stranded chains and robust DNA-based data storage using the random code system
title_full_unstemmed Towards long double-stranded chains and robust DNA-based data storage using the random code system
title_short Towards long double-stranded chains and robust DNA-based data storage using the random code system
title_sort towards long double-stranded chains and robust dna-based data storage using the random code system
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10294226/
https://www.ncbi.nlm.nih.gov/pubmed/37384333
http://dx.doi.org/10.3389/fgene.2023.1179867
work_keys_str_mv AT yangxu towardslongdoublestrandedchainsandrobustdnabaseddatastorageusingtherandomcodesystem
AT shixiaolong towardslongdoublestrandedchainsandrobustdnabaseddatastorageusingtherandomcodesystem
AT lailangwen towardslongdoublestrandedchainsandrobustdnabaseddatastorageusingtherandomcodesystem
AT chencongzhou towardslongdoublestrandedchainsandrobustdnabaseddatastorageusingtherandomcodesystem
AT xuhuaisheng towardslongdoublestrandedchainsandrobustdnabaseddatastorageusingtherandomcodesystem
AT dengming towardslongdoublestrandedchainsandrobustdnabaseddatastorageusingtherandomcodesystem