Cargando…

Combinatorial constraint coding based on the EORS algorithm in DNA storage

The development of information technology has produced massive amounts of data, which has brought severe challenges to information storage. Traditional electronic storage media cannot keep up with the ever-increasing demand for data storage, but in its place DNA has emerged as a feasible storage med...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiaoru, Li, Ling, Guo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8320985/
https://www.ncbi.nlm.nih.gov/pubmed/34324571
http://dx.doi.org/10.1371/journal.pone.0255376
_version_ 1783730743997890560
author Xiaoru, Li
Ling, Guo
author_facet Xiaoru, Li
Ling, Guo
author_sort Xiaoru, Li
collection PubMed
description The development of information technology has produced massive amounts of data, which has brought severe challenges to information storage. Traditional electronic storage media cannot keep up with the ever-increasing demand for data storage, but in its place DNA has emerged as a feasible storage medium with high density, large storage capacity and strong durability. In DNA data storage, many different approaches can be used to encode data into codewords. DNA coding is a key step in DNA storage and can directly affect storage performance and data integrity. However, since errors are prone to occur in DNA synthesis and sequencing, and non-specific hybridization is prone to occur in the solution, how to effectively encode DNA has become an urgent problem to be solved. In this article, we propose a DNA storage coding method based on the equilibrium optimization random search (EORS) algorithm, which meets the Hamming distance, GC content and no-runlength constraints and can reduce the error rate in storage. Simulation experiments have shown that the size of the DNA storage code set constructed by the EORS algorithm that meets the combination constraints has increased by an average of 11% compared with previous work. The increase in the code set means that shorter DNA chains can be used to store more data.
format Online
Article
Text
id pubmed-8320985
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83209852021-07-31 Combinatorial constraint coding based on the EORS algorithm in DNA storage Xiaoru, Li Ling, Guo PLoS One Research Article The development of information technology has produced massive amounts of data, which has brought severe challenges to information storage. Traditional electronic storage media cannot keep up with the ever-increasing demand for data storage, but in its place DNA has emerged as a feasible storage medium with high density, large storage capacity and strong durability. In DNA data storage, many different approaches can be used to encode data into codewords. DNA coding is a key step in DNA storage and can directly affect storage performance and data integrity. However, since errors are prone to occur in DNA synthesis and sequencing, and non-specific hybridization is prone to occur in the solution, how to effectively encode DNA has become an urgent problem to be solved. In this article, we propose a DNA storage coding method based on the equilibrium optimization random search (EORS) algorithm, which meets the Hamming distance, GC content and no-runlength constraints and can reduce the error rate in storage. Simulation experiments have shown that the size of the DNA storage code set constructed by the EORS algorithm that meets the combination constraints has increased by an average of 11% compared with previous work. The increase in the code set means that shorter DNA chains can be used to store more data. Public Library of Science 2021-07-29 /pmc/articles/PMC8320985/ /pubmed/34324571 http://dx.doi.org/10.1371/journal.pone.0255376 Text en © 2021 Xiaoru, Ling https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Xiaoru, Li
Ling, Guo
Combinatorial constraint coding based on the EORS algorithm in DNA storage
title Combinatorial constraint coding based on the EORS algorithm in DNA storage
title_full Combinatorial constraint coding based on the EORS algorithm in DNA storage
title_fullStr Combinatorial constraint coding based on the EORS algorithm in DNA storage
title_full_unstemmed Combinatorial constraint coding based on the EORS algorithm in DNA storage
title_short Combinatorial constraint coding based on the EORS algorithm in DNA storage
title_sort combinatorial constraint coding based on the eors algorithm in dna storage
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8320985/
https://www.ncbi.nlm.nih.gov/pubmed/34324571
http://dx.doi.org/10.1371/journal.pone.0255376
work_keys_str_mv AT xiaoruli combinatorialconstraintcodingbasedontheeorsalgorithmindnastorage
AT lingguo combinatorialconstraintcodingbasedontheeorsalgorithmindnastorage