Cargando…

CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage

In the era of big data, new storage media are urgently needed because the storage capacity for global data cannot meet the exponential growth of information. Deoxyribonucleic acid (DNA) storage, where primer and address sequences play a crucial role, is one of the most promising storage media becaus...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Yanfen, Wu, Jieqiong, Wang, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8129200/
https://www.ncbi.nlm.nih.gov/pubmed/34017354
http://dx.doi.org/10.3389/fgene.2021.644945
_version_ 1783694277268733952
author Zheng, Yanfen
Wu, Jieqiong
Wang, Bin
author_facet Zheng, Yanfen
Wu, Jieqiong
Wang, Bin
author_sort Zheng, Yanfen
collection PubMed
description In the era of big data, new storage media are urgently needed because the storage capacity for global data cannot meet the exponential growth of information. Deoxyribonucleic acid (DNA) storage, where primer and address sequences play a crucial role, is one of the most promising storage media because of its high density, large capacity and durability. In this study, we describe an enhanced gradient-based optimizer that includes the Cauchy and Levy mutation strategy (CLGBO) to construct DNA coding sets, which are used as primer and address libraries. Our experimental results show that the lower bounds of DNA storage coding sets obtained using the CLGBO algorithm are increased by 4.3–13.5% compared with previous work. The non-adjacent subsequence constraint was introduced to reduce the error rate in the storage process. This helps to resolve the problem that arises when consecutive repetitive subsequences in the sequence cause errors in DNA storage. We made use of the CLGBO algorithm and the non-adjacent subsequence constraint to construct larger and more highly robust coding sets.
format Online
Article
Text
id pubmed-8129200
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81292002021-05-19 CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage Zheng, Yanfen Wu, Jieqiong Wang, Bin Front Genet Genetics In the era of big data, new storage media are urgently needed because the storage capacity for global data cannot meet the exponential growth of information. Deoxyribonucleic acid (DNA) storage, where primer and address sequences play a crucial role, is one of the most promising storage media because of its high density, large capacity and durability. In this study, we describe an enhanced gradient-based optimizer that includes the Cauchy and Levy mutation strategy (CLGBO) to construct DNA coding sets, which are used as primer and address libraries. Our experimental results show that the lower bounds of DNA storage coding sets obtained using the CLGBO algorithm are increased by 4.3–13.5% compared with previous work. The non-adjacent subsequence constraint was introduced to reduce the error rate in the storage process. This helps to resolve the problem that arises when consecutive repetitive subsequences in the sequence cause errors in DNA storage. We made use of the CLGBO algorithm and the non-adjacent subsequence constraint to construct larger and more highly robust coding sets. Frontiers Media S.A. 2021-05-04 /pmc/articles/PMC8129200/ /pubmed/34017354 http://dx.doi.org/10.3389/fgene.2021.644945 Text en Copyright © 2021 Zheng, Wu and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zheng, Yanfen
Wu, Jieqiong
Wang, Bin
CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage
title CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage
title_full CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage
title_fullStr CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage
title_full_unstemmed CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage
title_short CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage
title_sort clgbo: an algorithm for constructing highly robust coding sets for dna storage
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8129200/
https://www.ncbi.nlm.nih.gov/pubmed/34017354
http://dx.doi.org/10.3389/fgene.2021.644945
work_keys_str_mv AT zhengyanfen clgboanalgorithmforconstructinghighlyrobustcodingsetsfordnastorage
AT wujieqiong clgboanalgorithmforconstructinghighlyrobustcodingsetsfordnastorage
AT wangbin clgboanalgorithmforconstructinghighlyrobustcodingsetsfordnastorage