CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage
In the era of big data, new storage media are urgently needed because the storage capacity for global data cannot meet the exponential growth of information. Deoxyribonucleic acid (DNA) storage, where primer and address sequences play a crucial role, is one of the most promising storage media becaus...
Main Authors: | Zheng, Yanfen, Wu, Jieqiong, Wang, Bin |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2021 |
Subjects: | Genetics |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8129200/ https://www.ncbi.nlm.nih.gov/pubmed/34017354 http://dx.doi.org/10.3389/fgene.2021.644945 |
_version_ | 1783694277268733952 |
---|---|
author | Zheng, Yanfen Wu, Jieqiong Wang, Bin |
author_facet | Zheng, Yanfen Wu, Jieqiong Wang, Bin |
author_sort | Zheng, Yanfen |
collection | PubMed |
description | In the era of big data, new storage media are urgently needed because the storage capacity for global data cannot meet the exponential growth of information. Deoxyribonucleic acid (DNA) storage, where primer and address sequences play a crucial role, is one of the most promising storage media because of its high density, large capacity and durability. In this study, we describe an enhanced gradient-based optimizer that includes the Cauchy and Levy mutation strategy (CLGBO) to construct DNA coding sets, which are used as primer and address libraries. Our experimental results show that the lower bounds of DNA storage coding sets obtained using the CLGBO algorithm are increased by 4.3–13.5% compared with previous work. The non-adjacent subsequence constraint was introduced to reduce the error rate in the storage process. This helps to resolve the problem that arises when consecutive repetitive subsequences in the sequence cause errors in DNA storage. We made use of the CLGBO algorithm and the non-adjacent subsequence constraint to construct larger and more highly robust coding sets. |
format | Online Article Text |
id | pubmed-8129200 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-8129200 2021-05-19 CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage Zheng, Yanfen Wu, Jieqiong Wang, Bin Front Genet Genetics In the era of big data, new storage media are urgently needed because the storage capacity for global data cannot meet the exponential growth of information. Deoxyribonucleic acid (DNA) storage, where primer and address sequences play a crucial role, is one of the most promising storage media because of its high density, large capacity and durability. In this study, we describe an enhanced gradient-based optimizer that includes the Cauchy and Levy mutation strategy (CLGBO) to construct DNA coding sets, which are used as primer and address libraries. Our experimental results show that the lower bounds of DNA storage coding sets obtained using the CLGBO algorithm are increased by 4.3–13.5% compared with previous work. The non-adjacent subsequence constraint was introduced to reduce the error rate in the storage process. This helps to resolve the problem that arises when consecutive repetitive subsequences in the sequence cause errors in DNA storage. We made use of the CLGBO algorithm and the non-adjacent subsequence constraint to construct larger and more highly robust coding sets. Frontiers Media S.A. 2021-05-04 /pmc/articles/PMC8129200/ /pubmed/34017354 http://dx.doi.org/10.3389/fgene.2021.644945 Text en Copyright © 2021 Zheng, Wu and Wang. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Zheng, Yanfen Wu, Jieqiong Wang, Bin CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage |
title | CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage |
title_full | CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage |
title_fullStr | CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage |
title_full_unstemmed | CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage |
title_short | CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage |
title_sort | clgbo: an algorithm for constructing highly robust coding sets for dna storage |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8129200/ https://www.ncbi.nlm.nih.gov/pubmed/34017354 http://dx.doi.org/10.3389/fgene.2021.644945 |
work_keys_str_mv | AT zhengyanfen clgboanalgorithmforconstructinghighlyrobustcodingsetsfordnastorage AT wujieqiong clgboanalgorithmforconstructinghighlyrobustcodingsetsfordnastorage AT wangbin clgboanalgorithmforconstructinghighlyrobustcodingsetsfordnastorage |