Cargando…
Study on DNA Storage Encoding Based IAOA under Innovation Constraints
With the informationization of social processes, the amount of related data has greatly increased, making traditional storage media unable to meet the current requirements for data storage. Due to its advantages of a high storage capacity and persistence, deoxyribonucleic acid (DNA) has been conside...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10136724/ https://www.ncbi.nlm.nih.gov/pubmed/37185757 http://dx.doi.org/10.3390/cimb45040233 |
_version_ | 1785032288039337984 |
---|---|
author | Du, Haigui Zhou, Shihua Yan, WeiQi Wang, Sijie |
author_facet | Du, Haigui Zhou, Shihua Yan, WeiQi Wang, Sijie |
author_sort | Du, Haigui |
collection | PubMed |
description | With the informationization of social processes, the amount of related data has greatly increased, making traditional storage media unable to meet the current requirements for data storage. Due to its advantages of a high storage capacity and persistence, deoxyribonucleic acid (DNA) has been considered the most prospective storage media to solve the data storage problem. Synthesis is an important process for DNA storage, and low-quality DNA coding can increase errors during sequencing, which can affect the storage efficiency. To reduce errors caused by the poor stability of DNA sequences during storage, this paper proposes a method that uses the double-matching and error-pairing constraints to improve the quality of the DNA coding set. First, the double-matching and error-pairing constraints are defined to solve problems of sequences with self-complementary reactions in the solution that are prone to mismatch at the 3′ end. In addition, two strategies are introduced in the arithmetic optimization algorithm, including a random perturbation of the elementary function and a double adaptive weighting strategy. An improved arithmetic optimization algorithm (IAOA) is proposed to construct DNA coding sets. The experimental results of the IAOA on 13 benchmark functions show a significant improvement in its exploration and development capabilities over the existing algorithms. Moreover, the IAOA is used in the DNA encoding design under both traditional and new constraints. The DNA coding sets are tested to estimate their quality regarding the number of hairpins and melting temperature. The DNA storage coding sets constructed in this study are improved by 77.7% at the lower boundary compared to existing algorithms. The DNA sequences in the storage sets show a reduction of 9.7–84.1% in the melting temperature variance, and the hairpin structure ratio is reduced by 2.1–80%. The results indicate that the stability of the DNA coding sets is improved under the two proposed constraints compared to traditional constraints. |
format | Online Article Text |
id | pubmed-10136724 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-101367242023-04-28 Study on DNA Storage Encoding Based IAOA under Innovation Constraints Du, Haigui Zhou, Shihua Yan, WeiQi Wang, Sijie Curr Issues Mol Biol Article With the informationization of social processes, the amount of related data has greatly increased, making traditional storage media unable to meet the current requirements for data storage. Due to its advantages of a high storage capacity and persistence, deoxyribonucleic acid (DNA) has been considered the most prospective storage media to solve the data storage problem. Synthesis is an important process for DNA storage, and low-quality DNA coding can increase errors during sequencing, which can affect the storage efficiency. To reduce errors caused by the poor stability of DNA sequences during storage, this paper proposes a method that uses the double-matching and error-pairing constraints to improve the quality of the DNA coding set. First, the double-matching and error-pairing constraints are defined to solve problems of sequences with self-complementary reactions in the solution that are prone to mismatch at the 3′ end. In addition, two strategies are introduced in the arithmetic optimization algorithm, including a random perturbation of the elementary function and a double adaptive weighting strategy. An improved arithmetic optimization algorithm (IAOA) is proposed to construct DNA coding sets. The experimental results of the IAOA on 13 benchmark functions show a significant improvement in its exploration and development capabilities over the existing algorithms. Moreover, the IAOA is used in the DNA encoding design under both traditional and new constraints. The DNA coding sets are tested to estimate their quality regarding the number of hairpins and melting temperature. The DNA storage coding sets constructed in this study are improved by 77.7% at the lower boundary compared to existing algorithms. The DNA sequences in the storage sets show a reduction of 9.7–84.1% in the melting temperature variance, and the hairpin structure ratio is reduced by 2.1–80%. The results indicate that the stability of the DNA coding sets is improved under the two proposed constraints compared to traditional constraints. MDPI 2023-04-18 /pmc/articles/PMC10136724/ /pubmed/37185757 http://dx.doi.org/10.3390/cimb45040233 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Du, Haigui Zhou, Shihua Yan, WeiQi Wang, Sijie Study on DNA Storage Encoding Based IAOA under Innovation Constraints |
title | Study on DNA Storage Encoding Based IAOA under Innovation Constraints |
title_full | Study on DNA Storage Encoding Based IAOA under Innovation Constraints |
title_fullStr | Study on DNA Storage Encoding Based IAOA under Innovation Constraints |
title_full_unstemmed | Study on DNA Storage Encoding Based IAOA under Innovation Constraints |
title_short | Study on DNA Storage Encoding Based IAOA under Innovation Constraints |
title_sort | study on dna storage encoding based iaoa under innovation constraints |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10136724/ https://www.ncbi.nlm.nih.gov/pubmed/37185757 http://dx.doi.org/10.3390/cimb45040233 |
work_keys_str_mv | AT duhaigui studyondnastorageencodingbasediaoaunderinnovationconstraints AT zhoushihua studyondnastorageencodingbasediaoaunderinnovationconstraints AT yanweiqi studyondnastorageencodingbasediaoaunderinnovationconstraints AT wangsijie studyondnastorageencodingbasediaoaunderinnovationconstraints |