Cargando…

Study on DNA Storage Encoding Based IAOA under Innovation Constraints

With the informationization of social processes, the amount of related data has greatly increased, making traditional storage media unable to meet the current requirements for data storage. Due to its advantages of a high storage capacity and persistence, deoxyribonucleic acid (DNA) has been conside...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Haigui, Zhou, Shihua, Yan, WeiQi, Wang, Sijie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10136724/
https://www.ncbi.nlm.nih.gov/pubmed/37185757
http://dx.doi.org/10.3390/cimb45040233
_version_ 1785032288039337984
author Du, Haigui
Zhou, Shihua
Yan, WeiQi
Wang, Sijie
author_facet Du, Haigui
Zhou, Shihua
Yan, WeiQi
Wang, Sijie
author_sort Du, Haigui
collection PubMed
description With the informationization of social processes, the amount of related data has greatly increased, making traditional storage media unable to meet the current requirements for data storage. Due to its advantages of a high storage capacity and persistence, deoxyribonucleic acid (DNA) has been considered the most prospective storage media to solve the data storage problem. Synthesis is an important process for DNA storage, and low-quality DNA coding can increase errors during sequencing, which can affect the storage efficiency. To reduce errors caused by the poor stability of DNA sequences during storage, this paper proposes a method that uses the double-matching and error-pairing constraints to improve the quality of the DNA coding set. First, the double-matching and error-pairing constraints are defined to solve problems of sequences with self-complementary reactions in the solution that are prone to mismatch at the 3′ end. In addition, two strategies are introduced in the arithmetic optimization algorithm, including a random perturbation of the elementary function and a double adaptive weighting strategy. An improved arithmetic optimization algorithm (IAOA) is proposed to construct DNA coding sets. The experimental results of the IAOA on 13 benchmark functions show a significant improvement in its exploration and development capabilities over the existing algorithms. Moreover, the IAOA is used in the DNA encoding design under both traditional and new constraints. The DNA coding sets are tested to estimate their quality regarding the number of hairpins and melting temperature. The DNA storage coding sets constructed in this study are improved by 77.7% at the lower boundary compared to existing algorithms. The DNA sequences in the storage sets show a reduction of 9.7–84.1% in the melting temperature variance, and the hairpin structure ratio is reduced by 2.1–80%. The results indicate that the stability of the DNA coding sets is improved under the two proposed constraints compared to traditional constraints.
format Online
Article
Text
id pubmed-10136724
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101367242023-04-28 Study on DNA Storage Encoding Based IAOA under Innovation Constraints Du, Haigui Zhou, Shihua Yan, WeiQi Wang, Sijie Curr Issues Mol Biol Article With the informationization of social processes, the amount of related data has greatly increased, making traditional storage media unable to meet the current requirements for data storage. Due to its advantages of a high storage capacity and persistence, deoxyribonucleic acid (DNA) has been considered the most prospective storage media to solve the data storage problem. Synthesis is an important process for DNA storage, and low-quality DNA coding can increase errors during sequencing, which can affect the storage efficiency. To reduce errors caused by the poor stability of DNA sequences during storage, this paper proposes a method that uses the double-matching and error-pairing constraints to improve the quality of the DNA coding set. First, the double-matching and error-pairing constraints are defined to solve problems of sequences with self-complementary reactions in the solution that are prone to mismatch at the 3′ end. In addition, two strategies are introduced in the arithmetic optimization algorithm, including a random perturbation of the elementary function and a double adaptive weighting strategy. An improved arithmetic optimization algorithm (IAOA) is proposed to construct DNA coding sets. The experimental results of the IAOA on 13 benchmark functions show a significant improvement in its exploration and development capabilities over the existing algorithms. Moreover, the IAOA is used in the DNA encoding design under both traditional and new constraints. The DNA coding sets are tested to estimate their quality regarding the number of hairpins and melting temperature. The DNA storage coding sets constructed in this study are improved by 77.7% at the lower boundary compared to existing algorithms. The DNA sequences in the storage sets show a reduction of 9.7–84.1% in the melting temperature variance, and the hairpin structure ratio is reduced by 2.1–80%. The results indicate that the stability of the DNA coding sets is improved under the two proposed constraints compared to traditional constraints. MDPI 2023-04-18 /pmc/articles/PMC10136724/ /pubmed/37185757 http://dx.doi.org/10.3390/cimb45040233 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Du, Haigui
Zhou, Shihua
Yan, WeiQi
Wang, Sijie
Study on DNA Storage Encoding Based IAOA under Innovation Constraints
title Study on DNA Storage Encoding Based IAOA under Innovation Constraints
title_full Study on DNA Storage Encoding Based IAOA under Innovation Constraints
title_fullStr Study on DNA Storage Encoding Based IAOA under Innovation Constraints
title_full_unstemmed Study on DNA Storage Encoding Based IAOA under Innovation Constraints
title_short Study on DNA Storage Encoding Based IAOA under Innovation Constraints
title_sort study on dna storage encoding based iaoa under innovation constraints
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10136724/
https://www.ncbi.nlm.nih.gov/pubmed/37185757
http://dx.doi.org/10.3390/cimb45040233
work_keys_str_mv AT duhaigui studyondnastorageencodingbasediaoaunderinnovationconstraints
AT zhoushihua studyondnastorageencodingbasediaoaunderinnovationconstraints
AT yanweiqi studyondnastorageencodingbasediaoaunderinnovationconstraints
AT wangsijie studyondnastorageencodingbasediaoaunderinnovationconstraints