Cargando…

Small-Angle Scattering and Multifractal Analysis of DNA Sequences

The arrangement of A, C, G and T nucleotides in large DNA sequences of many prokaryotic and eukaryotic cells exhibit long-range correlations with fractal properties. Chaos game representation (CGR) of such DNA sequences, followed by a multifractal analysis, is a useful way to analyze the correspondi...

Descripción completa

Detalles Bibliográficos
Autor principal: Anitas, Eugen Mircea
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7369734/
https://www.ncbi.nlm.nih.gov/pubmed/32629908
http://dx.doi.org/10.3390/ijms21134651
_version_ 1783560836390846464
author Anitas, Eugen Mircea
author_facet Anitas, Eugen Mircea
author_sort Anitas, Eugen Mircea
collection PubMed
description The arrangement of A, C, G and T nucleotides in large DNA sequences of many prokaryotic and eukaryotic cells exhibit long-range correlations with fractal properties. Chaos game representation (CGR) of such DNA sequences, followed by a multifractal analysis, is a useful way to analyze the corresponding scaling properties. This approach provides a powerful visualization method to characterize their spatial inhomogeneity, and allows discrimination between mono- and multifractal distributions. However, in some cases, two different arbitrary point distributions, may generate indistinguishable multifractal spectra. By using a new model based on multiplicative deterministic cascades, here it is shown that small-angle scattering (SAS) formalism can be used to address such issue, and to extract additional structural information. It is shown that the box-counting dimension given by multifractal spectra can be recovered from the scattering exponent of SAS intensity in the fractal region. This approach is illustrated for point distributions of CGR data corresponding to Escherichia coli, Phospholamban and Mouse mitochondrial DNA, and it is shown that for the latter two cases, SAS allows extraction of the fractal iteration number and the scaling factor corresponding to “ACGT” square, or to recover the number of bases. The results are compared with a model based on multiplicative deterministic cascades, and respectively with one which takes into account the existence of forbidden sequences in DNA. This allows a classification of the DNA sequences in terms of random and deterministic fractals structures emerging in CGR.
format Online
Article
Text
id pubmed-7369734
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73697342020-07-21 Small-Angle Scattering and Multifractal Analysis of DNA Sequences Anitas, Eugen Mircea Int J Mol Sci Article The arrangement of A, C, G and T nucleotides in large DNA sequences of many prokaryotic and eukaryotic cells exhibit long-range correlations with fractal properties. Chaos game representation (CGR) of such DNA sequences, followed by a multifractal analysis, is a useful way to analyze the corresponding scaling properties. This approach provides a powerful visualization method to characterize their spatial inhomogeneity, and allows discrimination between mono- and multifractal distributions. However, in some cases, two different arbitrary point distributions, may generate indistinguishable multifractal spectra. By using a new model based on multiplicative deterministic cascades, here it is shown that small-angle scattering (SAS) formalism can be used to address such issue, and to extract additional structural information. It is shown that the box-counting dimension given by multifractal spectra can be recovered from the scattering exponent of SAS intensity in the fractal region. This approach is illustrated for point distributions of CGR data corresponding to Escherichia coli, Phospholamban and Mouse mitochondrial DNA, and it is shown that for the latter two cases, SAS allows extraction of the fractal iteration number and the scaling factor corresponding to “ACGT” square, or to recover the number of bases. The results are compared with a model based on multiplicative deterministic cascades, and respectively with one which takes into account the existence of forbidden sequences in DNA. This allows a classification of the DNA sequences in terms of random and deterministic fractals structures emerging in CGR. MDPI 2020-06-30 /pmc/articles/PMC7369734/ /pubmed/32629908 http://dx.doi.org/10.3390/ijms21134651 Text en © 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Anitas, Eugen Mircea
Small-Angle Scattering and Multifractal Analysis of DNA Sequences
title Small-Angle Scattering and Multifractal Analysis of DNA Sequences
title_full Small-Angle Scattering and Multifractal Analysis of DNA Sequences
title_fullStr Small-Angle Scattering and Multifractal Analysis of DNA Sequences
title_full_unstemmed Small-Angle Scattering and Multifractal Analysis of DNA Sequences
title_short Small-Angle Scattering and Multifractal Analysis of DNA Sequences
title_sort small-angle scattering and multifractal analysis of dna sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7369734/
https://www.ncbi.nlm.nih.gov/pubmed/32629908
http://dx.doi.org/10.3390/ijms21134651
work_keys_str_mv AT anitaseugenmircea smallanglescatteringandmultifractalanalysisofdnasequences