Cargando…

Map-invariant spectral analysis for the identification of DNA periodicities

Many signal processing based methods for finding hidden periodicities in DNA sequences have primarily focused on assigning numerical values to the symbolic DNA sequence and then applying spectral analysis tools such as the short-time discrete Fourier transform (ST-DFT) to locate these repeats. The k...

Descripción completa

Detalles Bibliográficos
Autores principales: Rushdi, Ahmad, Tuqan, Jamal, Strohmer, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3751961/
https://www.ncbi.nlm.nih.gov/pubmed/23067324
http://dx.doi.org/10.1186/1687-4153-2012-16
_version_ 1782281710721826816
author Rushdi, Ahmad
Tuqan, Jamal
Strohmer, Thomas
author_facet Rushdi, Ahmad
Tuqan, Jamal
Strohmer, Thomas
author_sort Rushdi, Ahmad
collection PubMed
description Many signal processing based methods for finding hidden periodicities in DNA sequences have primarily focused on assigning numerical values to the symbolic DNA sequence and then applying spectral analysis tools such as the short-time discrete Fourier transform (ST-DFT) to locate these repeats. The key results pertaining to this approach are however obtained using a very specific symbolic to numerical map, namely the so-called Voss representation. An important research problem is to therefore quantify the sensitivity of these results to the choice of the symbolic to numerical map. In this article, a novel algebraic approach to the periodicity detection problem is presented and provides a natural framework for studying the role of the symbolic to numerical map in finding these repeats. More specifically, we derive a new matrix-based expression of the DNA spectrum that comprises most of the widely used mappings in the literature as special cases, shows that the DNA spectrum is in fact invariable under all these mappings, and generates a necessary and sufficient condition for the invariance of the DNA spectrum to the symbolic to numerical map. Furthermore, the new algebraic framework decomposes the periodicity detection problem into several fundamental building blocks that are totally independent of each other. Sophisticated digital filters and/or alternate fast data transforms such as the discrete cosine and sine transforms can therefore be always incorporated in the periodicity detection scheme regardless of the choice of the symbolic to numerical map. Although the newly proposed framework is matrix based, identification of these periodicities can be achieved at a low computational cost.
format Online
Article
Text
id pubmed-3751961
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37519612013-08-27 Map-invariant spectral analysis for the identification of DNA periodicities Rushdi, Ahmad Tuqan, Jamal Strohmer, Thomas EURASIP J Bioinform Syst Biol Research Many signal processing based methods for finding hidden periodicities in DNA sequences have primarily focused on assigning numerical values to the symbolic DNA sequence and then applying spectral analysis tools such as the short-time discrete Fourier transform (ST-DFT) to locate these repeats. The key results pertaining to this approach are however obtained using a very specific symbolic to numerical map, namely the so-called Voss representation. An important research problem is to therefore quantify the sensitivity of these results to the choice of the symbolic to numerical map. In this article, a novel algebraic approach to the periodicity detection problem is presented and provides a natural framework for studying the role of the symbolic to numerical map in finding these repeats. More specifically, we derive a new matrix-based expression of the DNA spectrum that comprises most of the widely used mappings in the literature as special cases, shows that the DNA spectrum is in fact invariable under all these mappings, and generates a necessary and sufficient condition for the invariance of the DNA spectrum to the symbolic to numerical map. Furthermore, the new algebraic framework decomposes the periodicity detection problem into several fundamental building blocks that are totally independent of each other. Sophisticated digital filters and/or alternate fast data transforms such as the discrete cosine and sine transforms can therefore be always incorporated in the periodicity detection scheme regardless of the choice of the symbolic to numerical map. Although the newly proposed framework is matrix based, identification of these periodicities can be achieved at a low computational cost. BioMed Central 2012 2012-10-15 /pmc/articles/PMC3751961/ /pubmed/23067324 http://dx.doi.org/10.1186/1687-4153-2012-16 Text en Copyright © 2012 Rushdi et al.; licensee Springer. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Rushdi, Ahmad
Tuqan, Jamal
Strohmer, Thomas
Map-invariant spectral analysis for the identification of DNA periodicities
title Map-invariant spectral analysis for the identification of DNA periodicities
title_full Map-invariant spectral analysis for the identification of DNA periodicities
title_fullStr Map-invariant spectral analysis for the identification of DNA periodicities
title_full_unstemmed Map-invariant spectral analysis for the identification of DNA periodicities
title_short Map-invariant spectral analysis for the identification of DNA periodicities
title_sort map-invariant spectral analysis for the identification of dna periodicities
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3751961/
https://www.ncbi.nlm.nih.gov/pubmed/23067324
http://dx.doi.org/10.1186/1687-4153-2012-16
work_keys_str_mv AT rushdiahmad mapinvariantspectralanalysisfortheidentificationofdnaperiodicities
AT tuqanjamal mapinvariantspectralanalysisfortheidentificationofdnaperiodicities
AT strohmerthomas mapinvariantspectralanalysisfortheidentificationofdnaperiodicities