Interpretive time-frequency analysis of genomic sequences

BACKGROUND: Time-Frequency (TF) analysis has been extensively used for the analysis of non-stationary numeric signals in the past decade. At the same time, recent studies have statistically confirmed the non-stationarity of genomic non-numeric sequences and suggested the use of non-stationary analys...

Bibliographic Details
Main Authors: Hassani Saadi, Hamed; Sameni, Reza; Zollanvari, Amin
Format: Online Article Text
Language: English
Published: BioMed Central 2017
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374637/
https://www.ncbi.nlm.nih.gov/pubmed/28361669
http://dx.doi.org/10.1186/s12859-017-1524-0
_version_ 1782518931167117312
author Hassani Saadi, Hamed
Sameni, Reza
Zollanvari, Amin
author_facet Hassani Saadi, Hamed
Sameni, Reza
Zollanvari, Amin
author_sort Hassani Saadi, Hamed
collection PubMed
description BACKGROUND: Time-Frequency (TF) analysis has been extensively used for the analysis of non-stationary numeric signals in the past decade. At the same time, recent studies have statistically confirmed the non-stationarity of genomic non-numeric sequences and suggested the use of non-stationary analysis for these sequences. The conventional approach to analyzing non-numeric genomic sequences with techniques specific to numerical data is to convert the non-numerical data into numerical values in some way and then apply time- or transform-domain signal processing algorithms. Nevertheless, this approach raises questions regarding the relative magnitudes under numeric transforms, which can potentially lead to spurious patterns or misinterpretation of results. RESULTS: In this paper, using the notion of interpretive signal processing (ISP) and by redefining correlation functions for non-numeric sequences, a general class of TF transforms is extended and applied to non-numerical genomic sequences. The technique has been successfully evaluated on synthetic and real DNA sequences. CONCLUSION: The proposed framework is fairly generic and is believed to be useful for extracting quantitative and visual information regarding local and global periodicity, symmetry, (non-)stationarity and spectral color of genomic sequences. The notion of interpretive time-frequency analysis introduced in this work can be considered a first step towards the development of a rigorous mathematical construct for genomic signal processing.
format Online
Article
Text
id pubmed-5374637
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5374637 2017-04-03 Interpretive time-frequency analysis of genomic sequences Hassani Saadi, Hamed; Sameni, Reza; Zollanvari, Amin BMC Bioinformatics Research
BioMed Central 2017-03-22 /pmc/articles/PMC5374637/ /pubmed/28361669 http://dx.doi.org/10.1186/s12859-017-1524-0 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Hassani Saadi, Hamed
Sameni, Reza
Zollanvari, Amin
Interpretive time-frequency analysis of genomic sequences
title Interpretive time-frequency analysis of genomic sequences
title_full Interpretive time-frequency analysis of genomic sequences
title_fullStr Interpretive time-frequency analysis of genomic sequences
title_full_unstemmed Interpretive time-frequency analysis of genomic sequences
title_short Interpretive time-frequency analysis of genomic sequences
title_sort interpretive time-frequency analysis of genomic sequences
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374637/
https://www.ncbi.nlm.nih.gov/pubmed/28361669
http://dx.doi.org/10.1186/s12859-017-1524-0
work_keys_str_mv AT hassanisaadihamed interpretivetimefrequencyanalysisofgenomicsequences
AT samenireza interpretivetimefrequencyanalysisofgenomicsequences
AT zollanvariamin interpretivetimefrequencyanalysisofgenomicsequences