Cargando…
Interpretive time-frequency analysis of genomic sequences
BACKGROUND: Time-Frequency (TF) analysis has been extensively used for the analysis of non-stationary numeric signals in the past decade. At the same time, recent studies have statistically confirmed the non-stationarity of genomic non-numeric sequences and suggested the use of non-stationary analys...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374637/ https://www.ncbi.nlm.nih.gov/pubmed/28361669 http://dx.doi.org/10.1186/s12859-017-1524-0 |
_version_ | 1782518931167117312 |
---|---|
author | Hassani Saadi, Hamed Sameni, Reza Zollanvari, Amin |
author_facet | Hassani Saadi, Hamed Sameni, Reza Zollanvari, Amin |
author_sort | Hassani Saadi, Hamed |
collection | PubMed |
description | BACKGROUND: Time-Frequency (TF) analysis has been extensively used for the analysis of non-stationary numeric signals in the past decade. At the same time, recent studies have statistically confirmed the non-stationarity of genomic non-numeric sequences and suggested the use of non-stationary analysis for these sequences. The conventional approach to analyze non-numeric genomic sequences using techniques specific to numerical data is to convert non-numerical data into numerical values in some way and then apply time or transform domain signal processing algorithms. Nevertheless, this approach raises questions regarding the relative magnitudes under numeric transforms, which can potentially lead to spurious patterns or misinterpretation of results. RESULTS: In this paper, using the notion of interpretive signal processing (ISP) and by redefining correlation functions for non-numeric sequences, a general class of TF transforms are extended and applied to non-numerical genomic sequences. The technique has been successfully evaluated on synthetic and real DNA sequences. CONCLUSION: The proposed framework is fairly generic and is believed to be useful for extracting quantitative and visual information regarding local and global periodicity, symmetry, (non-) stationarity and spectral color of genomic sequences. The notion of interpretive time-frequency analysis introduced in this work can be considered as the first step towards the development of a rigorous mathematical construct for genomic signal processing. |
format | Online Article Text |
id | pubmed-5374637 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53746372017-04-03 Interpretive time-frequency analysis of genomic sequences Hassani Saadi, Hamed Sameni, Reza Zollanvari, Amin BMC Bioinformatics Research BACKGROUND: Time-Frequency (TF) analysis has been extensively used for the analysis of non-stationary numeric signals in the past decade. At the same time, recent studies have statistically confirmed the non-stationarity of genomic non-numeric sequences and suggested the use of non-stationary analysis for these sequences. The conventional approach to analyze non-numeric genomic sequences using techniques specific to numerical data is to convert non-numerical data into numerical values in some way and then apply time or transform domain signal processing algorithms. Nevertheless, this approach raises questions regarding the relative magnitudes under numeric transforms, which can potentially lead to spurious patterns or misinterpretation of results. RESULTS: In this paper, using the notion of interpretive signal processing (ISP) and by redefining correlation functions for non-numeric sequences, a general class of TF transforms are extended and applied to non-numerical genomic sequences. The technique has been successfully evaluated on synthetic and real DNA sequences. CONCLUSION: The proposed framework is fairly generic and is believed to be useful for extracting quantitative and visual information regarding local and global periodicity, symmetry, (non-) stationarity and spectral color of genomic sequences. The notion of interpretive time-frequency analysis introduced in this work can be considered as the first step towards the development of a rigorous mathematical construct for genomic signal processing. BioMed Central 2017-03-22 /pmc/articles/PMC5374637/ /pubmed/28361669 http://dx.doi.org/10.1186/s12859-017-1524-0 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Hassani Saadi, Hamed Sameni, Reza Zollanvari, Amin Interpretive time-frequency analysis of genomic sequences |
title | Interpretive time-frequency analysis of genomic sequences |
title_full | Interpretive time-frequency analysis of genomic sequences |
title_fullStr | Interpretive time-frequency analysis of genomic sequences |
title_full_unstemmed | Interpretive time-frequency analysis of genomic sequences |
title_short | Interpretive time-frequency analysis of genomic sequences |
title_sort | interpretive time-frequency analysis of genomic sequences |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374637/ https://www.ncbi.nlm.nih.gov/pubmed/28361669 http://dx.doi.org/10.1186/s12859-017-1524-0 |
work_keys_str_mv | AT hassanisaadihamed interpretivetimefrequencyanalysisofgenomicsequences AT samenireza interpretivetimefrequencyanalysisofgenomicsequences AT zollanvariamin interpretivetimefrequencyanalysisofgenomicsequences |