Cargando…

Extracting biologically significant patterns from short time series gene expression data

BACKGROUND: Time series gene expression data analysis is used widely to study the dynamics of various cell processes. Most of the time series data available today consist of few time points only, thus making the application of standard clustering techniques difficult. RESULTS: We developed two new a...

Descripción completa

Detalles Bibliográficos
Autores principales: Tchagang, Alain B, Bui, Kevin V, McGinnis, Thomas, Benos, Panayiotis V
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2743670/
https://www.ncbi.nlm.nih.gov/pubmed/19695084
http://dx.doi.org/10.1186/1471-2105-10-255
_version_ 1782171873338982400
author Tchagang, Alain B
Bui, Kevin V
McGinnis, Thomas
Benos, Panayiotis V
author_facet Tchagang, Alain B
Bui, Kevin V
McGinnis, Thomas
Benos, Panayiotis V
author_sort Tchagang, Alain B
collection PubMed
description BACKGROUND: Time series gene expression data analysis is used widely to study the dynamics of various cell processes. Most of the time series data available today consist of few time points only, thus making the application of standard clustering techniques difficult. RESULTS: We developed two new algorithms that are capable of extracting biological patterns from short time point series gene expression data. The two algorithms, ASTRO and MiMeSR, are inspired by the rank order preserving framework and the minimum mean squared residue approach, respectively. However, ASTRO and MiMeSR differ from previous approaches in that they take advantage of the relatively few number of time points in order to reduce the problem from NP-hard to linear. Tested on well-defined short time expression data, we found that our approaches are robust to noise, as well as to random patterns, and that they can correctly detect the temporal expression profile of relevant functional categories. Evaluation of our methods was performed using Gene Ontology (GO) annotations and chromatin immunoprecipitation (ChIP-chip) data. CONCLUSION: Our approaches generally outperform both standard clustering algorithms and algorithms designed specifically for clustering of short time series gene expression data. Both algorithms are available at .
format Text
id pubmed-2743670
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27436702009-09-15 Extracting biologically significant patterns from short time series gene expression data Tchagang, Alain B Bui, Kevin V McGinnis, Thomas Benos, Panayiotis V BMC Bioinformatics Methodology Article BACKGROUND: Time series gene expression data analysis is used widely to study the dynamics of various cell processes. Most of the time series data available today consist of few time points only, thus making the application of standard clustering techniques difficult. RESULTS: We developed two new algorithms that are capable of extracting biological patterns from short time point series gene expression data. The two algorithms, ASTRO and MiMeSR, are inspired by the rank order preserving framework and the minimum mean squared residue approach, respectively. However, ASTRO and MiMeSR differ from previous approaches in that they take advantage of the relatively few number of time points in order to reduce the problem from NP-hard to linear. Tested on well-defined short time expression data, we found that our approaches are robust to noise, as well as to random patterns, and that they can correctly detect the temporal expression profile of relevant functional categories. Evaluation of our methods was performed using Gene Ontology (GO) annotations and chromatin immunoprecipitation (ChIP-chip) data. CONCLUSION: Our approaches generally outperform both standard clustering algorithms and algorithms designed specifically for clustering of short time series gene expression data. Both algorithms are available at . BioMed Central 2009-08-20 /pmc/articles/PMC2743670/ /pubmed/19695084 http://dx.doi.org/10.1186/1471-2105-10-255 Text en Copyright © 2009 Tchagang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Tchagang, Alain B
Bui, Kevin V
McGinnis, Thomas
Benos, Panayiotis V
Extracting biologically significant patterns from short time series gene expression data
title Extracting biologically significant patterns from short time series gene expression data
title_full Extracting biologically significant patterns from short time series gene expression data
title_fullStr Extracting biologically significant patterns from short time series gene expression data
title_full_unstemmed Extracting biologically significant patterns from short time series gene expression data
title_short Extracting biologically significant patterns from short time series gene expression data
title_sort extracting biologically significant patterns from short time series gene expression data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2743670/
https://www.ncbi.nlm.nih.gov/pubmed/19695084
http://dx.doi.org/10.1186/1471-2105-10-255
work_keys_str_mv AT tchagangalainb extractingbiologicallysignificantpatternsfromshorttimeseriesgeneexpressiondata
AT buikevinv extractingbiologicallysignificantpatternsfromshorttimeseriesgeneexpressiondata
AT mcginnisthomas extractingbiologicallysignificantpatternsfromshorttimeseriesgeneexpressiondata
AT benospanayiotisv extractingbiologicallysignificantpatternsfromshorttimeseriesgeneexpressiondata