Cargando…

Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models

BACKGROUND: We consider data from a time course microarray experiment that was conducted on grapevines over the development cycle of the grape berries at two different vineyards in South Australia. Although the underlying biological process of berry development is the same at both vineyards, there a...

Descripción completa

Detalles Bibliográficos
Autores principales: Robinson, Sean, Glonek, Garique, Koch, Inge, Thomas, Mark, Davies, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4472167/
https://www.ncbi.nlm.nih.gov/pubmed/26084333
http://dx.doi.org/10.1186/s12859-015-0634-9
_version_ 1782377013134229504
author Robinson, Sean
Glonek, Garique
Koch, Inge
Thomas, Mark
Davies, Christopher
author_facet Robinson, Sean
Glonek, Garique
Koch, Inge
Thomas, Mark
Davies, Christopher
author_sort Robinson, Sean
collection PubMed
description BACKGROUND: We consider data from a time course microarray experiment that was conducted on grapevines over the development cycle of the grape berries at two different vineyards in South Australia. Although the underlying biological process of berry development is the same at both vineyards, there are differences in the timing of the development due to local conditions. We aim to align the data from the two vineyards to enable an integrated analysis of the gene expression and use the alignment of the expression profiles to classify likely developmental function. RESULTS: We present a novel alignment method based on hidden Markov models (HMMs) and use the method to align the motivating grapevine data. We show that our alignment method is robust against subsets of profiles that are not suitable for alignment, investigate alignment diagnostics under the model and demonstrate the classification of developmentally driven genes. CONCLUSIONS: The classification of developmentally driven genes both validates that the alignment we obtain is meaningful and also gives new evidence that can be used to identify the role of genes with unknown function. Using our alignment methodology, we find at least 1279 grapevine probe sets with no current annotated function that are likely to be controlled in a developmental manner. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0634-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4472167
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44721672015-06-19 Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models Robinson, Sean Glonek, Garique Koch, Inge Thomas, Mark Davies, Christopher BMC Bioinformatics Research Article BACKGROUND: We consider data from a time course microarray experiment that was conducted on grapevines over the development cycle of the grape berries at two different vineyards in South Australia. Although the underlying biological process of berry development is the same at both vineyards, there are differences in the timing of the development due to local conditions. We aim to align the data from the two vineyards to enable an integrated analysis of the gene expression and use the alignment of the expression profiles to classify likely developmental function. RESULTS: We present a novel alignment method based on hidden Markov models (HMMs) and use the method to align the motivating grapevine data. We show that our alignment method is robust against subsets of profiles that are not suitable for alignment, investigate alignment diagnostics under the model and demonstrate the classification of developmentally driven genes. CONCLUSIONS: The classification of developmentally driven genes both validates that the alignment we obtain is meaningful and also gives new evidence that can be used to identify the role of genes with unknown function. Using our alignment methodology, we find at least 1279 grapevine probe sets with no current annotated function that are likely to be controlled in a developmental manner. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0634-9) contains supplementary material, which is available to authorized users. BioMed Central 2015-06-18 /pmc/articles/PMC4472167/ /pubmed/26084333 http://dx.doi.org/10.1186/s12859-015-0634-9 Text en © Robinson et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Robinson, Sean
Glonek, Garique
Koch, Inge
Thomas, Mark
Davies, Christopher
Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models
title Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models
title_full Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models
title_fullStr Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models
title_full_unstemmed Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models
title_short Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models
title_sort alignment of time course gene expression data and the classification of developmentally driven genes with hidden markov models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4472167/
https://www.ncbi.nlm.nih.gov/pubmed/26084333
http://dx.doi.org/10.1186/s12859-015-0634-9
work_keys_str_mv AT robinsonsean alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels
AT glonekgarique alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels
AT kochinge alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels
AT thomasmark alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels
AT davieschristopher alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels