Cargando…
Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models
BACKGROUND: We consider data from a time course microarray experiment that was conducted on grapevines over the development cycle of the grape berries at two different vineyards in South Australia. Although the underlying biological process of berry development is the same at both vineyards, there a...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4472167/ https://www.ncbi.nlm.nih.gov/pubmed/26084333 http://dx.doi.org/10.1186/s12859-015-0634-9 |
_version_ | 1782377013134229504 |
---|---|
author | Robinson, Sean Glonek, Garique Koch, Inge Thomas, Mark Davies, Christopher |
author_facet | Robinson, Sean Glonek, Garique Koch, Inge Thomas, Mark Davies, Christopher |
author_sort | Robinson, Sean |
collection | PubMed |
description | BACKGROUND: We consider data from a time course microarray experiment that was conducted on grapevines over the development cycle of the grape berries at two different vineyards in South Australia. Although the underlying biological process of berry development is the same at both vineyards, there are differences in the timing of the development due to local conditions. We aim to align the data from the two vineyards to enable an integrated analysis of the gene expression and use the alignment of the expression profiles to classify likely developmental function. RESULTS: We present a novel alignment method based on hidden Markov models (HMMs) and use the method to align the motivating grapevine data. We show that our alignment method is robust against subsets of profiles that are not suitable for alignment, investigate alignment diagnostics under the model and demonstrate the classification of developmentally driven genes. CONCLUSIONS: The classification of developmentally driven genes both validates that the alignment we obtain is meaningful and also gives new evidence that can be used to identify the role of genes with unknown function. Using our alignment methodology, we find at least 1279 grapevine probe sets with no current annotated function that are likely to be controlled in a developmental manner. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0634-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4472167 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44721672015-06-19 Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models Robinson, Sean Glonek, Garique Koch, Inge Thomas, Mark Davies, Christopher BMC Bioinformatics Research Article BACKGROUND: We consider data from a time course microarray experiment that was conducted on grapevines over the development cycle of the grape berries at two different vineyards in South Australia. Although the underlying biological process of berry development is the same at both vineyards, there are differences in the timing of the development due to local conditions. We aim to align the data from the two vineyards to enable an integrated analysis of the gene expression and use the alignment of the expression profiles to classify likely developmental function. RESULTS: We present a novel alignment method based on hidden Markov models (HMMs) and use the method to align the motivating grapevine data. We show that our alignment method is robust against subsets of profiles that are not suitable for alignment, investigate alignment diagnostics under the model and demonstrate the classification of developmentally driven genes. CONCLUSIONS: The classification of developmentally driven genes both validates that the alignment we obtain is meaningful and also gives new evidence that can be used to identify the role of genes with unknown function. Using our alignment methodology, we find at least 1279 grapevine probe sets with no current annotated function that are likely to be controlled in a developmental manner. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0634-9) contains supplementary material, which is available to authorized users. BioMed Central 2015-06-18 /pmc/articles/PMC4472167/ /pubmed/26084333 http://dx.doi.org/10.1186/s12859-015-0634-9 Text en © Robinson et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Robinson, Sean Glonek, Garique Koch, Inge Thomas, Mark Davies, Christopher Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models |
title | Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models |
title_full | Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models |
title_fullStr | Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models |
title_full_unstemmed | Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models |
title_short | Alignment of time course gene expression data and the classification of developmentally driven genes with hidden Markov models |
title_sort | alignment of time course gene expression data and the classification of developmentally driven genes with hidden markov models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4472167/ https://www.ncbi.nlm.nih.gov/pubmed/26084333 http://dx.doi.org/10.1186/s12859-015-0634-9 |
work_keys_str_mv | AT robinsonsean alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels AT glonekgarique alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels AT kochinge alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels AT thomasmark alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels AT davieschristopher alignmentoftimecoursegeneexpressiondataandtheclassificationofdevelopmentallydrivengeneswithhiddenmarkovmodels |