Cargando…

A method to identify differential expression profiles of time-course gene data with Fourier transformation

BACKGROUND: Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Jaehee, Ogden, Robert Todd, Kim, Haseong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015127/
https://www.ncbi.nlm.nih.gov/pubmed/24134721
http://dx.doi.org/10.1186/1471-2105-14-310
_version_ 1782315285348352000
author Kim, Jaehee
Ogden, Robert Todd
Kim, Haseong
author_facet Kim, Jaehee
Ogden, Robert Todd
Kim, Haseong
author_sort Kim, Jaehee
collection PubMed
description BACKGROUND: Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. RESULTS: This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization. The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. CONCLUSIONS: Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be potentially used to identify genes which have the same patterns or biological processes, and help facing the present and forthcoming challenges of data analysis in functional genomics.
format Online
Article
Text
id pubmed-4015127
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40151272014-05-23 A method to identify differential expression profiles of time-course gene data with Fourier transformation Kim, Jaehee Ogden, Robert Todd Kim, Haseong BMC Bioinformatics Research Article BACKGROUND: Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. RESULTS: This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization. The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. CONCLUSIONS: Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be potentially used to identify genes which have the same patterns or biological processes, and help facing the present and forthcoming challenges of data analysis in functional genomics. BioMed Central 2013-10-18 /pmc/articles/PMC4015127/ /pubmed/24134721 http://dx.doi.org/10.1186/1471-2105-14-310 Text en Copyright © 2013 Kim et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kim, Jaehee
Ogden, Robert Todd
Kim, Haseong
A method to identify differential expression profiles of time-course gene data with Fourier transformation
title A method to identify differential expression profiles of time-course gene data with Fourier transformation
title_full A method to identify differential expression profiles of time-course gene data with Fourier transformation
title_fullStr A method to identify differential expression profiles of time-course gene data with Fourier transformation
title_full_unstemmed A method to identify differential expression profiles of time-course gene data with Fourier transformation
title_short A method to identify differential expression profiles of time-course gene data with Fourier transformation
title_sort method to identify differential expression profiles of time-course gene data with fourier transformation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015127/
https://www.ncbi.nlm.nih.gov/pubmed/24134721
http://dx.doi.org/10.1186/1471-2105-14-310
work_keys_str_mv AT kimjaehee amethodtoidentifydifferentialexpressionprofilesoftimecoursegenedatawithfouriertransformation
AT ogdenroberttodd amethodtoidentifydifferentialexpressionprofilesoftimecoursegenedatawithfouriertransformation
AT kimhaseong amethodtoidentifydifferentialexpressionprofilesoftimecoursegenedatawithfouriertransformation
AT kimjaehee methodtoidentifydifferentialexpressionprofilesoftimecoursegenedatawithfouriertransformation
AT ogdenroberttodd methodtoidentifydifferentialexpressionprofilesoftimecoursegenedatawithfouriertransformation
AT kimhaseong methodtoidentifydifferentialexpressionprofilesoftimecoursegenedatawithfouriertransformation