Cargando…

Statistical modelling of transcript profiles of differentially regulated genes

BACKGROUND: The vast quantities of gene expression profiling data produced in microarray studies, and the more precise quantitative PCR, are often not statistically analysed to their full potential. Previous studies have summarised gene expression profiles using simple descriptive statistics, basic...

Descripción completa

Detalles Bibliográficos
Autores principales: Eastwood, Daniel C, Mead, Andrew, Sergeant, Martin J, Burton, Kerry S
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2525656/
https://www.ncbi.nlm.nih.gov/pubmed/18651954
http://dx.doi.org/10.1186/1471-2199-9-66
_version_ 1782158681497927680
author Eastwood, Daniel C
Mead, Andrew
Sergeant, Martin J
Burton, Kerry S
author_facet Eastwood, Daniel C
Mead, Andrew
Sergeant, Martin J
Burton, Kerry S
author_sort Eastwood, Daniel C
collection PubMed
description BACKGROUND: The vast quantities of gene expression profiling data produced in microarray studies, and the more precise quantitative PCR, are often not statistically analysed to their full potential. Previous studies have summarised gene expression profiles using simple descriptive statistics, basic analysis of variance (ANOVA) and the clustering of genes based on simple models fitted to their expression profiles over time. We report the novel application of statistical non-linear regression modelling techniques to describe the shapes of expression profiles for the fungus Agaricus bisporus, quantified by PCR, and for E. coli and Rattus norvegicus, using microarray technology. The use of parametric non-linear regression models provides a more precise description of expression profiles, reducing the "noise" of the raw data to produce a clear "signal" given by the fitted curve, and describing each profile with a small number of biologically interpretable parameters. This approach then allows the direct comparison and clustering of the shapes of response patterns between genes and potentially enables a greater exploration and interpretation of the biological processes driving gene expression. RESULTS: Quantitative reverse transcriptase PCR-derived time-course data of genes were modelled. "Split-line" or "broken-stick" regression identified the initial time of gene up-regulation, enabling the classification of genes into those with primary and secondary responses. Five-day profiles were modelled using the biologically-oriented, critical exponential curve, y(t) = A + (B + Ct)R(t )+ ε. This non-linear regression approach allowed the expression patterns for different genes to be compared in terms of curve shape, time of maximal transcript level and the decline and asymptotic response levels. Three distinct regulatory patterns were identified for the five genes studied. Applying the regression modelling approach to microarray-derived time course data allowed 11% of the Escherichia coli features to be fitted by an exponential function, and 25% of the Rattus norvegicus features could be described by the critical exponential model, all with statistical significance of p < 0.05. CONCLUSION: The statistical non-linear regression approaches presented in this study provide detailed biologically oriented descriptions of individual gene expression profiles, using biologically variable data to generate a set of defining parameters. These approaches have application to the modelling and greater interpretation of profiles obtained across a wide range of platforms, such as microarrays. Through careful choice of appropriate model forms, such statistical regression approaches allow an improved comparison of gene expression profiles, and may provide an approach for the greater understanding of common regulatory mechanisms between genes.
format Text
id pubmed-2525656
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25256562008-08-27 Statistical modelling of transcript profiles of differentially regulated genes Eastwood, Daniel C Mead, Andrew Sergeant, Martin J Burton, Kerry S BMC Mol Biol Research Article BACKGROUND: The vast quantities of gene expression profiling data produced in microarray studies, and the more precise quantitative PCR, are often not statistically analysed to their full potential. Previous studies have summarised gene expression profiles using simple descriptive statistics, basic analysis of variance (ANOVA) and the clustering of genes based on simple models fitted to their expression profiles over time. We report the novel application of statistical non-linear regression modelling techniques to describe the shapes of expression profiles for the fungus Agaricus bisporus, quantified by PCR, and for E. coli and Rattus norvegicus, using microarray technology. The use of parametric non-linear regression models provides a more precise description of expression profiles, reducing the "noise" of the raw data to produce a clear "signal" given by the fitted curve, and describing each profile with a small number of biologically interpretable parameters. This approach then allows the direct comparison and clustering of the shapes of response patterns between genes and potentially enables a greater exploration and interpretation of the biological processes driving gene expression. RESULTS: Quantitative reverse transcriptase PCR-derived time-course data of genes were modelled. "Split-line" or "broken-stick" regression identified the initial time of gene up-regulation, enabling the classification of genes into those with primary and secondary responses. Five-day profiles were modelled using the biologically-oriented, critical exponential curve, y(t) = A + (B + Ct)R(t )+ ε. This non-linear regression approach allowed the expression patterns for different genes to be compared in terms of curve shape, time of maximal transcript level and the decline and asymptotic response levels. Three distinct regulatory patterns were identified for the five genes studied. Applying the regression modelling approach to microarray-derived time course data allowed 11% of the Escherichia coli features to be fitted by an exponential function, and 25% of the Rattus norvegicus features could be described by the critical exponential model, all with statistical significance of p < 0.05. CONCLUSION: The statistical non-linear regression approaches presented in this study provide detailed biologically oriented descriptions of individual gene expression profiles, using biologically variable data to generate a set of defining parameters. These approaches have application to the modelling and greater interpretation of profiles obtained across a wide range of platforms, such as microarrays. Through careful choice of appropriate model forms, such statistical regression approaches allow an improved comparison of gene expression profiles, and may provide an approach for the greater understanding of common regulatory mechanisms between genes. BioMed Central 2008-07-23 /pmc/articles/PMC2525656/ /pubmed/18651954 http://dx.doi.org/10.1186/1471-2199-9-66 Text en Copyright © 2008 Eastwood et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Eastwood, Daniel C
Mead, Andrew
Sergeant, Martin J
Burton, Kerry S
Statistical modelling of transcript profiles of differentially regulated genes
title Statistical modelling of transcript profiles of differentially regulated genes
title_full Statistical modelling of transcript profiles of differentially regulated genes
title_fullStr Statistical modelling of transcript profiles of differentially regulated genes
title_full_unstemmed Statistical modelling of transcript profiles of differentially regulated genes
title_short Statistical modelling of transcript profiles of differentially regulated genes
title_sort statistical modelling of transcript profiles of differentially regulated genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2525656/
https://www.ncbi.nlm.nih.gov/pubmed/18651954
http://dx.doi.org/10.1186/1471-2199-9-66
work_keys_str_mv AT eastwooddanielc statisticalmodellingoftranscriptprofilesofdifferentiallyregulatedgenes
AT meadandrew statisticalmodellingoftranscriptprofilesofdifferentiallyregulatedgenes
AT sergeantmartinj statisticalmodellingoftranscriptprofilesofdifferentiallyregulatedgenes
AT burtonkerrys statisticalmodellingoftranscriptprofilesofdifferentiallyregulatedgenes