Cargando…

A model selection approach to discover age-dependent gene expression patterns using quantile regression models

BACKGROUND: It has been a long-standing biological challenge to understand the molecular regulatory mechanisms behind mammalian ageing. Harnessing the availability of many ageing microarray datasets, a number of studies have shown that it is possible to identify genes that have age-dependent differe...

Descripción completa

Detalles Bibliográficos
Autores principales: Ho, Joshua WK, Stefani, Maurizio, dos Remedios, Cristobal G, Charleston, Michael A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788368/
https://www.ncbi.nlm.nih.gov/pubmed/19958479
http://dx.doi.org/10.1186/1471-2164-10-S3-S16
_version_ 1782174965503623168
author Ho, Joshua WK
Stefani, Maurizio
dos Remedios, Cristobal G
Charleston, Michael A
author_facet Ho, Joshua WK
Stefani, Maurizio
dos Remedios, Cristobal G
Charleston, Michael A
author_sort Ho, Joshua WK
collection PubMed
description BACKGROUND: It has been a long-standing biological challenge to understand the molecular regulatory mechanisms behind mammalian ageing. Harnessing the availability of many ageing microarray datasets, a number of studies have shown that it is possible to identify genes that have age-dependent differential expression (DE) or differential variability (DV) patterns. The majority of the studies identify "interesting" genes using a linear regression approach, which is known to perform poorly in the presence of outliers or if the underlying age-dependent pattern is non-linear. Clearly a more robust and flexible approach is needed to identify genes with various age-dependent gene expression patterns. RESULTS: Here we present a novel model selection approach to discover genes with linear or non-linear age-dependent gene expression patterns from microarray data. To identify DE genes, our method fits three quantile regression models (constant, linear and piecewise linear models) to the expression profile of each gene, and selects the least complex model that best fits the available data. Similarly, DV genes are identified by fitting and comparing two quantile regression models (non-DV and the DV models) to the expression profile of each gene. We show that our approach is much more robust than the standard linear regression approach in discovering age-dependent patterns. We also applied our approach to analyze two human brain ageing datasets and found many biologically interesting gene expression patterns, including some very interesting DV patterns, that have been overlooked in the original studies. Furthermore, we propose that our model selection approach can be extended to discover DE and DV genes from microarray datasets with discrete class labels, by considering different quantile regression models. CONCLUSION: In this paper, we present a novel application of quantile regression models to identify genes that have interesting linear or non-linear age-dependent expression patterns. One important contribution of this paper is to introduce a model selection approach to DE and DV gene identification, which is most commonly tackled by null hypothesis testing approaches. We show that our approach is robust in analyzing real and simulated datasets. We believe that our approach is applicable in many ageing or time-series data analysis tasks.
format Text
id pubmed-2788368
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27883682009-12-04 A model selection approach to discover age-dependent gene expression patterns using quantile regression models Ho, Joshua WK Stefani, Maurizio dos Remedios, Cristobal G Charleston, Michael A BMC Genomics Proceedings BACKGROUND: It has been a long-standing biological challenge to understand the molecular regulatory mechanisms behind mammalian ageing. Harnessing the availability of many ageing microarray datasets, a number of studies have shown that it is possible to identify genes that have age-dependent differential expression (DE) or differential variability (DV) patterns. The majority of the studies identify "interesting" genes using a linear regression approach, which is known to perform poorly in the presence of outliers or if the underlying age-dependent pattern is non-linear. Clearly a more robust and flexible approach is needed to identify genes with various age-dependent gene expression patterns. RESULTS: Here we present a novel model selection approach to discover genes with linear or non-linear age-dependent gene expression patterns from microarray data. To identify DE genes, our method fits three quantile regression models (constant, linear and piecewise linear models) to the expression profile of each gene, and selects the least complex model that best fits the available data. Similarly, DV genes are identified by fitting and comparing two quantile regression models (non-DV and the DV models) to the expression profile of each gene. We show that our approach is much more robust than the standard linear regression approach in discovering age-dependent patterns. We also applied our approach to analyze two human brain ageing datasets and found many biologically interesting gene expression patterns, including some very interesting DV patterns, that have been overlooked in the original studies. Furthermore, we propose that our model selection approach can be extended to discover DE and DV genes from microarray datasets with discrete class labels, by considering different quantile regression models. CONCLUSION: In this paper, we present a novel application of quantile regression models to identify genes that have interesting linear or non-linear age-dependent expression patterns. One important contribution of this paper is to introduce a model selection approach to DE and DV gene identification, which is most commonly tackled by null hypothesis testing approaches. We show that our approach is robust in analyzing real and simulated datasets. We believe that our approach is applicable in many ageing or time-series data analysis tasks. BioMed Central 2009-12-03 /pmc/articles/PMC2788368/ /pubmed/19958479 http://dx.doi.org/10.1186/1471-2164-10-S3-S16 Text en Copyright ©2009 Ho et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Ho, Joshua WK
Stefani, Maurizio
dos Remedios, Cristobal G
Charleston, Michael A
A model selection approach to discover age-dependent gene expression patterns using quantile regression models
title A model selection approach to discover age-dependent gene expression patterns using quantile regression models
title_full A model selection approach to discover age-dependent gene expression patterns using quantile regression models
title_fullStr A model selection approach to discover age-dependent gene expression patterns using quantile regression models
title_full_unstemmed A model selection approach to discover age-dependent gene expression patterns using quantile regression models
title_short A model selection approach to discover age-dependent gene expression patterns using quantile regression models
title_sort model selection approach to discover age-dependent gene expression patterns using quantile regression models
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788368/
https://www.ncbi.nlm.nih.gov/pubmed/19958479
http://dx.doi.org/10.1186/1471-2164-10-S3-S16
work_keys_str_mv AT hojoshuawk amodelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels
AT stefanimaurizio amodelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels
AT dosremedioscristobalg amodelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels
AT charlestonmichaela amodelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels
AT hojoshuawk modelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels
AT stefanimaurizio modelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels
AT dosremedioscristobalg modelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels
AT charlestonmichaela modelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels