Cargando…
A model selection approach to discover age-dependent gene expression patterns using quantile regression models
BACKGROUND: It has been a long-standing biological challenge to understand the molecular regulatory mechanisms behind mammalian ageing. Harnessing the availability of many ageing microarray datasets, a number of studies have shown that it is possible to identify genes that have age-dependent differe...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788368/ https://www.ncbi.nlm.nih.gov/pubmed/19958479 http://dx.doi.org/10.1186/1471-2164-10-S3-S16 |
_version_ | 1782174965503623168 |
---|---|
author | Ho, Joshua WK Stefani, Maurizio dos Remedios, Cristobal G Charleston, Michael A |
author_facet | Ho, Joshua WK Stefani, Maurizio dos Remedios, Cristobal G Charleston, Michael A |
author_sort | Ho, Joshua WK |
collection | PubMed |
description | BACKGROUND: It has been a long-standing biological challenge to understand the molecular regulatory mechanisms behind mammalian ageing. Harnessing the availability of many ageing microarray datasets, a number of studies have shown that it is possible to identify genes that have age-dependent differential expression (DE) or differential variability (DV) patterns. The majority of the studies identify "interesting" genes using a linear regression approach, which is known to perform poorly in the presence of outliers or if the underlying age-dependent pattern is non-linear. Clearly a more robust and flexible approach is needed to identify genes with various age-dependent gene expression patterns. RESULTS: Here we present a novel model selection approach to discover genes with linear or non-linear age-dependent gene expression patterns from microarray data. To identify DE genes, our method fits three quantile regression models (constant, linear and piecewise linear models) to the expression profile of each gene, and selects the least complex model that best fits the available data. Similarly, DV genes are identified by fitting and comparing two quantile regression models (non-DV and the DV models) to the expression profile of each gene. We show that our approach is much more robust than the standard linear regression approach in discovering age-dependent patterns. We also applied our approach to analyze two human brain ageing datasets and found many biologically interesting gene expression patterns, including some very interesting DV patterns, that have been overlooked in the original studies. Furthermore, we propose that our model selection approach can be extended to discover DE and DV genes from microarray datasets with discrete class labels, by considering different quantile regression models. CONCLUSION: In this paper, we present a novel application of quantile regression models to identify genes that have interesting linear or non-linear age-dependent expression patterns. One important contribution of this paper is to introduce a model selection approach to DE and DV gene identification, which is most commonly tackled by null hypothesis testing approaches. We show that our approach is robust in analyzing real and simulated datasets. We believe that our approach is applicable in many ageing or time-series data analysis tasks. |
format | Text |
id | pubmed-2788368 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27883682009-12-04 A model selection approach to discover age-dependent gene expression patterns using quantile regression models Ho, Joshua WK Stefani, Maurizio dos Remedios, Cristobal G Charleston, Michael A BMC Genomics Proceedings BACKGROUND: It has been a long-standing biological challenge to understand the molecular regulatory mechanisms behind mammalian ageing. Harnessing the availability of many ageing microarray datasets, a number of studies have shown that it is possible to identify genes that have age-dependent differential expression (DE) or differential variability (DV) patterns. The majority of the studies identify "interesting" genes using a linear regression approach, which is known to perform poorly in the presence of outliers or if the underlying age-dependent pattern is non-linear. Clearly a more robust and flexible approach is needed to identify genes with various age-dependent gene expression patterns. RESULTS: Here we present a novel model selection approach to discover genes with linear or non-linear age-dependent gene expression patterns from microarray data. To identify DE genes, our method fits three quantile regression models (constant, linear and piecewise linear models) to the expression profile of each gene, and selects the least complex model that best fits the available data. Similarly, DV genes are identified by fitting and comparing two quantile regression models (non-DV and the DV models) to the expression profile of each gene. We show that our approach is much more robust than the standard linear regression approach in discovering age-dependent patterns. We also applied our approach to analyze two human brain ageing datasets and found many biologically interesting gene expression patterns, including some very interesting DV patterns, that have been overlooked in the original studies. Furthermore, we propose that our model selection approach can be extended to discover DE and DV genes from microarray datasets with discrete class labels, by considering different quantile regression models. CONCLUSION: In this paper, we present a novel application of quantile regression models to identify genes that have interesting linear or non-linear age-dependent expression patterns. One important contribution of this paper is to introduce a model selection approach to DE and DV gene identification, which is most commonly tackled by null hypothesis testing approaches. We show that our approach is robust in analyzing real and simulated datasets. We believe that our approach is applicable in many ageing or time-series data analysis tasks. BioMed Central 2009-12-03 /pmc/articles/PMC2788368/ /pubmed/19958479 http://dx.doi.org/10.1186/1471-2164-10-S3-S16 Text en Copyright ©2009 Ho et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Ho, Joshua WK Stefani, Maurizio dos Remedios, Cristobal G Charleston, Michael A A model selection approach to discover age-dependent gene expression patterns using quantile regression models |
title | A model selection approach to discover age-dependent gene expression patterns using quantile regression models |
title_full | A model selection approach to discover age-dependent gene expression patterns using quantile regression models |
title_fullStr | A model selection approach to discover age-dependent gene expression patterns using quantile regression models |
title_full_unstemmed | A model selection approach to discover age-dependent gene expression patterns using quantile regression models |
title_short | A model selection approach to discover age-dependent gene expression patterns using quantile regression models |
title_sort | model selection approach to discover age-dependent gene expression patterns using quantile regression models |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788368/ https://www.ncbi.nlm.nih.gov/pubmed/19958479 http://dx.doi.org/10.1186/1471-2164-10-S3-S16 |
work_keys_str_mv | AT hojoshuawk amodelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels AT stefanimaurizio amodelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels AT dosremedioscristobalg amodelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels AT charlestonmichaela amodelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels AT hojoshuawk modelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels AT stefanimaurizio modelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels AT dosremedioscristobalg modelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels AT charlestonmichaela modelselectionapproachtodiscoveragedependentgeneexpressionpatternsusingquantileregressionmodels |