Cargando…
Differential expression analysis with global network adjustment
BACKGROUND: Large-scale chromosomal deletions or other non-specific perturbations of the transcriptome can alter the expression of hundreds or thousands of genes, and it is of biological interest to understand which genes are most profoundly affected. We present a method for predicting a gene’s expr...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3766173/ https://www.ncbi.nlm.nih.gov/pubmed/23968143 http://dx.doi.org/10.1186/1471-2105-14-258 |
_version_ | 1782283481520275456 |
---|---|
author | Gelfond, Jonathan A Ibrahim, Joseph G Gupta, Mayetri Chen, Ming-Hui Cody, Jannine D |
author_facet | Gelfond, Jonathan A Ibrahim, Joseph G Gupta, Mayetri Chen, Ming-Hui Cody, Jannine D |
author_sort | Gelfond, Jonathan A |
collection | PubMed |
description | BACKGROUND: Large-scale chromosomal deletions or other non-specific perturbations of the transcriptome can alter the expression of hundreds or thousands of genes, and it is of biological interest to understand which genes are most profoundly affected. We present a method for predicting a gene’s expression as a function of other genes thereby accounting for the effect of transcriptional regulation that confounds the identification of genes differentially expressed relative to a regulatory network. The challenge in constructing such models is that the number of possible regulator transcripts within a global network is on the order of thousands, and the number of biological samples is typically on the order of 10. Nevertheless, there are large gene expression databases that can be used to construct networks that could be helpful in modeling transcriptional regulation in smaller experiments. RESULTS: We demonstrate a type of penalized regression model that can be estimated from large gene expression databases, and then applied to smaller experiments. The ridge parameter is selected by minimizing the cross-validation error of the predictions in the independent out-sample. This tends to increase the model stability and leads to a much greater degree of parameter shrinkage, but the resulting biased estimation is mitigated by a second round of regression. Nevertheless, the proposed computationally efficient “over-shrinkage” method outperforms previously used LASSO-based techniques. In two independent datasets, we find that the median proportion of explained variability in expression is approximately 25%, and this results in a substantial increase in the signal-to-noise ratio allowing more powerful inferences on differential gene expression leading to biologically intuitive findings. We also show that a large proportion of gene dependencies are conditional on the biological state, which would be impossible with standard differential expression methods. CONCLUSIONS: By adjusting for the effects of the global network on individual genes, both the sensitivity and reliability of differential expression measures are greatly improved. |
format | Online Article Text |
id | pubmed-3766173 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-37661732013-09-12 Differential expression analysis with global network adjustment Gelfond, Jonathan A Ibrahim, Joseph G Gupta, Mayetri Chen, Ming-Hui Cody, Jannine D BMC Bioinformatics Methodology Article BACKGROUND: Large-scale chromosomal deletions or other non-specific perturbations of the transcriptome can alter the expression of hundreds or thousands of genes, and it is of biological interest to understand which genes are most profoundly affected. We present a method for predicting a gene’s expression as a function of other genes thereby accounting for the effect of transcriptional regulation that confounds the identification of genes differentially expressed relative to a regulatory network. The challenge in constructing such models is that the number of possible regulator transcripts within a global network is on the order of thousands, and the number of biological samples is typically on the order of 10. Nevertheless, there are large gene expression databases that can be used to construct networks that could be helpful in modeling transcriptional regulation in smaller experiments. RESULTS: We demonstrate a type of penalized regression model that can be estimated from large gene expression databases, and then applied to smaller experiments. The ridge parameter is selected by minimizing the cross-validation error of the predictions in the independent out-sample. This tends to increase the model stability and leads to a much greater degree of parameter shrinkage, but the resulting biased estimation is mitigated by a second round of regression. Nevertheless, the proposed computationally efficient “over-shrinkage” method outperforms previously used LASSO-based techniques. In two independent datasets, we find that the median proportion of explained variability in expression is approximately 25%, and this results in a substantial increase in the signal-to-noise ratio allowing more powerful inferences on differential gene expression leading to biologically intuitive findings. We also show that a large proportion of gene dependencies are conditional on the biological state, which would be impossible with standard differential expression methods. CONCLUSIONS: By adjusting for the effects of the global network on individual genes, both the sensitivity and reliability of differential expression measures are greatly improved. BioMed Central 2013-08-23 /pmc/articles/PMC3766173/ /pubmed/23968143 http://dx.doi.org/10.1186/1471-2105-14-258 Text en Copyright © 2013 Gelfond et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Gelfond, Jonathan A Ibrahim, Joseph G Gupta, Mayetri Chen, Ming-Hui Cody, Jannine D Differential expression analysis with global network adjustment |
title | Differential expression analysis with global network adjustment |
title_full | Differential expression analysis with global network adjustment |
title_fullStr | Differential expression analysis with global network adjustment |
title_full_unstemmed | Differential expression analysis with global network adjustment |
title_short | Differential expression analysis with global network adjustment |
title_sort | differential expression analysis with global network adjustment |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3766173/ https://www.ncbi.nlm.nih.gov/pubmed/23968143 http://dx.doi.org/10.1186/1471-2105-14-258 |
work_keys_str_mv | AT gelfondjonathana differentialexpressionanalysiswithglobalnetworkadjustment AT ibrahimjosephg differentialexpressionanalysiswithglobalnetworkadjustment AT guptamayetri differentialexpressionanalysiswithglobalnetworkadjustment AT chenminghui differentialexpressionanalysiswithglobalnetworkadjustment AT codyjannined differentialexpressionanalysiswithglobalnetworkadjustment |