Cargando…

variancePartition: interpreting drivers of variation in complex gene expression studies

BACKGROUND: As large-scale studies of gene expression with multiple sources of biological and technical variation become widely adopted, characterizing these drivers of variation becomes essential to understanding disease biology and regulatory genetics. RESULTS: We describe a statistical and visual...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoffman, Gabriel E., Schadt, Eric E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5123296/
https://www.ncbi.nlm.nih.gov/pubmed/27884101
http://dx.doi.org/10.1186/s12859-016-1323-z
_version_ 1782469705314861056
author Hoffman, Gabriel E.
Schadt, Eric E.
author_facet Hoffman, Gabriel E.
Schadt, Eric E.
author_sort Hoffman, Gabriel E.
collection PubMed
description BACKGROUND: As large-scale studies of gene expression with multiple sources of biological and technical variation become widely adopted, characterizing these drivers of variation becomes essential to understanding disease biology and regulatory genetics. RESULTS: We describe a statistical and visualization framework, variancePartition, to prioritize drivers of variation based on a genome-wide summary, and identify genes that deviate from the genome-wide trend. Using a linear mixed model, variancePartition quantifies variation in each expression trait attributable to differences in disease status, sex, cell or tissue type, ancestry, genetic background, experimental stimulus, or technical variables. Analysis of four large-scale transcriptome profiling datasets illustrates that variancePartition recovers striking patterns of biological and technical variation that are reproducible across multiple datasets. CONCLUSIONS: Our open source software, variancePartition, enables rapid interpretation of complex gene expression studies as well as other high-throughput genomics assays. variancePartition is available from Bioconductor: http://bioconductor.org/packages/variancePartition. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1323-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5123296
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-51232962016-12-06 variancePartition: interpreting drivers of variation in complex gene expression studies Hoffman, Gabriel E. Schadt, Eric E. BMC Bioinformatics Software BACKGROUND: As large-scale studies of gene expression with multiple sources of biological and technical variation become widely adopted, characterizing these drivers of variation becomes essential to understanding disease biology and regulatory genetics. RESULTS: We describe a statistical and visualization framework, variancePartition, to prioritize drivers of variation based on a genome-wide summary, and identify genes that deviate from the genome-wide trend. Using a linear mixed model, variancePartition quantifies variation in each expression trait attributable to differences in disease status, sex, cell or tissue type, ancestry, genetic background, experimental stimulus, or technical variables. Analysis of four large-scale transcriptome profiling datasets illustrates that variancePartition recovers striking patterns of biological and technical variation that are reproducible across multiple datasets. CONCLUSIONS: Our open source software, variancePartition, enables rapid interpretation of complex gene expression studies as well as other high-throughput genomics assays. variancePartition is available from Bioconductor: http://bioconductor.org/packages/variancePartition. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1323-z) contains supplementary material, which is available to authorized users. BioMed Central 2016-11-25 /pmc/articles/PMC5123296/ /pubmed/27884101 http://dx.doi.org/10.1186/s12859-016-1323-z Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Hoffman, Gabriel E.
Schadt, Eric E.
variancePartition: interpreting drivers of variation in complex gene expression studies
title variancePartition: interpreting drivers of variation in complex gene expression studies
title_full variancePartition: interpreting drivers of variation in complex gene expression studies
title_fullStr variancePartition: interpreting drivers of variation in complex gene expression studies
title_full_unstemmed variancePartition: interpreting drivers of variation in complex gene expression studies
title_short variancePartition: interpreting drivers of variation in complex gene expression studies
title_sort variancepartition: interpreting drivers of variation in complex gene expression studies
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5123296/
https://www.ncbi.nlm.nih.gov/pubmed/27884101
http://dx.doi.org/10.1186/s12859-016-1323-z
work_keys_str_mv AT hoffmangabriele variancepartitioninterpretingdriversofvariationincomplexgeneexpressionstudies
AT schadterice variancepartitioninterpretingdriversofvariationincomplexgeneexpressionstudies