Cargando…

On the relevance of technical variation due to building pools in microarray experiments

BACKGROUND: Pooled samples are frequently used in experiments measuring gene expression. In this method, RNA from different individuals sharing the same experimental conditions and explanatory variables is blended and their concentrations are jointly measured. As a matter of principle, individuals a...

Descripción completa

Detalles Bibliográficos
Autores principales: Rudolf, Henrik, Nuernberg, Gerd, Koczan, Dirk, Vanselow, Jens, Gempe, Tanja, Beye, Martin, Leboulle, Gérard, Bienefeld, Kaspar, Reinsch, Norbert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4667463/
https://www.ncbi.nlm.nih.gov/pubmed/26628392
http://dx.doi.org/10.1186/s12864-015-2055-6
_version_ 1782403833419268096
author Rudolf, Henrik
Nuernberg, Gerd
Koczan, Dirk
Vanselow, Jens
Gempe, Tanja
Beye, Martin
Leboulle, Gérard
Bienefeld, Kaspar
Reinsch, Norbert
author_facet Rudolf, Henrik
Nuernberg, Gerd
Koczan, Dirk
Vanselow, Jens
Gempe, Tanja
Beye, Martin
Leboulle, Gérard
Bienefeld, Kaspar
Reinsch, Norbert
author_sort Rudolf, Henrik
collection PubMed
description BACKGROUND: Pooled samples are frequently used in experiments measuring gene expression. In this method, RNA from different individuals sharing the same experimental conditions and explanatory variables is blended and their concentrations are jointly measured. As a matter of principle, individuals are represented in equal shares in each pool. However, some degree of disproportionality may arise from the limits of technical precision. As a consequence a special kind of technical error occurs, which can be modelled by a respective variance component. Previously published theory - allowing for variable pool sizes - has been applied to four microarray gene expression data sets from different species in order to assess the practical relevance of this type of technical error in terms of significance and size of this variance component. RESULTS: The number of transcripts with a significant variance component due to imperfect blending was found to be 4329 (23 %) in mouse data and 7093 (49 %) in honey bees, but only 6 in rats and none whatsoever in human data. These results correspond to a false discovery rate of 5 % in each data set. The number of transcripts found to be differentially expressed between treatments was always higher when the blending error variance was neglected. Simulations clearly indicated overly-optimistic (anti-conservative) test results in terms of false discovery rates whenever this source of variability was not represented in the model. CONCLUSIONS: Imperfect equality of shares when blending RNA from different individuals into joint pools of variable size is a source of technical variation with relevance for experimental design, practice at the laboratory bench and data analysis. Its potentially adverse effects, incorrect identification of differentially expressed transcripts and overly-optimistic significance tests, can be fully avoided, however, by the sound application of recently established theory and models for data analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2055-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4667463
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46674632015-12-03 On the relevance of technical variation due to building pools in microarray experiments Rudolf, Henrik Nuernberg, Gerd Koczan, Dirk Vanselow, Jens Gempe, Tanja Beye, Martin Leboulle, Gérard Bienefeld, Kaspar Reinsch, Norbert BMC Genomics Methodology Article BACKGROUND: Pooled samples are frequently used in experiments measuring gene expression. In this method, RNA from different individuals sharing the same experimental conditions and explanatory variables is blended and their concentrations are jointly measured. As a matter of principle, individuals are represented in equal shares in each pool. However, some degree of disproportionality may arise from the limits of technical precision. As a consequence a special kind of technical error occurs, which can be modelled by a respective variance component. Previously published theory - allowing for variable pool sizes - has been applied to four microarray gene expression data sets from different species in order to assess the practical relevance of this type of technical error in terms of significance and size of this variance component. RESULTS: The number of transcripts with a significant variance component due to imperfect blending was found to be 4329 (23 %) in mouse data and 7093 (49 %) in honey bees, but only 6 in rats and none whatsoever in human data. These results correspond to a false discovery rate of 5 % in each data set. The number of transcripts found to be differentially expressed between treatments was always higher when the blending error variance was neglected. Simulations clearly indicated overly-optimistic (anti-conservative) test results in terms of false discovery rates whenever this source of variability was not represented in the model. CONCLUSIONS: Imperfect equality of shares when blending RNA from different individuals into joint pools of variable size is a source of technical variation with relevance for experimental design, practice at the laboratory bench and data analysis. Its potentially adverse effects, incorrect identification of differentially expressed transcripts and overly-optimistic significance tests, can be fully avoided, however, by the sound application of recently established theory and models for data analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2055-6) contains supplementary material, which is available to authorized users. BioMed Central 2015-12-01 /pmc/articles/PMC4667463/ /pubmed/26628392 http://dx.doi.org/10.1186/s12864-015-2055-6 Text en © Rudolf et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Rudolf, Henrik
Nuernberg, Gerd
Koczan, Dirk
Vanselow, Jens
Gempe, Tanja
Beye, Martin
Leboulle, Gérard
Bienefeld, Kaspar
Reinsch, Norbert
On the relevance of technical variation due to building pools in microarray experiments
title On the relevance of technical variation due to building pools in microarray experiments
title_full On the relevance of technical variation due to building pools in microarray experiments
title_fullStr On the relevance of technical variation due to building pools in microarray experiments
title_full_unstemmed On the relevance of technical variation due to building pools in microarray experiments
title_short On the relevance of technical variation due to building pools in microarray experiments
title_sort on the relevance of technical variation due to building pools in microarray experiments
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4667463/
https://www.ncbi.nlm.nih.gov/pubmed/26628392
http://dx.doi.org/10.1186/s12864-015-2055-6
work_keys_str_mv AT rudolfhenrik ontherelevanceoftechnicalvariationduetobuildingpoolsinmicroarrayexperiments
AT nuernberggerd ontherelevanceoftechnicalvariationduetobuildingpoolsinmicroarrayexperiments
AT koczandirk ontherelevanceoftechnicalvariationduetobuildingpoolsinmicroarrayexperiments
AT vanselowjens ontherelevanceoftechnicalvariationduetobuildingpoolsinmicroarrayexperiments
AT gempetanja ontherelevanceoftechnicalvariationduetobuildingpoolsinmicroarrayexperiments
AT beyemartin ontherelevanceoftechnicalvariationduetobuildingpoolsinmicroarrayexperiments
AT leboullegerard ontherelevanceoftechnicalvariationduetobuildingpoolsinmicroarrayexperiments
AT bienefeldkaspar ontherelevanceoftechnicalvariationduetobuildingpoolsinmicroarrayexperiments
AT reinschnorbert ontherelevanceoftechnicalvariationduetobuildingpoolsinmicroarrayexperiments