Cargando…

Using mixtures of biological samples as process controls for RNA-sequencing experiments

BACKGROUND: Genome-scale “-omics” measurements are challenging to benchmark due to the enormous variety of unique biological molecules involved. Mixtures of previously-characterized samples can be used to benchmark repeatability and reproducibility using component proportions as truth for the measur...

Descripción completa

Detalles Bibliográficos
Autores principales: Parsons, Jerod, Munro, Sarah, Pine, P. Scott, McDaniel, Jennifer, Mehaffey, Michele, Salit, Marc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4574543/
https://www.ncbi.nlm.nih.gov/pubmed/26383878
http://dx.doi.org/10.1186/s12864-015-1912-7
_version_ 1782390645663465472
author Parsons, Jerod
Munro, Sarah
Pine, P. Scott
McDaniel, Jennifer
Mehaffey, Michele
Salit, Marc
author_facet Parsons, Jerod
Munro, Sarah
Pine, P. Scott
McDaniel, Jennifer
Mehaffey, Michele
Salit, Marc
author_sort Parsons, Jerod
collection PubMed
description BACKGROUND: Genome-scale “-omics” measurements are challenging to benchmark due to the enormous variety of unique biological molecules involved. Mixtures of previously-characterized samples can be used to benchmark repeatability and reproducibility using component proportions as truth for the measurement. We describe and evaluate experiments characterizing the performance of RNA-sequencing (RNA-Seq) measurements, and discuss cases where mixtures can serve as effective process controls. RESULTS: We apply a linear model to total RNA mixture samples in RNA-seq experiments. This model provides a context for performance benchmarking. The parameters of the model fit to experimental results can be evaluated to assess bias and variability of the measurement of a mixture. A linear model describes the behavior of mixture expression measures and provides a context for performance benchmarking. Residuals from fitting the model to experimental data can be used as a metric for evaluating the effect that an individual step in an experimental process has on the linear response function and precision of the underlying measurement while identifying signals affected by interference from other sources. Effective benchmarking requires well-defined mixtures, which for RNA-Seq requires knowledge of the post-enrichment ‘target RNA’ content of the individual total RNA components. We demonstrate and evaluate an experimental method suitable for use in genome-scale process control and lay out a method utilizing spike-in controls to determine enriched RNA content of total RNA in samples. CONCLUSIONS: Genome-scale process controls can be derived from mixtures. These controls relate prior knowledge of individual components to a complex mixture, allowing assessment of measurement performance. The target RNA fraction accounts for differential selection of RNA out of variable total RNA samples. Spike-in controls can be utilized to measure this relationship between target RNA content and input total RNA. Our mixture analysis method also enables estimation of the proportions of an unknown mixture, even when component-specific markers are not previously known, whenever pure components are measured alongside the mixture. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1912-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4574543
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45745432015-09-19 Using mixtures of biological samples as process controls for RNA-sequencing experiments Parsons, Jerod Munro, Sarah Pine, P. Scott McDaniel, Jennifer Mehaffey, Michele Salit, Marc BMC Genomics Methodology Article BACKGROUND: Genome-scale “-omics” measurements are challenging to benchmark due to the enormous variety of unique biological molecules involved. Mixtures of previously-characterized samples can be used to benchmark repeatability and reproducibility using component proportions as truth for the measurement. We describe and evaluate experiments characterizing the performance of RNA-sequencing (RNA-Seq) measurements, and discuss cases where mixtures can serve as effective process controls. RESULTS: We apply a linear model to total RNA mixture samples in RNA-seq experiments. This model provides a context for performance benchmarking. The parameters of the model fit to experimental results can be evaluated to assess bias and variability of the measurement of a mixture. A linear model describes the behavior of mixture expression measures and provides a context for performance benchmarking. Residuals from fitting the model to experimental data can be used as a metric for evaluating the effect that an individual step in an experimental process has on the linear response function and precision of the underlying measurement while identifying signals affected by interference from other sources. Effective benchmarking requires well-defined mixtures, which for RNA-Seq requires knowledge of the post-enrichment ‘target RNA’ content of the individual total RNA components. We demonstrate and evaluate an experimental method suitable for use in genome-scale process control and lay out a method utilizing spike-in controls to determine enriched RNA content of total RNA in samples. CONCLUSIONS: Genome-scale process controls can be derived from mixtures. These controls relate prior knowledge of individual components to a complex mixture, allowing assessment of measurement performance. The target RNA fraction accounts for differential selection of RNA out of variable total RNA samples. Spike-in controls can be utilized to measure this relationship between target RNA content and input total RNA. Our mixture analysis method also enables estimation of the proportions of an unknown mixture, even when component-specific markers are not previously known, whenever pure components are measured alongside the mixture. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1912-7) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-17 /pmc/articles/PMC4574543/ /pubmed/26383878 http://dx.doi.org/10.1186/s12864-015-1912-7 Text en © Parsons et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Parsons, Jerod
Munro, Sarah
Pine, P. Scott
McDaniel, Jennifer
Mehaffey, Michele
Salit, Marc
Using mixtures of biological samples as process controls for RNA-sequencing experiments
title Using mixtures of biological samples as process controls for RNA-sequencing experiments
title_full Using mixtures of biological samples as process controls for RNA-sequencing experiments
title_fullStr Using mixtures of biological samples as process controls for RNA-sequencing experiments
title_full_unstemmed Using mixtures of biological samples as process controls for RNA-sequencing experiments
title_short Using mixtures of biological samples as process controls for RNA-sequencing experiments
title_sort using mixtures of biological samples as process controls for rna-sequencing experiments
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4574543/
https://www.ncbi.nlm.nih.gov/pubmed/26383878
http://dx.doi.org/10.1186/s12864-015-1912-7
work_keys_str_mv AT parsonsjerod usingmixturesofbiologicalsamplesasprocesscontrolsforrnasequencingexperiments
AT munrosarah usingmixturesofbiologicalsamplesasprocesscontrolsforrnasequencingexperiments
AT pinepscott usingmixturesofbiologicalsamplesasprocesscontrolsforrnasequencingexperiments
AT mcdanieljennifer usingmixturesofbiologicalsamplesasprocesscontrolsforrnasequencingexperiments
AT mehaffeymichele usingmixturesofbiologicalsamplesasprocesscontrolsforrnasequencingexperiments
AT salitmarc usingmixturesofbiologicalsamplesasprocesscontrolsforrnasequencingexperiments