Cargando…
AuPairWise: A Method to Estimate RNA-Seq Replicability through Co-expression
In addition to detecting novel transcripts and higher dynamic range, a principal claim for RNA-sequencing has been greater replicability, typically measured in sample-sample correlations of gene expression levels. Through a re-analysis of ENCODE data, we show that replicability of transcript abundan...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4833304/ https://www.ncbi.nlm.nih.gov/pubmed/27082953 http://dx.doi.org/10.1371/journal.pcbi.1004868 |
_version_ | 1782427333912690688 |
---|---|
author | Ballouz, Sara Gillis, Jesse |
author_facet | Ballouz, Sara Gillis, Jesse |
author_sort | Ballouz, Sara |
collection | PubMed |
description | In addition to detecting novel transcripts and higher dynamic range, a principal claim for RNA-sequencing has been greater replicability, typically measured in sample-sample correlations of gene expression levels. Through a re-analysis of ENCODE data, we show that replicability of transcript abundances will provide misleading estimates of the replicability of conditional variation in transcript abundances (i.e., most expression experiments). Heuristics which implicitly address this problem have emerged in quality control measures to obtain ‘good’ differential expression results. However, these methods involve strict filters such as discarding low expressing genes or using technical replicates to remove discordant transcripts, and are costly or simply ad hoc. As an alternative, we model gene-level replicability of differential activity using co-expressing genes. We find that sets of housekeeping interactions provide a sensitive means of estimating the replicability of expression changes, where the co-expressing pair can be regarded as pseudo-replicates of one another. We model the effects of noise that perturbs a gene’s expression within its usual distribution of values and show that perturbing expression by only 5% within that range is readily detectable (AUROC~0.73). We have made our method available as a set of easily implemented R scripts. |
format | Online Article Text |
id | pubmed-4833304 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-48333042016-04-22 AuPairWise: A Method to Estimate RNA-Seq Replicability through Co-expression Ballouz, Sara Gillis, Jesse PLoS Comput Biol Research Article In addition to detecting novel transcripts and higher dynamic range, a principal claim for RNA-sequencing has been greater replicability, typically measured in sample-sample correlations of gene expression levels. Through a re-analysis of ENCODE data, we show that replicability of transcript abundances will provide misleading estimates of the replicability of conditional variation in transcript abundances (i.e., most expression experiments). Heuristics which implicitly address this problem have emerged in quality control measures to obtain ‘good’ differential expression results. However, these methods involve strict filters such as discarding low expressing genes or using technical replicates to remove discordant transcripts, and are costly or simply ad hoc. As an alternative, we model gene-level replicability of differential activity using co-expressing genes. We find that sets of housekeeping interactions provide a sensitive means of estimating the replicability of expression changes, where the co-expressing pair can be regarded as pseudo-replicates of one another. We model the effects of noise that perturbs a gene’s expression within its usual distribution of values and show that perturbing expression by only 5% within that range is readily detectable (AUROC~0.73). We have made our method available as a set of easily implemented R scripts. Public Library of Science 2016-04-15 /pmc/articles/PMC4833304/ /pubmed/27082953 http://dx.doi.org/10.1371/journal.pcbi.1004868 Text en © 2016 Ballouz, Gillis http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Ballouz, Sara Gillis, Jesse AuPairWise: A Method to Estimate RNA-Seq Replicability through Co-expression |
title | AuPairWise: A Method to Estimate RNA-Seq Replicability through Co-expression |
title_full | AuPairWise: A Method to Estimate RNA-Seq Replicability through Co-expression |
title_fullStr | AuPairWise: A Method to Estimate RNA-Seq Replicability through Co-expression |
title_full_unstemmed | AuPairWise: A Method to Estimate RNA-Seq Replicability through Co-expression |
title_short | AuPairWise: A Method to Estimate RNA-Seq Replicability through Co-expression |
title_sort | aupairwise: a method to estimate rna-seq replicability through co-expression |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4833304/ https://www.ncbi.nlm.nih.gov/pubmed/27082953 http://dx.doi.org/10.1371/journal.pcbi.1004868 |
work_keys_str_mv | AT ballouzsara aupairwiseamethodtoestimaternaseqreplicabilitythroughcoexpression AT gillisjesse aupairwiseamethodtoestimaternaseqreplicabilitythroughcoexpression |