Cargando…

A new approach to bias correction in RNA-Seq

Motivation: Quantification of sequence abundance in RNA-Seq experiments is often conflated by protocol-specific sequence bias. The exact sources of the bias are unknown, but may be influenced by polymerase chain reaction amplification, or differing primer affinities and mixtures, for example. The re...

Descripción completa

Detalles Bibliográficos
Autores principales: Jones, Daniel C., Ruzzo, Walter L., Peng, Xinxia, Katze, Michael G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315719/
https://www.ncbi.nlm.nih.gov/pubmed/22285831
http://dx.doi.org/10.1093/bioinformatics/bts055
_version_ 1782228278177693696
author Jones, Daniel C.
Ruzzo, Walter L.
Peng, Xinxia
Katze, Michael G.
author_facet Jones, Daniel C.
Ruzzo, Walter L.
Peng, Xinxia
Katze, Michael G.
author_sort Jones, Daniel C.
collection PubMed
description Motivation: Quantification of sequence abundance in RNA-Seq experiments is often conflated by protocol-specific sequence bias. The exact sources of the bias are unknown, but may be influenced by polymerase chain reaction amplification, or differing primer affinities and mixtures, for example. The result is decreased accuracy in many applications, such as de novo gene annotation and transcript quantification. Results: We present a new method to measure and correct for these influences using a simple graphical model. Our model does not rely on existing gene annotations, and model selection is performed automatically making it applicable with few assumptions. We evaluate our method on several datasets, and by multiple criteria, demonstrating that it effectively decreases bias and increases uniformity. Additionally, we provide theoretical and empirical results showing that the method is unlikely to have any effect on unbiased data, suggesting it can be applied with little risk of spurious adjustment. Availability: The method is implemented in the seqbias R/Bioconductor package, available freely under the LGPL license from http://bioconductor.org Contact: dcjones@cs.washington.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3315719
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33157192012-03-30 A new approach to bias correction in RNA-Seq Jones, Daniel C. Ruzzo, Walter L. Peng, Xinxia Katze, Michael G. Bioinformatics Original Papers Motivation: Quantification of sequence abundance in RNA-Seq experiments is often conflated by protocol-specific sequence bias. The exact sources of the bias are unknown, but may be influenced by polymerase chain reaction amplification, or differing primer affinities and mixtures, for example. The result is decreased accuracy in many applications, such as de novo gene annotation and transcript quantification. Results: We present a new method to measure and correct for these influences using a simple graphical model. Our model does not rely on existing gene annotations, and model selection is performed automatically making it applicable with few assumptions. We evaluate our method on several datasets, and by multiple criteria, demonstrating that it effectively decreases bias and increases uniformity. Additionally, we provide theoretical and empirical results showing that the method is unlikely to have any effect on unbiased data, suggesting it can be applied with little risk of spurious adjustment. Availability: The method is implemented in the seqbias R/Bioconductor package, available freely under the LGPL license from http://bioconductor.org Contact: dcjones@cs.washington.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-04-01 2012-01-28 /pmc/articles/PMC3315719/ /pubmed/22285831 http://dx.doi.org/10.1093/bioinformatics/bts055 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Jones, Daniel C.
Ruzzo, Walter L.
Peng, Xinxia
Katze, Michael G.
A new approach to bias correction in RNA-Seq
title A new approach to bias correction in RNA-Seq
title_full A new approach to bias correction in RNA-Seq
title_fullStr A new approach to bias correction in RNA-Seq
title_full_unstemmed A new approach to bias correction in RNA-Seq
title_short A new approach to bias correction in RNA-Seq
title_sort new approach to bias correction in rna-seq
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315719/
https://www.ncbi.nlm.nih.gov/pubmed/22285831
http://dx.doi.org/10.1093/bioinformatics/bts055
work_keys_str_mv AT jonesdanielc anewapproachtobiascorrectioninrnaseq
AT ruzzowalterl anewapproachtobiascorrectioninrnaseq
AT pengxinxia anewapproachtobiascorrectioninrnaseq
AT katzemichaelg anewapproachtobiascorrectioninrnaseq
AT jonesdanielc newapproachtobiascorrectioninrnaseq
AT ruzzowalterl newapproachtobiascorrectioninrnaseq
AT pengxinxia newapproachtobiascorrectioninrnaseq
AT katzemichaelg newapproachtobiascorrectioninrnaseq