Cargando…
Modeling non-uniformity in short-read rates in RNA-Seq data
After mapping, RNA-Seq data can be summarized by a sequence of read counts commonly modeled as Poisson variables with constant rates along each transcript, which actually fit data poorly. We suggest using variable rates for different positions, and propose two models to predict these rates based on...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2898062/ https://www.ncbi.nlm.nih.gov/pubmed/20459815 http://dx.doi.org/10.1186/gb-2010-11-5-r50 |
_version_ | 1782183467633606656 |
---|---|
author | Li, Jun Jiang, Hui Wong, Wing Hung |
author_facet | Li, Jun Jiang, Hui Wong, Wing Hung |
author_sort | Li, Jun |
collection | PubMed |
description | After mapping, RNA-Seq data can be summarized by a sequence of read counts commonly modeled as Poisson variables with constant rates along each transcript, which actually fit data poorly. We suggest using variable rates for different positions, and propose two models to predict these rates based on local sequences. These models explain more than 50% of the variations and can lead to improved estimates of gene and isoform expressions for both Illumina and Applied Biosystems data. |
format | Text |
id | pubmed-2898062 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28980622010-07-07 Modeling non-uniformity in short-read rates in RNA-Seq data Li, Jun Jiang, Hui Wong, Wing Hung Genome Biol Method After mapping, RNA-Seq data can be summarized by a sequence of read counts commonly modeled as Poisson variables with constant rates along each transcript, which actually fit data poorly. We suggest using variable rates for different positions, and propose two models to predict these rates based on local sequences. These models explain more than 50% of the variations and can lead to improved estimates of gene and isoform expressions for both Illumina and Applied Biosystems data. BioMed Central 2010 2010-05-11 /pmc/articles/PMC2898062/ /pubmed/20459815 http://dx.doi.org/10.1186/gb-2010-11-5-r50 Text en Copyright ©2010 Li et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Method Li, Jun Jiang, Hui Wong, Wing Hung Modeling non-uniformity in short-read rates in RNA-Seq data |
title | Modeling non-uniformity in short-read rates in RNA-Seq data |
title_full | Modeling non-uniformity in short-read rates in RNA-Seq data |
title_fullStr | Modeling non-uniformity in short-read rates in RNA-Seq data |
title_full_unstemmed | Modeling non-uniformity in short-read rates in RNA-Seq data |
title_short | Modeling non-uniformity in short-read rates in RNA-Seq data |
title_sort | modeling non-uniformity in short-read rates in rna-seq data |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2898062/ https://www.ncbi.nlm.nih.gov/pubmed/20459815 http://dx.doi.org/10.1186/gb-2010-11-5-r50 |
work_keys_str_mv | AT lijun modelingnonuniformityinshortreadratesinrnaseqdata AT jianghui modelingnonuniformityinshortreadratesinrnaseqdata AT wongwinghung modelingnonuniformityinshortreadratesinrnaseqdata |