Cargando…

Universal Count Correction for High-Throughput Sequencing

We show that existing RNA-seq, DNase-seq, and ChIP-seq data exhibit overdispersed per-base read count distributions that are not matched to existing computational method assumptions. To compensate for this overdispersion we introduce a nonparametric and universal method for processing per-base seque...

Descripción completa

Detalles Bibliográficos
Autores principales: Hashimoto, Tatsunori B., Edwards, Matthew D., Gifford, David K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3945112/
https://www.ncbi.nlm.nih.gov/pubmed/24603409
http://dx.doi.org/10.1371/journal.pcbi.1003494
_version_ 1782306483331923968
author Hashimoto, Tatsunori B.
Edwards, Matthew D.
Gifford, David K.
author_facet Hashimoto, Tatsunori B.
Edwards, Matthew D.
Gifford, David K.
author_sort Hashimoto, Tatsunori B.
collection PubMed
description We show that existing RNA-seq, DNase-seq, and ChIP-seq data exhibit overdispersed per-base read count distributions that are not matched to existing computational method assumptions. To compensate for this overdispersion we introduce a nonparametric and universal method for processing per-base sequencing read count data called Fixseq. We demonstrate that Fixseq substantially improves the performance of existing RNA-seq, DNase-seq, and ChIP-seq analysis tools when compared with existing alternatives.
format Online
Article
Text
id pubmed-3945112
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39451122014-03-12 Universal Count Correction for High-Throughput Sequencing Hashimoto, Tatsunori B. Edwards, Matthew D. Gifford, David K. PLoS Comput Biol Research Article We show that existing RNA-seq, DNase-seq, and ChIP-seq data exhibit overdispersed per-base read count distributions that are not matched to existing computational method assumptions. To compensate for this overdispersion we introduce a nonparametric and universal method for processing per-base sequencing read count data called Fixseq. We demonstrate that Fixseq substantially improves the performance of existing RNA-seq, DNase-seq, and ChIP-seq analysis tools when compared with existing alternatives. Public Library of Science 2014-03-06 /pmc/articles/PMC3945112/ /pubmed/24603409 http://dx.doi.org/10.1371/journal.pcbi.1003494 Text en © 2014 Hashimoto et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Hashimoto, Tatsunori B.
Edwards, Matthew D.
Gifford, David K.
Universal Count Correction for High-Throughput Sequencing
title Universal Count Correction for High-Throughput Sequencing
title_full Universal Count Correction for High-Throughput Sequencing
title_fullStr Universal Count Correction for High-Throughput Sequencing
title_full_unstemmed Universal Count Correction for High-Throughput Sequencing
title_short Universal Count Correction for High-Throughput Sequencing
title_sort universal count correction for high-throughput sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3945112/
https://www.ncbi.nlm.nih.gov/pubmed/24603409
http://dx.doi.org/10.1371/journal.pcbi.1003494
work_keys_str_mv AT hashimototatsunorib universalcountcorrectionforhighthroughputsequencing
AT edwardsmatthewd universalcountcorrectionforhighthroughputsequencing
AT gifforddavidk universalcountcorrectionforhighthroughputsequencing