Cargando…

Polee: RNA-Seq analysis using approximate likelihood

The analysis of mRNA transcript abundance with RNA-Seq is a central tool in molecular biology research, but often analyses fail to account for the uncertainty in these estimates, which can be significant, especially when trying to disentangle isoforms or duplicated genes. Preserving uncertainty nece...

Descripción completa

Detalles Bibliográficos
Autores principales: Jones, Daniel C, Ruzzo, Walter L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8152449/
https://www.ncbi.nlm.nih.gov/pubmed/34056596
http://dx.doi.org/10.1093/nargab/lqab046
_version_ 1783698608415047680
author Jones, Daniel C
Ruzzo, Walter L
author_facet Jones, Daniel C
Ruzzo, Walter L
author_sort Jones, Daniel C
collection PubMed
description The analysis of mRNA transcript abundance with RNA-Seq is a central tool in molecular biology research, but often analyses fail to account for the uncertainty in these estimates, which can be significant, especially when trying to disentangle isoforms or duplicated genes. Preserving uncertainty necessitates a full probabilistic model of the all the sequencing reads, which quickly becomes intractable, as experiments can consist of billions of reads. To overcome these limitations, we propose a new method of approximating the likelihood function of a sparse mixture model, using a technique we call the Pólya tree transformation. We demonstrate that substituting this approximation for the real thing achieves most of the benefits with a fraction of the computational costs, leading to more accurate detection of differential transcript expression and transcript coexpression.
format Online
Article
Text
id pubmed-8152449
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-81524492021-05-28 Polee: RNA-Seq analysis using approximate likelihood Jones, Daniel C Ruzzo, Walter L NAR Genom Bioinform Methods Article The analysis of mRNA transcript abundance with RNA-Seq is a central tool in molecular biology research, but often analyses fail to account for the uncertainty in these estimates, which can be significant, especially when trying to disentangle isoforms or duplicated genes. Preserving uncertainty necessitates a full probabilistic model of the all the sequencing reads, which quickly becomes intractable, as experiments can consist of billions of reads. To overcome these limitations, we propose a new method of approximating the likelihood function of a sparse mixture model, using a technique we call the Pólya tree transformation. We demonstrate that substituting this approximation for the real thing achieves most of the benefits with a fraction of the computational costs, leading to more accurate detection of differential transcript expression and transcript coexpression. Oxford University Press 2021-05-25 /pmc/articles/PMC8152449/ /pubmed/34056596 http://dx.doi.org/10.1093/nargab/lqab046 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Article
Jones, Daniel C
Ruzzo, Walter L
Polee: RNA-Seq analysis using approximate likelihood
title Polee: RNA-Seq analysis using approximate likelihood
title_full Polee: RNA-Seq analysis using approximate likelihood
title_fullStr Polee: RNA-Seq analysis using approximate likelihood
title_full_unstemmed Polee: RNA-Seq analysis using approximate likelihood
title_short Polee: RNA-Seq analysis using approximate likelihood
title_sort polee: rna-seq analysis using approximate likelihood
topic Methods Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8152449/
https://www.ncbi.nlm.nih.gov/pubmed/34056596
http://dx.doi.org/10.1093/nargab/lqab046
work_keys_str_mv AT jonesdanielc poleernaseqanalysisusingapproximatelikelihood
AT ruzzowalterl poleernaseqanalysisusingapproximatelikelihood