Cargando…

Using equivalence class counts for fast and accurate testing of differential transcript usage

Background: RNA sequencing has enabled high-throughput and fine-grained quantitative analyses of the transcriptome. While differential gene expression is the most widely used application of this technology, RNA-seq data also has the resolution to infer differential transcript usage (DTU), which can...

Descripción completa

Detalles Bibliográficos
Autores principales: Cmero, Marek, Davidson, Nadia M., Oshlack, Alicia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6524746/
https://www.ncbi.nlm.nih.gov/pubmed/31143443
http://dx.doi.org/10.12688/f1000research.18276.2
_version_ 1783419606618079232
author Cmero, Marek
Davidson, Nadia M.
Oshlack, Alicia
author_facet Cmero, Marek
Davidson, Nadia M.
Oshlack, Alicia
author_sort Cmero, Marek
collection PubMed
description Background: RNA sequencing has enabled high-throughput and fine-grained quantitative analyses of the transcriptome. While differential gene expression is the most widely used application of this technology, RNA-seq data also has the resolution to infer differential transcript usage (DTU), which can elucidate the role of different transcript isoforms between experimental conditions, cell types or tissues. DTU has typically been inferred from exon-count data, which has issues with assigning reads unambiguously to counting bins, and requires alignment of reads to the genome. Recently, approaches have emerged that use transcript quantification estimates directly for DTU. Transcript counts can be inferred from 'pseudo' or lightweight aligners, which are significantly faster than traditional genome alignment. However, recent evaluations show lower sensitivity in DTU analysis compared to exon-level analysis. Transcript abundances are estimated from equivalence classes (ECs), which determine the transcripts that any given read is compatible with. Recent work has proposed performing a variety of RNA-seq analysis directly on equivalence class counts (ECCs). Methods: Here we demonstrate that ECCs can be used effectively with existing count-based methods for detecting DTU. We evaluate this approach on simulated human and drosophila data, as well as on a real dataset through subset testing. Results: We find that ECCs have similar sensitivity and false discovery rates as exon-level counts but can be generated in a fraction of the time through the use of pseudo-aligners. Conclusions: We posit that equivalence class read counts are a natural unit on which to perform differential transcript usage analysis.
format Online
Article
Text
id pubmed-6524746
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-65247462019-05-28 Using equivalence class counts for fast and accurate testing of differential transcript usage Cmero, Marek Davidson, Nadia M. Oshlack, Alicia F1000Res Research Article Background: RNA sequencing has enabled high-throughput and fine-grained quantitative analyses of the transcriptome. While differential gene expression is the most widely used application of this technology, RNA-seq data also has the resolution to infer differential transcript usage (DTU), which can elucidate the role of different transcript isoforms between experimental conditions, cell types or tissues. DTU has typically been inferred from exon-count data, which has issues with assigning reads unambiguously to counting bins, and requires alignment of reads to the genome. Recently, approaches have emerged that use transcript quantification estimates directly for DTU. Transcript counts can be inferred from 'pseudo' or lightweight aligners, which are significantly faster than traditional genome alignment. However, recent evaluations show lower sensitivity in DTU analysis compared to exon-level analysis. Transcript abundances are estimated from equivalence classes (ECs), which determine the transcripts that any given read is compatible with. Recent work has proposed performing a variety of RNA-seq analysis directly on equivalence class counts (ECCs). Methods: Here we demonstrate that ECCs can be used effectively with existing count-based methods for detecting DTU. We evaluate this approach on simulated human and drosophila data, as well as on a real dataset through subset testing. Results: We find that ECCs have similar sensitivity and false discovery rates as exon-level counts but can be generated in a fraction of the time through the use of pseudo-aligners. Conclusions: We posit that equivalence class read counts are a natural unit on which to perform differential transcript usage analysis. F1000 Research Limited 2019-04-29 /pmc/articles/PMC6524746/ /pubmed/31143443 http://dx.doi.org/10.12688/f1000research.18276.2 Text en Copyright: © 2019 Cmero M et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Cmero, Marek
Davidson, Nadia M.
Oshlack, Alicia
Using equivalence class counts for fast and accurate testing of differential transcript usage
title Using equivalence class counts for fast and accurate testing of differential transcript usage
title_full Using equivalence class counts for fast and accurate testing of differential transcript usage
title_fullStr Using equivalence class counts for fast and accurate testing of differential transcript usage
title_full_unstemmed Using equivalence class counts for fast and accurate testing of differential transcript usage
title_short Using equivalence class counts for fast and accurate testing of differential transcript usage
title_sort using equivalence class counts for fast and accurate testing of differential transcript usage
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6524746/
https://www.ncbi.nlm.nih.gov/pubmed/31143443
http://dx.doi.org/10.12688/f1000research.18276.2
work_keys_str_mv AT cmeromarek usingequivalenceclasscountsforfastandaccuratetestingofdifferentialtranscriptusage
AT davidsonnadiam usingequivalenceclasscountsforfastandaccuratetestingofdifferentialtranscriptusage
AT oshlackalicia usingequivalenceclasscountsforfastandaccuratetestingofdifferentialtranscriptusage