Cargando…

A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs

Most methods for statistical analysis of RNA-seq data take a matrix of abundance estimates for some type of genomic features as their input, and consequently the quality of any obtained results is directly dependent on the quality of these abundances. Here, we present the junction coverage compatibi...

Descripción completa

Detalles Bibliográficos
Autores principales: Soneson, Charlotte, Love, Michael I, Patro, Rob, Hussain, Shobbir, Malhotra, Dheeraj, Robinson, Mark D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Life Science Alliance LLC 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6337739/
https://www.ncbi.nlm.nih.gov/pubmed/30655364
http://dx.doi.org/10.26508/lsa.201800175
_version_ 1783388319826051072
author Soneson, Charlotte
Love, Michael I
Patro, Rob
Hussain, Shobbir
Malhotra, Dheeraj
Robinson, Mark D
author_facet Soneson, Charlotte
Love, Michael I
Patro, Rob
Hussain, Shobbir
Malhotra, Dheeraj
Robinson, Mark D
author_sort Soneson, Charlotte
collection PubMed
description Most methods for statistical analysis of RNA-seq data take a matrix of abundance estimates for some type of genomic features as their input, and consequently the quality of any obtained results is directly dependent on the quality of these abundances. Here, we present the junction coverage compatibility score, which provides a way to evaluate the reliability of transcript-level abundance estimates and the accuracy of transcript annotation catalogs. It works by comparing the observed number of reads spanning each annotated splice junction in a genomic region to the predicted number of junction-spanning reads, inferred from the estimated transcript abundances and the genomic coordinates of the corresponding annotated transcripts. We show that although most genes show good agreement between the observed and predicted junction coverages, there is a small set of genes that do not. Genes with poor agreement are found regardless of the method used to estimate transcript abundances, and the corresponding transcript abundances should be treated with care in any downstream analyses.
format Online
Article
Text
id pubmed-6337739
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Life Science Alliance LLC
record_format MEDLINE/PubMed
spelling pubmed-63377392019-01-22 A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs Soneson, Charlotte Love, Michael I Patro, Rob Hussain, Shobbir Malhotra, Dheeraj Robinson, Mark D Life Sci Alliance Research Articles Most methods for statistical analysis of RNA-seq data take a matrix of abundance estimates for some type of genomic features as their input, and consequently the quality of any obtained results is directly dependent on the quality of these abundances. Here, we present the junction coverage compatibility score, which provides a way to evaluate the reliability of transcript-level abundance estimates and the accuracy of transcript annotation catalogs. It works by comparing the observed number of reads spanning each annotated splice junction in a genomic region to the predicted number of junction-spanning reads, inferred from the estimated transcript abundances and the genomic coordinates of the corresponding annotated transcripts. We show that although most genes show good agreement between the observed and predicted junction coverages, there is a small set of genes that do not. Genes with poor agreement are found regardless of the method used to estimate transcript abundances, and the corresponding transcript abundances should be treated with care in any downstream analyses. Life Science Alliance LLC 2019-01-17 /pmc/articles/PMC6337739/ /pubmed/30655364 http://dx.doi.org/10.26508/lsa.201800175 Text en © 2019 Soneson et al. https://creativecommons.org/licenses/by/4.0/This article is available under a Creative Commons License (Attribution 4.0 International, as described at https://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Articles
Soneson, Charlotte
Love, Michael I
Patro, Rob
Hussain, Shobbir
Malhotra, Dheeraj
Robinson, Mark D
A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs
title A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs
title_full A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs
title_fullStr A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs
title_full_unstemmed A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs
title_short A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs
title_sort junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6337739/
https://www.ncbi.nlm.nih.gov/pubmed/30655364
http://dx.doi.org/10.26508/lsa.201800175
work_keys_str_mv AT sonesoncharlotte ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT lovemichaeli ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT patrorob ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT hussainshobbir ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT malhotradheeraj ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT robinsonmarkd ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT sonesoncharlotte junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT lovemichaeli junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT patrorob junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT hussainshobbir junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT malhotradheeraj junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs
AT robinsonmarkd junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs