Cargando…
A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs
Most methods for statistical analysis of RNA-seq data take a matrix of abundance estimates for some type of genomic features as their input, and consequently the quality of any obtained results is directly dependent on the quality of these abundances. Here, we present the junction coverage compatibi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Life Science Alliance LLC
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6337739/ https://www.ncbi.nlm.nih.gov/pubmed/30655364 http://dx.doi.org/10.26508/lsa.201800175 |
_version_ | 1783388319826051072 |
---|---|
author | Soneson, Charlotte Love, Michael I Patro, Rob Hussain, Shobbir Malhotra, Dheeraj Robinson, Mark D |
author_facet | Soneson, Charlotte Love, Michael I Patro, Rob Hussain, Shobbir Malhotra, Dheeraj Robinson, Mark D |
author_sort | Soneson, Charlotte |
collection | PubMed |
description | Most methods for statistical analysis of RNA-seq data take a matrix of abundance estimates for some type of genomic features as their input, and consequently the quality of any obtained results is directly dependent on the quality of these abundances. Here, we present the junction coverage compatibility score, which provides a way to evaluate the reliability of transcript-level abundance estimates and the accuracy of transcript annotation catalogs. It works by comparing the observed number of reads spanning each annotated splice junction in a genomic region to the predicted number of junction-spanning reads, inferred from the estimated transcript abundances and the genomic coordinates of the corresponding annotated transcripts. We show that although most genes show good agreement between the observed and predicted junction coverages, there is a small set of genes that do not. Genes with poor agreement are found regardless of the method used to estimate transcript abundances, and the corresponding transcript abundances should be treated with care in any downstream analyses. |
format | Online Article Text |
id | pubmed-6337739 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Life Science Alliance LLC |
record_format | MEDLINE/PubMed |
spelling | pubmed-63377392019-01-22 A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs Soneson, Charlotte Love, Michael I Patro, Rob Hussain, Shobbir Malhotra, Dheeraj Robinson, Mark D Life Sci Alliance Research Articles Most methods for statistical analysis of RNA-seq data take a matrix of abundance estimates for some type of genomic features as their input, and consequently the quality of any obtained results is directly dependent on the quality of these abundances. Here, we present the junction coverage compatibility score, which provides a way to evaluate the reliability of transcript-level abundance estimates and the accuracy of transcript annotation catalogs. It works by comparing the observed number of reads spanning each annotated splice junction in a genomic region to the predicted number of junction-spanning reads, inferred from the estimated transcript abundances and the genomic coordinates of the corresponding annotated transcripts. We show that although most genes show good agreement between the observed and predicted junction coverages, there is a small set of genes that do not. Genes with poor agreement are found regardless of the method used to estimate transcript abundances, and the corresponding transcript abundances should be treated with care in any downstream analyses. Life Science Alliance LLC 2019-01-17 /pmc/articles/PMC6337739/ /pubmed/30655364 http://dx.doi.org/10.26508/lsa.201800175 Text en © 2019 Soneson et al. https://creativecommons.org/licenses/by/4.0/This article is available under a Creative Commons License (Attribution 4.0 International, as described at https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Articles Soneson, Charlotte Love, Michael I Patro, Rob Hussain, Shobbir Malhotra, Dheeraj Robinson, Mark D A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs |
title | A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs |
title_full | A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs |
title_fullStr | A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs |
title_full_unstemmed | A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs |
title_short | A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs |
title_sort | junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6337739/ https://www.ncbi.nlm.nih.gov/pubmed/30655364 http://dx.doi.org/10.26508/lsa.201800175 |
work_keys_str_mv | AT sonesoncharlotte ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT lovemichaeli ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT patrorob ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT hussainshobbir ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT malhotradheeraj ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT robinsonmarkd ajunctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT sonesoncharlotte junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT lovemichaeli junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT patrorob junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT hussainshobbir junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT malhotradheeraj junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs AT robinsonmarkd junctioncoveragecompatibilityscoretoquantifythereliabilityoftranscriptabundanceestimatesandannotationcatalogs |