Cargando…

The fractured landscape of RNA-seq alignment: the default in our STARs

Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods’ good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Ballouz, Sara, Dobin, Alexander, Gingeras, Thomas R, Gillis, Jesse
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6007662/
https://www.ncbi.nlm.nih.gov/pubmed/29718481
http://dx.doi.org/10.1093/nar/gky325
_version_ 1783333077079031808
author Ballouz, Sara
Dobin, Alexander
Gingeras, Thomas R
Gillis, Jesse
author_facet Ballouz, Sara
Dobin, Alexander
Gingeras, Thomas R
Gillis, Jesse
author_sort Ballouz, Sara
collection PubMed
description Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods’ good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, what is the most meaningful way to assess different alignment choices? And importantly, where is there room for progress? In this work, we explore the answers to these two questions by performing an exhaustive assessment of the STAR aligner. We assess STAR’s performance across a range of alignment parameters using common metrics, and then on biologically focused tasks. We find technical metrics such as fraction mapping or expression profile correlation to be uninformative, capturing properties unlikely to have any role in biological discovery. Surprisingly, we find that changes in alignment parameters within a wide range have little impact on both technical and biological performance. Yet, when performance finally does break, it happens in difficult regions, such as X-Y paralogs and MHC genes. We believe improved reporting by developers will help establish where results are likely to be robust or fragile, providing a better baseline to establish where methodological progress can still occur.
format Online
Article
Text
id pubmed-6007662
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60076622018-06-25 The fractured landscape of RNA-seq alignment: the default in our STARs Ballouz, Sara Dobin, Alexander Gingeras, Thomas R Gillis, Jesse Nucleic Acids Res Genomics Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods’ good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, what is the most meaningful way to assess different alignment choices? And importantly, where is there room for progress? In this work, we explore the answers to these two questions by performing an exhaustive assessment of the STAR aligner. We assess STAR’s performance across a range of alignment parameters using common metrics, and then on biologically focused tasks. We find technical metrics such as fraction mapping or expression profile correlation to be uninformative, capturing properties unlikely to have any role in biological discovery. Surprisingly, we find that changes in alignment parameters within a wide range have little impact on both technical and biological performance. Yet, when performance finally does break, it happens in difficult regions, such as X-Y paralogs and MHC genes. We believe improved reporting by developers will help establish where results are likely to be robust or fragile, providing a better baseline to establish where methodological progress can still occur. Oxford University Press 2018-06-01 2018-05-01 /pmc/articles/PMC6007662/ /pubmed/29718481 http://dx.doi.org/10.1093/nar/gky325 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Genomics
Ballouz, Sara
Dobin, Alexander
Gingeras, Thomas R
Gillis, Jesse
The fractured landscape of RNA-seq alignment: the default in our STARs
title The fractured landscape of RNA-seq alignment: the default in our STARs
title_full The fractured landscape of RNA-seq alignment: the default in our STARs
title_fullStr The fractured landscape of RNA-seq alignment: the default in our STARs
title_full_unstemmed The fractured landscape of RNA-seq alignment: the default in our STARs
title_short The fractured landscape of RNA-seq alignment: the default in our STARs
title_sort fractured landscape of rna-seq alignment: the default in our stars
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6007662/
https://www.ncbi.nlm.nih.gov/pubmed/29718481
http://dx.doi.org/10.1093/nar/gky325
work_keys_str_mv AT ballouzsara thefracturedlandscapeofrnaseqalignmentthedefaultinourstars
AT dobinalexander thefracturedlandscapeofrnaseqalignmentthedefaultinourstars
AT gingerasthomasr thefracturedlandscapeofrnaseqalignmentthedefaultinourstars
AT gillisjesse thefracturedlandscapeofrnaseqalignmentthedefaultinourstars
AT ballouzsara fracturedlandscapeofrnaseqalignmentthedefaultinourstars
AT dobinalexander fracturedlandscapeofrnaseqalignmentthedefaultinourstars
AT gingerasthomasr fracturedlandscapeofrnaseqalignmentthedefaultinourstars
AT gillisjesse fracturedlandscapeofrnaseqalignmentthedefaultinourstars