Cargando…
The fractured landscape of RNA-seq alignment: the default in our STARs
Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods’ good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, w...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6007662/ https://www.ncbi.nlm.nih.gov/pubmed/29718481 http://dx.doi.org/10.1093/nar/gky325 |
_version_ | 1783333077079031808 |
---|---|
author | Ballouz, Sara Dobin, Alexander Gingeras, Thomas R Gillis, Jesse |
author_facet | Ballouz, Sara Dobin, Alexander Gingeras, Thomas R Gillis, Jesse |
author_sort | Ballouz, Sara |
collection | PubMed |
description | Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods’ good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, what is the most meaningful way to assess different alignment choices? And importantly, where is there room for progress? In this work, we explore the answers to these two questions by performing an exhaustive assessment of the STAR aligner. We assess STAR’s performance across a range of alignment parameters using common metrics, and then on biologically focused tasks. We find technical metrics such as fraction mapping or expression profile correlation to be uninformative, capturing properties unlikely to have any role in biological discovery. Surprisingly, we find that changes in alignment parameters within a wide range have little impact on both technical and biological performance. Yet, when performance finally does break, it happens in difficult regions, such as X-Y paralogs and MHC genes. We believe improved reporting by developers will help establish where results are likely to be robust or fragile, providing a better baseline to establish where methodological progress can still occur. |
format | Online Article Text |
id | pubmed-6007662 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-60076622018-06-25 The fractured landscape of RNA-seq alignment: the default in our STARs Ballouz, Sara Dobin, Alexander Gingeras, Thomas R Gillis, Jesse Nucleic Acids Res Genomics Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods’ good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, what is the most meaningful way to assess different alignment choices? And importantly, where is there room for progress? In this work, we explore the answers to these two questions by performing an exhaustive assessment of the STAR aligner. We assess STAR’s performance across a range of alignment parameters using common metrics, and then on biologically focused tasks. We find technical metrics such as fraction mapping or expression profile correlation to be uninformative, capturing properties unlikely to have any role in biological discovery. Surprisingly, we find that changes in alignment parameters within a wide range have little impact on both technical and biological performance. Yet, when performance finally does break, it happens in difficult regions, such as X-Y paralogs and MHC genes. We believe improved reporting by developers will help establish where results are likely to be robust or fragile, providing a better baseline to establish where methodological progress can still occur. Oxford University Press 2018-06-01 2018-05-01 /pmc/articles/PMC6007662/ /pubmed/29718481 http://dx.doi.org/10.1093/nar/gky325 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Genomics Ballouz, Sara Dobin, Alexander Gingeras, Thomas R Gillis, Jesse The fractured landscape of RNA-seq alignment: the default in our STARs |
title | The fractured landscape of RNA-seq alignment: the default in our STARs |
title_full | The fractured landscape of RNA-seq alignment: the default in our STARs |
title_fullStr | The fractured landscape of RNA-seq alignment: the default in our STARs |
title_full_unstemmed | The fractured landscape of RNA-seq alignment: the default in our STARs |
title_short | The fractured landscape of RNA-seq alignment: the default in our STARs |
title_sort | fractured landscape of rna-seq alignment: the default in our stars |
topic | Genomics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6007662/ https://www.ncbi.nlm.nih.gov/pubmed/29718481 http://dx.doi.org/10.1093/nar/gky325 |
work_keys_str_mv | AT ballouzsara thefracturedlandscapeofrnaseqalignmentthedefaultinourstars AT dobinalexander thefracturedlandscapeofrnaseqalignmentthedefaultinourstars AT gingerasthomasr thefracturedlandscapeofrnaseqalignmentthedefaultinourstars AT gillisjesse thefracturedlandscapeofrnaseqalignmentthedefaultinourstars AT ballouzsara fracturedlandscapeofrnaseqalignmentthedefaultinourstars AT dobinalexander fracturedlandscapeofrnaseqalignmentthedefaultinourstars AT gingerasthomasr fracturedlandscapeofrnaseqalignmentthedefaultinourstars AT gillisjesse fracturedlandscapeofrnaseqalignmentthedefaultinourstars |