Cargando…
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers
BACKGROUND: In recent years, massively parallel complementary DNA sequencing (RNA sequencing [RNA-Seq]) has emerged as a fast, cost-effective, and robust technology to study entire transcriptomes in various manners. In particular, for non-model organisms and in the absence of an appropriate referenc...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511074/ https://www.ncbi.nlm.nih.gov/pubmed/31077315 http://dx.doi.org/10.1093/gigascience/giz039 |
_version_ | 1783417515071766528 |
---|---|
author | Hölzer, Martin Marz, Manja |
author_facet | Hölzer, Martin Marz, Manja |
author_sort | Hölzer, Martin |
collection | PubMed |
description | BACKGROUND: In recent years, massively parallel complementary DNA sequencing (RNA sequencing [RNA-Seq]) has emerged as a fast, cost-effective, and robust technology to study entire transcriptomes in various manners. In particular, for non-model organisms and in the absence of an appropriate reference genome, RNA-Seq is used to reconstruct the transcriptome de novo. Although the de novo transcriptome assembly of non-model organisms has been on the rise recently and new tools are frequently developing, there is still a knowledge gap about which assembly software should be used to build a comprehensive de novo assembly. RESULTS: Here, we present a large-scale comparative study in which 10 de novo assembly tools are applied to 9 RNA-Seq data sets spanning different kingdoms of life. Overall, we built >200 single assemblies and evaluated their performance on a combination of 20 biological-based and reference-free metrics. Our study is accompanied by a comprehensive and extensible Electronic Supplement that summarizes all data sets, assembly execution instructions, and evaluation results. Trinity, SPAdes, and Trans-ABySS, followed by Bridger and SOAPdenovo-Trans, generally outperformed the other tools compared. Moreover, we observed species-specific differences in the performance of each assembler. No tool delivered the best results for all data sets. CONCLUSIONS: We recommend a careful choice and normalization of evaluation metrics to select the best assembling results as a critical step in the reconstruction of a comprehensive de novo transcriptome assembly. |
format | Online Article Text |
id | pubmed-6511074 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-65110742019-05-15 De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers Hölzer, Martin Marz, Manja Gigascience Research BACKGROUND: In recent years, massively parallel complementary DNA sequencing (RNA sequencing [RNA-Seq]) has emerged as a fast, cost-effective, and robust technology to study entire transcriptomes in various manners. In particular, for non-model organisms and in the absence of an appropriate reference genome, RNA-Seq is used to reconstruct the transcriptome de novo. Although the de novo transcriptome assembly of non-model organisms has been on the rise recently and new tools are frequently developing, there is still a knowledge gap about which assembly software should be used to build a comprehensive de novo assembly. RESULTS: Here, we present a large-scale comparative study in which 10 de novo assembly tools are applied to 9 RNA-Seq data sets spanning different kingdoms of life. Overall, we built >200 single assemblies and evaluated their performance on a combination of 20 biological-based and reference-free metrics. Our study is accompanied by a comprehensive and extensible Electronic Supplement that summarizes all data sets, assembly execution instructions, and evaluation results. Trinity, SPAdes, and Trans-ABySS, followed by Bridger and SOAPdenovo-Trans, generally outperformed the other tools compared. Moreover, we observed species-specific differences in the performance of each assembler. No tool delivered the best results for all data sets. CONCLUSIONS: We recommend a careful choice and normalization of evaluation metrics to select the best assembling results as a critical step in the reconstruction of a comprehensive de novo transcriptome assembly. Oxford University Press 2019-05-11 /pmc/articles/PMC6511074/ /pubmed/31077315 http://dx.doi.org/10.1093/gigascience/giz039 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Hölzer, Martin Marz, Manja De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers |
title |
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers |
title_full |
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers |
title_fullStr |
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers |
title_full_unstemmed |
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers |
title_short |
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers |
title_sort | de novo transcriptome assembly: a comprehensive cross-species comparison of short-read rna-seq assemblers |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511074/ https://www.ncbi.nlm.nih.gov/pubmed/31077315 http://dx.doi.org/10.1093/gigascience/giz039 |
work_keys_str_mv | AT holzermartin denovotranscriptomeassemblyacomprehensivecrossspeciescomparisonofshortreadrnaseqassemblers AT marzmanja denovotranscriptomeassemblyacomprehensivecrossspeciescomparisonofshortreadrnaseqassemblers |