Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: a comparison of De novo assemblers

Bibliographic Details
Main Authors: Amin, Shorash; Prentis, Peter J; Gilding, Edward K; Pavasovic, Ana
Format: Online Article Text
Language: English
Published: BioMed Central 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4124492/
https://www.ncbi.nlm.nih.gov/pubmed/25084827
http://dx.doi.org/10.1186/1756-0500-7-488
author Amin, Shorash
Prentis, Peter J
Gilding, Edward K
Pavasovic, Ana
collection PubMed
description BACKGROUND: The sequencing, de novo assembly and annotation of transcriptome datasets generated with next generation sequencing (NGS) have enabled biologists to answer genomic questions in non-model species with unprecedented ease. Reliable and accurate de novo assembly and annotation of transcriptomes, however, is a critically important step for transcriptome assemblies generated from short read sequences. Typical benchmarks for assembly and annotation reliability have been performed with model species. To address the reliability and accuracy of de novo transcriptome assembly in non-model species, we generated an RNAseq dataset for an intertidal gastropod mollusc species, Nerita melanotragus, and compared the assemblies produced by four different de novo transcriptome assemblers (Velvet, Oases, Geneious and Trinity) for a number of quality metrics and redundancy. RESULTS: Transcriptome sequencing on the Ion Torrent PGM™ produced 1,883,624 raw reads with a mean length of 133 base pairs (bp). The Trinity and Oases de novo assemblers produced the best assemblies based on all quality metrics, including fewer contigs, increased N50 and average contig length, and contigs of greater length. Overall, the BLAST and annotation success of our assemblies was not high, with only 15-19% of contigs assigned a putative function. CONCLUSIONS: We believe that any improvement in annotation success of gastropod species will require more gastropod genome sequences, but in particular an increase in mollusc protein sequences in public databases. Overall, this paper demonstrates that reliable and accurate de novo transcriptome assemblies can be generated from short read sequencers with the right assembly algorithms.
format Online
Article
Text
id pubmed-4124492
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4124492 2014-08-08 Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: a comparison of De novo assemblers Amin, Shorash Prentis, Peter J Gilding, Edward K Pavasovic, Ana BMC Res Notes Research Article
BioMed Central 2014-08-01 /pmc/articles/PMC4124492/ /pubmed/25084827 http://dx.doi.org/10.1186/1756-0500-7-488 Text en Copyright © 2014 Amin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: a comparison of De novo assemblers
topic Research Article