Cargando…

The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly

Characterizing transcriptomes in non-model organisms has resulted in a massive increase in our understanding of biological phenomena. This boon, largely made possible via high-throughput sequencing, means that studies of functional, evolutionary, and population genomics are now being done by hundred...

Descripción completa

Detalles Bibliográficos
Autor principal: MacManes, Matthew D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6078068/
https://www.ncbi.nlm.nih.gov/pubmed/30083482
http://dx.doi.org/10.7717/peerj.5428
_version_ 1783345029500108800
author MacManes, Matthew D.
author_facet MacManes, Matthew D.
author_sort MacManes, Matthew D.
collection PubMed
description Characterizing transcriptomes in non-model organisms has resulted in a massive increase in our understanding of biological phenomena. This boon, largely made possible via high-throughput sequencing, means that studies of functional, evolutionary, and population genomics are now being done by hundreds or even thousands of labs around the world. For many, these studies begin with a de novo transcriptome assembly, which is a technically complicated process involving several discrete steps. The Oyster River Protocol (ORP), described here, implements a standardized and benchmarked set of bioinformatic processes, resulting in an assembly with enhanced qualities over other standard assembly methods. Specifically, ORP produced assemblies have higher Detonate and TransRate scores and mapping rates, which is largely a product of the fact that it leverages a multi-assembler and kmer assembly process, thereby bypassing the shortcomings of any one approach. These improvements are important, as previously unassembled transcripts are included in ORP assemblies, resulting in a significant enhancement of the power of downstream analysis. Further, as part of this study, I show that assembly quality is unrelated with the number of reads generated, above 30 million reads. Code Availability: The version controlled open-source code is available at https://github.com/macmanes-lab/Oyster_River_Protocol. Instructions for software installation and use, and other details are available at http://oyster-river-protocol.rtfd.org/.
format Online
Article
Text
id pubmed-6078068
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-60780682018-08-06 The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly MacManes, Matthew D. PeerJ Bioinformatics Characterizing transcriptomes in non-model organisms has resulted in a massive increase in our understanding of biological phenomena. This boon, largely made possible via high-throughput sequencing, means that studies of functional, evolutionary, and population genomics are now being done by hundreds or even thousands of labs around the world. For many, these studies begin with a de novo transcriptome assembly, which is a technically complicated process involving several discrete steps. The Oyster River Protocol (ORP), described here, implements a standardized and benchmarked set of bioinformatic processes, resulting in an assembly with enhanced qualities over other standard assembly methods. Specifically, ORP produced assemblies have higher Detonate and TransRate scores and mapping rates, which is largely a product of the fact that it leverages a multi-assembler and kmer assembly process, thereby bypassing the shortcomings of any one approach. These improvements are important, as previously unassembled transcripts are included in ORP assemblies, resulting in a significant enhancement of the power of downstream analysis. Further, as part of this study, I show that assembly quality is unrelated with the number of reads generated, above 30 million reads. Code Availability: The version controlled open-source code is available at https://github.com/macmanes-lab/Oyster_River_Protocol. Instructions for software installation and use, and other details are available at http://oyster-river-protocol.rtfd.org/. PeerJ Inc. 2018-08-03 /pmc/articles/PMC6078068/ /pubmed/30083482 http://dx.doi.org/10.7717/peerj.5428 Text en © 2018 MacManes http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
MacManes, Matthew D.
The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly
title The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly
title_full The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly
title_fullStr The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly
title_full_unstemmed The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly
title_short The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly
title_sort oyster river protocol: a multi-assembler and kmer approach for de novo transcriptome assembly
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6078068/
https://www.ncbi.nlm.nih.gov/pubmed/30083482
http://dx.doi.org/10.7717/peerj.5428
work_keys_str_mv AT macmanesmatthewd theoysterriverprotocolamultiassemblerandkmerapproachfordenovotranscriptomeassembly
AT macmanesmatthewd oysterriverprotocolamultiassemblerandkmerapproachfordenovotranscriptomeassembly