Cargando…

aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data

BACKGROUND: Assembling genes from next-generation sequencing data is not only time consuming but computationally difficult, particularly for taxa without a closely related reference genome. Assembling even a draft genome using de novo approaches can take days, even on a powerful computer, and these...

Descripción completa

Detalles Bibliográficos
Autores principales: Allen, Julie M, Huang, Daisie I, Cronk, Quentin C, Johnson, Kevin P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4380108/
https://www.ncbi.nlm.nih.gov/pubmed/25887972
http://dx.doi.org/10.1186/s12859-015-0515-2
_version_ 1782364291333095424
author Allen, Julie M
Huang, Daisie I
Cronk, Quentin C
Johnson, Kevin P
author_facet Allen, Julie M
Huang, Daisie I
Cronk, Quentin C
Johnson, Kevin P
author_sort Allen, Julie M
collection PubMed
description BACKGROUND: Assembling genes from next-generation sequencing data is not only time consuming but computationally difficult, particularly for taxa without a closely related reference genome. Assembling even a draft genome using de novo approaches can take days, even on a powerful computer, and these assemblies typically require data from a variety of genomic libraries. Here we describe software that will alleviate these issues by rapidly assembling genes from distantly related taxa using a single library of paired-end reads: aTRAM, automated Target Restricted Assembly Method. The aTRAM pipeline uses a reference sequence, BLAST, and an iterative approach to target and locally assemble the genes of interest. RESULTS: Our results demonstrate that aTRAM rapidly assembles genes across distantly related taxa. In comparative tests with a closely related taxon, aTRAM assembled the same sequence as reference-based and de novo approaches taking on average < 1 min per gene. As a test case with divergent sequences, we assembled >1,000 genes from six taxa ranging from 25 – 110 million years divergent from the reference taxon. The gene recovery was between 97 – 99% from each taxon. CONCLUSIONS: aTRAM can quickly assemble genes across distantly-related taxa, obviating the need for draft genome assembly of all taxa of interest. Because aTRAM uses a targeted approach, loci can be assembled in minutes depending on the size of the target. Our results suggest that this software will be useful in rapidly assembling genes for phylogenomic projects covering a wide taxonomic range, as well as other applications. The software is freely available http://www.github.com/juliema/aTRAM. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0515-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4380108
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43801082015-04-01 aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data Allen, Julie M Huang, Daisie I Cronk, Quentin C Johnson, Kevin P BMC Bioinformatics Software BACKGROUND: Assembling genes from next-generation sequencing data is not only time consuming but computationally difficult, particularly for taxa without a closely related reference genome. Assembling even a draft genome using de novo approaches can take days, even on a powerful computer, and these assemblies typically require data from a variety of genomic libraries. Here we describe software that will alleviate these issues by rapidly assembling genes from distantly related taxa using a single library of paired-end reads: aTRAM, automated Target Restricted Assembly Method. The aTRAM pipeline uses a reference sequence, BLAST, and an iterative approach to target and locally assemble the genes of interest. RESULTS: Our results demonstrate that aTRAM rapidly assembles genes across distantly related taxa. In comparative tests with a closely related taxon, aTRAM assembled the same sequence as reference-based and de novo approaches taking on average < 1 min per gene. As a test case with divergent sequences, we assembled >1,000 genes from six taxa ranging from 25 – 110 million years divergent from the reference taxon. The gene recovery was between 97 – 99% from each taxon. CONCLUSIONS: aTRAM can quickly assemble genes across distantly-related taxa, obviating the need for draft genome assembly of all taxa of interest. Because aTRAM uses a targeted approach, loci can be assembled in minutes depending on the size of the target. Our results suggest that this software will be useful in rapidly assembling genes for phylogenomic projects covering a wide taxonomic range, as well as other applications. The software is freely available http://www.github.com/juliema/aTRAM. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0515-2) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-25 /pmc/articles/PMC4380108/ /pubmed/25887972 http://dx.doi.org/10.1186/s12859-015-0515-2 Text en © Allen et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Allen, Julie M
Huang, Daisie I
Cronk, Quentin C
Johnson, Kevin P
aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data
title aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data
title_full aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data
title_fullStr aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data
title_full_unstemmed aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data
title_short aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data
title_sort atram - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4380108/
https://www.ncbi.nlm.nih.gov/pubmed/25887972
http://dx.doi.org/10.1186/s12859-015-0515-2
work_keys_str_mv AT allenjuliem atramautomatedtargetrestrictedassemblymethodafastmethodforassemblinglociacrossdivergenttaxafromnextgenerationsequencingdata
AT huangdaisiei atramautomatedtargetrestrictedassemblymethodafastmethodforassemblinglociacrossdivergenttaxafromnextgenerationsequencingdata
AT cronkquentinc atramautomatedtargetrestrictedassemblymethodafastmethodforassemblinglociacrossdivergenttaxafromnextgenerationsequencingdata
AT johnsonkevinp atramautomatedtargetrestrictedassemblymethodafastmethodforassemblinglociacrossdivergenttaxafromnextgenerationsequencingdata