Cargando…

Evaluation of DISCOVAR de novo using a mosquito sample for cost-effective short-read genome assembly

BACKGROUND: De novo reference assemblies that are affordable, practical to produce, and of sufficient quality for most downstream applications, remain an unattained goal for many taxa. Insects, which may yield too little DNA from individual specimens for long-read sequencing library construction and...

Descripción completa

Detalles Bibliográficos
Autores principales: Love, R. Rebecca, Weisenfeld, Neil I., Jaffe, David B., Besansky, Nora J., Neafsey, Daniel E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4779211/
https://www.ncbi.nlm.nih.gov/pubmed/26944054
http://dx.doi.org/10.1186/s12864-016-2531-7
_version_ 1782419596018450432
author Love, R. Rebecca
Weisenfeld, Neil I.
Jaffe, David B.
Besansky, Nora J.
Neafsey, Daniel E.
author_facet Love, R. Rebecca
Weisenfeld, Neil I.
Jaffe, David B.
Besansky, Nora J.
Neafsey, Daniel E.
author_sort Love, R. Rebecca
collection PubMed
description BACKGROUND: De novo reference assemblies that are affordable, practical to produce, and of sufficient quality for most downstream applications, remain an unattained goal for many taxa. Insects, which may yield too little DNA from individual specimens for long-read sequencing library construction and often have highly heterozygous genomes, can be particularly hard to assemble using inexpensive short-read sequencing data. The large number of insect species with medical or economic importance makes this a critical problem to address. RESULTS: Using the assembler DISCOVAR de novo, we assembled the genome of the African malaria mosquito Anopheles arabiensis using 250 bp reads from a single library. The resulting assembly had a contig N50 of 22,433 bp, and recovered the gene set nearly as well as the ALLPATHS-LG AaraD1 An. arabiensis assembly produced with reads from three sequencing libraries and much greater resources. DISCOVAR de novo appeared to perform better than ALLPATHS-LG in regions of low complexity. CONCLUSIONS: DISCOVAR de novo performed well assembling the genome of an insect of medical importance, using simpler sequencing input than previous anopheline assemblies. We have shown that this program is a viable tool for cost-effective assembly of a modestly-sized insect genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2531-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4779211
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47792112016-03-06 Evaluation of DISCOVAR de novo using a mosquito sample for cost-effective short-read genome assembly Love, R. Rebecca Weisenfeld, Neil I. Jaffe, David B. Besansky, Nora J. Neafsey, Daniel E. BMC Genomics Research Article BACKGROUND: De novo reference assemblies that are affordable, practical to produce, and of sufficient quality for most downstream applications, remain an unattained goal for many taxa. Insects, which may yield too little DNA from individual specimens for long-read sequencing library construction and often have highly heterozygous genomes, can be particularly hard to assemble using inexpensive short-read sequencing data. The large number of insect species with medical or economic importance makes this a critical problem to address. RESULTS: Using the assembler DISCOVAR de novo, we assembled the genome of the African malaria mosquito Anopheles arabiensis using 250 bp reads from a single library. The resulting assembly had a contig N50 of 22,433 bp, and recovered the gene set nearly as well as the ALLPATHS-LG AaraD1 An. arabiensis assembly produced with reads from three sequencing libraries and much greater resources. DISCOVAR de novo appeared to perform better than ALLPATHS-LG in regions of low complexity. CONCLUSIONS: DISCOVAR de novo performed well assembling the genome of an insect of medical importance, using simpler sequencing input than previous anopheline assemblies. We have shown that this program is a viable tool for cost-effective assembly of a modestly-sized insect genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2531-7) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-05 /pmc/articles/PMC4779211/ /pubmed/26944054 http://dx.doi.org/10.1186/s12864-016-2531-7 Text en © Love et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Love, R. Rebecca
Weisenfeld, Neil I.
Jaffe, David B.
Besansky, Nora J.
Neafsey, Daniel E.
Evaluation of DISCOVAR de novo using a mosquito sample for cost-effective short-read genome assembly
title Evaluation of DISCOVAR de novo using a mosquito sample for cost-effective short-read genome assembly
title_full Evaluation of DISCOVAR de novo using a mosquito sample for cost-effective short-read genome assembly
title_fullStr Evaluation of DISCOVAR de novo using a mosquito sample for cost-effective short-read genome assembly
title_full_unstemmed Evaluation of DISCOVAR de novo using a mosquito sample for cost-effective short-read genome assembly
title_short Evaluation of DISCOVAR de novo using a mosquito sample for cost-effective short-read genome assembly
title_sort evaluation of discovar de novo using a mosquito sample for cost-effective short-read genome assembly
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4779211/
https://www.ncbi.nlm.nih.gov/pubmed/26944054
http://dx.doi.org/10.1186/s12864-016-2531-7
work_keys_str_mv AT loverrebecca evaluationofdiscovardenovousingamosquitosampleforcosteffectiveshortreadgenomeassembly
AT weisenfeldneili evaluationofdiscovardenovousingamosquitosampleforcosteffectiveshortreadgenomeassembly
AT jaffedavidb evaluationofdiscovardenovousingamosquitosampleforcosteffectiveshortreadgenomeassembly
AT besanskynoraj evaluationofdiscovardenovousingamosquitosampleforcosteffectiveshortreadgenomeassembly
AT neafseydaniele evaluationofdiscovardenovousingamosquitosampleforcosteffectiveshortreadgenomeassembly