Cargando…

A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system

BACKGROUND: A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greate...

Descripción completa

Detalles Bibliográficos
Autores principales: Kingan, Sarah B, Urban, Julie, Lambert, Christine C, Baybayan, Primo, Childers, Anna K, Coates, Brad, Scheffler, Brian, Hackett, Kevin, Korlach, Jonas, Geib, Scott M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6791401/
https://www.ncbi.nlm.nih.gov/pubmed/31609423
http://dx.doi.org/10.1093/gigascience/giz122
_version_ 1783458974720327680
author Kingan, Sarah B
Urban, Julie
Lambert, Christine C
Baybayan, Primo
Childers, Anna K
Coates, Brad
Scheffler, Brian
Hackett, Kevin
Korlach, Jonas
Geib, Scott M
author_facet Kingan, Sarah B
Urban, Julie
Lambert, Christine C
Baybayan, Primo
Childers, Anna K
Coates, Brad
Scheffler, Brian
Hackett, Kevin
Korlach, Jonas
Geib, Scott M
author_sort Kingan, Sarah B
collection PubMed
description BACKGROUND: A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region. RESULTS: The DNA from 1 individual was used to make 1 standard, size-selected library with an average DNA fragment size of ∼20 kb. The library was run on 1 Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing ∼36× coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Furthermore, it was possible to segregate more than half of the diploid genome into the 2 separate haplotypes. The assembly also recovered 2 microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig. CONCLUSIONS: We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.
format Online
Article
Text
id pubmed-6791401
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-67914012019-10-21 A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system Kingan, Sarah B Urban, Julie Lambert, Christine C Baybayan, Primo Childers, Anna K Coates, Brad Scheffler, Brian Hackett, Kevin Korlach, Jonas Geib, Scott M Gigascience Data Note BACKGROUND: A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region. RESULTS: The DNA from 1 individual was used to make 1 standard, size-selected library with an average DNA fragment size of ∼20 kb. The library was run on 1 Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing ∼36× coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Furthermore, it was possible to segregate more than half of the diploid genome into the 2 separate haplotypes. The assembly also recovered 2 microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig. CONCLUSIONS: We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species. Oxford University Press 2019-10-14 /pmc/articles/PMC6791401/ /pubmed/31609423 http://dx.doi.org/10.1093/gigascience/giz122 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Note
Kingan, Sarah B
Urban, Julie
Lambert, Christine C
Baybayan, Primo
Childers, Anna K
Coates, Brad
Scheffler, Brian
Hackett, Kevin
Korlach, Jonas
Geib, Scott M
A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system
title A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system
title_full A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system
title_fullStr A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system
title_full_unstemmed A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system
title_short A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system
title_sort high-quality genome assembly from a single, field-collected spotted lanternfly (lycorma delicatula) using the pacbio sequel ii system
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6791401/
https://www.ncbi.nlm.nih.gov/pubmed/31609423
http://dx.doi.org/10.1093/gigascience/giz122
work_keys_str_mv AT kingansarahb ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT urbanjulie ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT lambertchristinec ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT baybayanprimo ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT childersannak ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT coatesbrad ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT schefflerbrian ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT hackettkevin ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT korlachjonas ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT geibscottm ahighqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT kingansarahb highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT urbanjulie highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT lambertchristinec highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT baybayanprimo highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT childersannak highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT coatesbrad highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT schefflerbrian highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT hackettkevin highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT korlachjonas highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem
AT geibscottm highqualitygenomeassemblyfromasinglefieldcollectedspottedlanternflylycormadelicatulausingthepacbiosequeliisystem