Cargando…

A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing

A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus...

Descripción completa

Detalles Bibliográficos
Autores principales: Kingan, Sarah B., Heaton, Haynes, Cudini, Juliana, Lambert, Christine C., Baybayan, Primo, Galvin, Brendan D., Durbin, Richard, Korlach, Jonas, Lawniczak, Mara K. N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6357164/
https://www.ncbi.nlm.nih.gov/pubmed/30669388
http://dx.doi.org/10.3390/genes10010062
_version_ 1783391739260698624
author Kingan, Sarah B.
Heaton, Haynes
Cudini, Juliana
Lambert, Christine C.
Baybayan, Primo
Galvin, Brendan D.
Durbin, Richard
Korlach, Jonas
Lawniczak, Mara K. N.
author_facet Kingan, Sarah B.
Heaton, Haynes
Cudini, Juliana
Lambert, Christine C.
Baybayan, Primo
Galvin, Brendan D.
Durbin, Richard
Korlach, Jonas
Lawniczak, Mara K. N.
author_sort Kingan, Sarah B.
collection PubMed
description A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives, however, relatively high DNA input requirements (~5 µg for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size. This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life.
format Online
Article
Text
id pubmed-6357164
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-63571642019-02-04 A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing Kingan, Sarah B. Heaton, Haynes Cudini, Juliana Lambert, Christine C. Baybayan, Primo Galvin, Brendan D. Durbin, Richard Korlach, Jonas Lawniczak, Mara K. N. Genes (Basel) Article A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives, however, relatively high DNA input requirements (~5 µg for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size. This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life. MDPI 2019-01-18 /pmc/articles/PMC6357164/ /pubmed/30669388 http://dx.doi.org/10.3390/genes10010062 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kingan, Sarah B.
Heaton, Haynes
Cudini, Juliana
Lambert, Christine C.
Baybayan, Primo
Galvin, Brendan D.
Durbin, Richard
Korlach, Jonas
Lawniczak, Mara K. N.
A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing
title A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing
title_full A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing
title_fullStr A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing
title_full_unstemmed A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing
title_short A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing
title_sort high-quality de novo genome assembly from a single mosquito using pacbio sequencing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6357164/
https://www.ncbi.nlm.nih.gov/pubmed/30669388
http://dx.doi.org/10.3390/genes10010062
work_keys_str_mv AT kingansarahb ahighqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT heatonhaynes ahighqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT cudinijuliana ahighqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT lambertchristinec ahighqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT baybayanprimo ahighqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT galvinbrendand ahighqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT durbinrichard ahighqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT korlachjonas ahighqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT lawniczakmarakn ahighqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT kingansarahb highqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT heatonhaynes highqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT cudinijuliana highqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT lambertchristinec highqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT baybayanprimo highqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT galvinbrendand highqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT durbinrichard highqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT korlachjonas highqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing
AT lawniczakmarakn highqualitydenovogenomeassemblyfromasinglemosquitousingpacbiosequencing