Cargando…

From Pine Cones to Read Clouds: Rescaffolding the Megagenome of Sugar Pine (Pinus lambertiana)

We investigate the utility and scalability of new read cloud technologies to improve the draft genome assemblies of the colossal, and largely repetitive, genomes of conifers. Synthetic long read technologies have existed in various forms as a means of reducing complexity and resolving repeats since...

Descripción completa

Detalles Bibliográficos
Autores principales: Crepeau, Marc W., Langley, Charles H., Stevens, Kristian A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5427496/
https://www.ncbi.nlm.nih.gov/pubmed/28341701
http://dx.doi.org/10.1534/g3.117.040055
_version_ 1783235637445394432
author Crepeau, Marc W.
Langley, Charles H.
Stevens, Kristian A.
author_facet Crepeau, Marc W.
Langley, Charles H.
Stevens, Kristian A.
author_sort Crepeau, Marc W.
collection PubMed
description We investigate the utility and scalability of new read cloud technologies to improve the draft genome assemblies of the colossal, and largely repetitive, genomes of conifers. Synthetic long read technologies have existed in various forms as a means of reducing complexity and resolving repeats since the outset of genome assembly. Recently, technologies that combine subhaploid pools of high molecular weight DNA with barcoding on a massive scale have brought new efficiencies to sample preparation and data generation. When combined with inexpensive light shotgun sequencing, the resulting data can be used to scaffold large genomes. The protocol is efficient enough to consider routinely for even the largest genomes. Conifers represent the largest reference genome projects executed to date. The largest of these is that of the conifer Pinus lambertiana (sugar pine), with a genome size of 31 billion bp. In this paper, we report on the molecular and computational protocols for scaffolding the P. lambertiana genome using the library technology from 10× Genomics. At 247,000 bp, the NG50 of the existing reference sequence is the highest scaffold contiguity among the currently published conifer assemblies; this new assembly’s NG50 is 1.94 million bp, an eightfold increase.
format Online
Article
Text
id pubmed-5427496
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-54274962017-05-12 From Pine Cones to Read Clouds: Rescaffolding the Megagenome of Sugar Pine (Pinus lambertiana) Crepeau, Marc W. Langley, Charles H. Stevens, Kristian A. G3 (Bethesda) Investigations We investigate the utility and scalability of new read cloud technologies to improve the draft genome assemblies of the colossal, and largely repetitive, genomes of conifers. Synthetic long read technologies have existed in various forms as a means of reducing complexity and resolving repeats since the outset of genome assembly. Recently, technologies that combine subhaploid pools of high molecular weight DNA with barcoding on a massive scale have brought new efficiencies to sample preparation and data generation. When combined with inexpensive light shotgun sequencing, the resulting data can be used to scaffold large genomes. The protocol is efficient enough to consider routinely for even the largest genomes. Conifers represent the largest reference genome projects executed to date. The largest of these is that of the conifer Pinus lambertiana (sugar pine), with a genome size of 31 billion bp. In this paper, we report on the molecular and computational protocols for scaffolding the P. lambertiana genome using the library technology from 10× Genomics. At 247,000 bp, the NG50 of the existing reference sequence is the highest scaffold contiguity among the currently published conifer assemblies; this new assembly’s NG50 is 1.94 million bp, an eightfold increase. Genetics Society of America 2017-04-05 /pmc/articles/PMC5427496/ /pubmed/28341701 http://dx.doi.org/10.1534/g3.117.040055 Text en Copyright © 2017 Crepeau et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Crepeau, Marc W.
Langley, Charles H.
Stevens, Kristian A.
From Pine Cones to Read Clouds: Rescaffolding the Megagenome of Sugar Pine (Pinus lambertiana)
title From Pine Cones to Read Clouds: Rescaffolding the Megagenome of Sugar Pine (Pinus lambertiana)
title_full From Pine Cones to Read Clouds: Rescaffolding the Megagenome of Sugar Pine (Pinus lambertiana)
title_fullStr From Pine Cones to Read Clouds: Rescaffolding the Megagenome of Sugar Pine (Pinus lambertiana)
title_full_unstemmed From Pine Cones to Read Clouds: Rescaffolding the Megagenome of Sugar Pine (Pinus lambertiana)
title_short From Pine Cones to Read Clouds: Rescaffolding the Megagenome of Sugar Pine (Pinus lambertiana)
title_sort from pine cones to read clouds: rescaffolding the megagenome of sugar pine (pinus lambertiana)
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5427496/
https://www.ncbi.nlm.nih.gov/pubmed/28341701
http://dx.doi.org/10.1534/g3.117.040055
work_keys_str_mv AT crepeaumarcw frompineconestoreadcloudsrescaffoldingthemegagenomeofsugarpinepinuslambertiana
AT langleycharlesh frompineconestoreadcloudsrescaffoldingthemegagenomeofsugarpinepinuslambertiana
AT stevenskristiana frompineconestoreadcloudsrescaffoldingthemegagenomeofsugarpinepinuslambertiana