
Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data

White spruce (Picea glauca) is a dominant conifer of the boreal forests of North America, and providing genomics resources for this commercially valuable tree will help improve forest management and conservation efforts. Sequencing and assembling the large and highly repetitive spruce genome though...


Bibliographic Details
Main authors: Birol, Inanc; Raymond, Anthony; Jackman, Shaun D.; Pleasance, Stephen; Coope, Robin; Taylor, Greg A.; Yuen, Macaire Man Saint; Keeling, Christopher I.; Brand, Dana; Vandervalk, Benjamin P.; Kirk, Heather; Pandoh, Pawan; Moore, Richard A.; Zhao, Yongjun; Mungall, Andrew J.; Jaquish, Barry; Yanchuk, Alvin; Ritland, Carol; Boyle, Brian; Bousquet, Jean; Ritland, Kermit; MacKay, John; Bohlmann, Jörg; Jones, Steven J.M.
Format: Online Article Text
Language: English
Published: Oxford University Press 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3673215/
https://www.ncbi.nlm.nih.gov/pubmed/23698863
http://dx.doi.org/10.1093/bioinformatics/btt178
author Birol, Inanc
Raymond, Anthony
Jackman, Shaun D.
Pleasance, Stephen
Coope, Robin
Taylor, Greg A.
Yuen, Macaire Man Saint
Keeling, Christopher I.
Brand, Dana
Vandervalk, Benjamin P.
Kirk, Heather
Pandoh, Pawan
Moore, Richard A.
Zhao, Yongjun
Mungall, Andrew J.
Jaquish, Barry
Yanchuk, Alvin
Ritland, Carol
Boyle, Brian
Bousquet, Jean
Ritland, Kermit
MacKay, John
Bohlmann, Jörg
Jones, Steven J.M.
author_sort Birol, Inanc
collection PubMed
description White spruce (Picea glauca) is a dominant conifer of the boreal forests of North America, and providing genomics resources for this commercially valuable tree will help improve forest management and conservation efforts. Sequencing and assembling the large and highly repetitive spruce genome, though, pushes the boundaries of current technology. Here, we describe a whole-genome shotgun sequencing strategy using two Illumina sequencing platforms and an assembly approach using the ABySS software. We report a 20.8 giga base pair draft genome in 4.9 million scaffolds, with a scaffold N50 of 20 356 bp. We demonstrate how recent improvements in sequencing technology, especially increasing read lengths and paired-end reads from longer fragments, have a major impact on assembly contiguity. We also note that scalable bioinformatics tools are instrumental in providing rapid draft assemblies. Availability: The Picea glauca genome sequencing and assembly data are available through NCBI (Accession#: ALWZ0100000000, PID: PRJNA83435). http://www.ncbi.nlm.nih.gov/bioproject/83435. Contact: ibirol@bcgsc.ca Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3673215
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3673215 2013-06-05 Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data Bioinformatics Original Papers Oxford University Press 2013-06-15 2013-05-22 /pmc/articles/PMC3673215/ /pubmed/23698863 http://dx.doi.org/10.1093/bioinformatics/btt178 Text en © The Author 2013. Published by Oxford University Press.
http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data
topic Original Papers
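The description in this record reports a scaffold N50 of 20 356 bp. As a minimal sketch of how that statistic is conventionally defined (the length L such that scaffolds of length L or longer contain at least half of the total assembled bases), the following Python function computes it from a list of lengths. This is not taken from the article or from ABySS; the function name `n50` and the toy lengths are hypothetical and used only for illustration.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of all bases.
    """
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    # Only reached when no lengths were supplied.
    raise ValueError("lengths must be non-empty")


if __name__ == "__main__":
    # Hypothetical toy assembly, not data from the spruce genome.
    toy_scaffolds = [100, 80, 60, 40, 20]
    print(n50(toy_scaffolds))
```

For the toy input, the total is 300 bases; the two longest scaffolds (100 + 80) already cover more than 150, so the function returns 80. Applied to a real assembly, the lengths would come from the scaffold FASTA records.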