Cargando…
Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data
White spruce (Picea glauca) is a dominant conifer of the boreal forests of North America, and providing genomics resources for this commercially valuable tree will help improve forest management and conservation efforts. Sequencing and assembling the large and highly repetitive spruce genome though...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3673215/ https://www.ncbi.nlm.nih.gov/pubmed/23698863 http://dx.doi.org/10.1093/bioinformatics/btt178 |
_version_ | 1782272227988733952 |
---|---|
author | Birol, Inanc Raymond, Anthony Jackman, Shaun D. Pleasance, Stephen Coope, Robin Taylor, Greg A. Yuen, Macaire Man Saint Keeling, Christopher I. Brand, Dana Vandervalk, Benjamin P. Kirk, Heather Pandoh, Pawan Moore, Richard A. Zhao, Yongjun Mungall, Andrew J. Jaquish, Barry Yanchuk, Alvin Ritland, Carol Boyle, Brian Bousquet, Jean Ritland, Kermit MacKay, John Bohlmann, Jörg Jones, Steven J.M. |
author_facet | Birol, Inanc Raymond, Anthony Jackman, Shaun D. Pleasance, Stephen Coope, Robin Taylor, Greg A. Yuen, Macaire Man Saint Keeling, Christopher I. Brand, Dana Vandervalk, Benjamin P. Kirk, Heather Pandoh, Pawan Moore, Richard A. Zhao, Yongjun Mungall, Andrew J. Jaquish, Barry Yanchuk, Alvin Ritland, Carol Boyle, Brian Bousquet, Jean Ritland, Kermit MacKay, John Bohlmann, Jörg Jones, Steven J.M. |
author_sort | Birol, Inanc |
collection | PubMed |
description | White spruce (Picea glauca) is a dominant conifer of the boreal forests of North America, and providing genomics resources for this commercially valuable tree will help improve forest management and conservation efforts. Sequencing and assembling the large and highly repetitive spruce genome though pushes the boundaries of the current technology. Here, we describe a whole-genome shotgun sequencing strategy using two Illumina sequencing platforms and an assembly approach using the ABySS software. We report a 20.8 giga base pairs draft genome in 4.9 million scaffolds, with a scaffold N50 of 20 356 bp. We demonstrate how recent improvements in the sequencing technology, especially increasing read lengths and paired end reads from longer fragments have a major impact on the assembly contiguity. We also note that scalable bioinformatics tools are instrumental in providing rapid draft assemblies. Availability: The Picea glauca genome sequencing and assembly data are available through NCBI (Accession#: ALWZ0100000000 PID: PRJNA83435). http://www.ncbi.nlm.nih.gov/bioproject/83435. Contact: ibirol@bcgsc.ca Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3673215 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-36732152013-06-05 Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data Birol, Inanc Raymond, Anthony Jackman, Shaun D. Pleasance, Stephen Coope, Robin Taylor, Greg A. Yuen, Macaire Man Saint Keeling, Christopher I. Brand, Dana Vandervalk, Benjamin P. Kirk, Heather Pandoh, Pawan Moore, Richard A. Zhao, Yongjun Mungall, Andrew J. Jaquish, Barry Yanchuk, Alvin Ritland, Carol Boyle, Brian Bousquet, Jean Ritland, Kermit MacKay, John Bohlmann, Jörg Jones, Steven J.M. Bioinformatics Original Papers White spruce (Picea glauca) is a dominant conifer of the boreal forests of North America, and providing genomics resources for this commercially valuable tree will help improve forest management and conservation efforts. Sequencing and assembling the large and highly repetitive spruce genome though pushes the boundaries of the current technology. Here, we describe a whole-genome shotgun sequencing strategy using two Illumina sequencing platforms and an assembly approach using the ABySS software. We report a 20.8 giga base pairs draft genome in 4.9 million scaffolds, with a scaffold N50 of 20 356 bp. We demonstrate how recent improvements in the sequencing technology, especially increasing read lengths and paired end reads from longer fragments have a major impact on the assembly contiguity. We also note that scalable bioinformatics tools are instrumental in providing rapid draft assemblies. Availability: The Picea glauca genome sequencing and assembly data are available through NCBI (Accession#: ALWZ0100000000 PID: PRJNA83435). http://www.ncbi.nlm.nih.gov/bioproject/83435. Contact: ibirol@bcgsc.ca Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-06-15 2013-05-22 /pmc/articles/PMC3673215/ /pubmed/23698863 http://dx.doi.org/10.1093/bioinformatics/btt178 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Birol, Inanc Raymond, Anthony Jackman, Shaun D. Pleasance, Stephen Coope, Robin Taylor, Greg A. Yuen, Macaire Man Saint Keeling, Christopher I. Brand, Dana Vandervalk, Benjamin P. Kirk, Heather Pandoh, Pawan Moore, Richard A. Zhao, Yongjun Mungall, Andrew J. Jaquish, Barry Yanchuk, Alvin Ritland, Carol Boyle, Brian Bousquet, Jean Ritland, Kermit MacKay, John Bohlmann, Jörg Jones, Steven J.M. Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data |
title | Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data |
title_full | Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data |
title_fullStr | Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data |
title_full_unstemmed | Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data |
title_short | Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data |
title_sort | assembling the 20 gb white spruce (picea glauca) genome from whole-genome shotgun sequencing data |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3673215/ https://www.ncbi.nlm.nih.gov/pubmed/23698863 http://dx.doi.org/10.1093/bioinformatics/btt178 |
work_keys_str_mv | AT birolinanc assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT raymondanthony assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT jackmanshaund assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT pleasancestephen assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT cooperobin assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT taylorgrega assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT yuenmacairemansaint assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT keelingchristopheri assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT branddana assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT vandervalkbenjaminp assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT kirkheather assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT pandohpawan assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT moorericharda assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT zhaoyongjun assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT mungallandrewj assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT jaquishbarry assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT yanchukalvin assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT ritlandcarol assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT boylebrian assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT bousquetjean assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT ritlandkermit assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT mackayjohn assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT bohlmannjorg assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata AT jonesstevenjm assemblingthe20gbwhitesprucepiceaglaucagenomefromwholegenomeshotgunsequencingdata |