Cargando…

Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes

BACKGROUND: We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico...

Descripción completa

Detalles Bibliográficos
Autores principales: Haiminen, Niina, Feltus, F Alex, Parida, Laxmi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224119/
https://www.ncbi.nlm.nih.gov/pubmed/21496274
http://dx.doi.org/10.1186/1471-2164-12-194
_version_ 1782217343977390080
author Haiminen, Niina
Feltus, F Alex
Parida, Laxmi
author_facet Haiminen, Niina
Feltus, F Alex
Parida, Laxmi
author_sort Haiminen, Niina
collection PubMed
description BACKGROUND: We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. RESULTS: The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. CONCLUSIONS: BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies.
format Online
Article
Text
id pubmed-3224119
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32241192011-11-26 Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes Haiminen, Niina Feltus, F Alex Parida, Laxmi BMC Genomics Research Article BACKGROUND: We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. RESULTS: The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. CONCLUSIONS: BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies. BioMed Central 2011-04-15 /pmc/articles/PMC3224119/ /pubmed/21496274 http://dx.doi.org/10.1186/1471-2164-12-194 Text en Copyright ©2011 Haiminen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Haiminen, Niina
Feltus, F Alex
Parida, Laxmi
Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes
title Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes
title_full Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes
title_fullStr Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes
title_full_unstemmed Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes
title_short Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes
title_sort assessing pooled bac and whole genome shotgun strategies for assembly of complex genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224119/
https://www.ncbi.nlm.nih.gov/pubmed/21496274
http://dx.doi.org/10.1186/1471-2164-12-194
work_keys_str_mv AT haiminenniina assessingpooledbacandwholegenomeshotgunstrategiesforassemblyofcomplexgenomes
AT feltusfalex assessingpooledbacandwholegenomeshotgunstrategiesforassemblyofcomplexgenomes
AT paridalaxmi assessingpooledbacandwholegenomeshotgunstrategiesforassemblyofcomplexgenomes