Cargando…
Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes
BACKGROUND: We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224119/ https://www.ncbi.nlm.nih.gov/pubmed/21496274 http://dx.doi.org/10.1186/1471-2164-12-194 |
_version_ | 1782217343977390080 |
---|---|
author | Haiminen, Niina Feltus, F Alex Parida, Laxmi |
author_facet | Haiminen, Niina Feltus, F Alex Parida, Laxmi |
author_sort | Haiminen, Niina |
collection | PubMed |
description | BACKGROUND: We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. RESULTS: The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. CONCLUSIONS: BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies. |
format | Online Article Text |
id | pubmed-3224119 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32241192011-11-26 Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes Haiminen, Niina Feltus, F Alex Parida, Laxmi BMC Genomics Research Article BACKGROUND: We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. RESULTS: The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. CONCLUSIONS: BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies. BioMed Central 2011-04-15 /pmc/articles/PMC3224119/ /pubmed/21496274 http://dx.doi.org/10.1186/1471-2164-12-194 Text en Copyright ©2011 Haiminen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Haiminen, Niina Feltus, F Alex Parida, Laxmi Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes |
title | Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes |
title_full | Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes |
title_fullStr | Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes |
title_full_unstemmed | Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes |
title_short | Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes |
title_sort | assessing pooled bac and whole genome shotgun strategies for assembly of complex genomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224119/ https://www.ncbi.nlm.nih.gov/pubmed/21496274 http://dx.doi.org/10.1186/1471-2164-12-194 |
work_keys_str_mv | AT haiminenniina assessingpooledbacandwholegenomeshotgunstrategiesforassemblyofcomplexgenomes AT feltusfalex assessingpooledbacandwholegenomeshotgunstrategiesforassemblyofcomplexgenomes AT paridalaxmi assessingpooledbacandwholegenomeshotgunstrategiesforassemblyofcomplexgenomes |