Cargando…

Sequencing of BAC pools by different next generation sequencing platforms and strategies

BACKGROUND: Next generation sequencing of BACs is a viable option for deciphering the sequence of even large and highly repetitive genomes. In order to optimize this strategy, we examined the influence of read length on the quality of Roche/454 sequence assemblies, to what extent Illumina/Solexa mat...

Descripción completa

Detalles Bibliográficos
Autores principales: Taudien, Stefan, Steuernagel, Burkhard, Ariyadasa, Ruvini, Schulte, Daniela, Schmutzer, Thomas, Groth, Marco, Felder, Marius, Petzold, Andreas, Scholz, Uwe, Mayer, Klaus FX, Stein, Nils, Platzer, Matthias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3213688/
https://www.ncbi.nlm.nih.gov/pubmed/21999860
http://dx.doi.org/10.1186/1756-0500-4-411
_version_ 1782216171872845824
author Taudien, Stefan
Steuernagel, Burkhard
Ariyadasa, Ruvini
Schulte, Daniela
Schmutzer, Thomas
Groth, Marco
Felder, Marius
Petzold, Andreas
Scholz, Uwe
Mayer, Klaus FX
Stein, Nils
Platzer, Matthias
author_facet Taudien, Stefan
Steuernagel, Burkhard
Ariyadasa, Ruvini
Schulte, Daniela
Schmutzer, Thomas
Groth, Marco
Felder, Marius
Petzold, Andreas
Scholz, Uwe
Mayer, Klaus FX
Stein, Nils
Platzer, Matthias
author_sort Taudien, Stefan
collection PubMed
description BACKGROUND: Next generation sequencing of BACs is a viable option for deciphering the sequence of even large and highly repetitive genomes. In order to optimize this strategy, we examined the influence of read length on the quality of Roche/454 sequence assemblies, to what extent Illumina/Solexa mate pairs (MPs) improve the assemblies by scaffolding and whether barcoding of BACs is dispensable. RESULTS: Sequencing four BACs with both FLX and Titanium technologies revealed similar sequencing accuracy, but showed that the longer Titanium reads produce considerably less misassemblies and gaps. The 454 assemblies of 96 barcoded BACs were improved by scaffolding 79% of the total contig length with MPs from a non-barcoded library. Assembly of the unmasked 454 sequences without separation by barcodes revealed chimeric contig formation to be a major problem, encompassing 47% of the total contig length. Masking the sequences reduced this fraction to 24%. CONCLUSION: Optimal BAC pool sequencing should be based on the longest available reads, with barcoding essential for a comprehensive assessment of both repetitive and non-repetitive sequence information. When interest is restricted to non-repetitive regions and repeats are masked prior to assembly, barcoding is non-essential. In any case, the assemblies can be improved considerably by scaffolding with non-barcoded BAC pool MPs.
format Online
Article
Text
id pubmed-3213688
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32136882011-11-12 Sequencing of BAC pools by different next generation sequencing platforms and strategies Taudien, Stefan Steuernagel, Burkhard Ariyadasa, Ruvini Schulte, Daniela Schmutzer, Thomas Groth, Marco Felder, Marius Petzold, Andreas Scholz, Uwe Mayer, Klaus FX Stein, Nils Platzer, Matthias BMC Res Notes Research Article BACKGROUND: Next generation sequencing of BACs is a viable option for deciphering the sequence of even large and highly repetitive genomes. In order to optimize this strategy, we examined the influence of read length on the quality of Roche/454 sequence assemblies, to what extent Illumina/Solexa mate pairs (MPs) improve the assemblies by scaffolding and whether barcoding of BACs is dispensable. RESULTS: Sequencing four BACs with both FLX and Titanium technologies revealed similar sequencing accuracy, but showed that the longer Titanium reads produce considerably less misassemblies and gaps. The 454 assemblies of 96 barcoded BACs were improved by scaffolding 79% of the total contig length with MPs from a non-barcoded library. Assembly of the unmasked 454 sequences without separation by barcodes revealed chimeric contig formation to be a major problem, encompassing 47% of the total contig length. Masking the sequences reduced this fraction to 24%. CONCLUSION: Optimal BAC pool sequencing should be based on the longest available reads, with barcoding essential for a comprehensive assessment of both repetitive and non-repetitive sequence information. When interest is restricted to non-repetitive regions and repeats are masked prior to assembly, barcoding is non-essential. In any case, the assemblies can be improved considerably by scaffolding with non-barcoded BAC pool MPs. BioMed Central 2011-10-14 /pmc/articles/PMC3213688/ /pubmed/21999860 http://dx.doi.org/10.1186/1756-0500-4-411 Text en Copyright ©2011 Taudien et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Taudien, Stefan
Steuernagel, Burkhard
Ariyadasa, Ruvini
Schulte, Daniela
Schmutzer, Thomas
Groth, Marco
Felder, Marius
Petzold, Andreas
Scholz, Uwe
Mayer, Klaus FX
Stein, Nils
Platzer, Matthias
Sequencing of BAC pools by different next generation sequencing platforms and strategies
title Sequencing of BAC pools by different next generation sequencing platforms and strategies
title_full Sequencing of BAC pools by different next generation sequencing platforms and strategies
title_fullStr Sequencing of BAC pools by different next generation sequencing platforms and strategies
title_full_unstemmed Sequencing of BAC pools by different next generation sequencing platforms and strategies
title_short Sequencing of BAC pools by different next generation sequencing platforms and strategies
title_sort sequencing of bac pools by different next generation sequencing platforms and strategies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3213688/
https://www.ncbi.nlm.nih.gov/pubmed/21999860
http://dx.doi.org/10.1186/1756-0500-4-411
work_keys_str_mv AT taudienstefan sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT steuernagelburkhard sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT ariyadasaruvini sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT schultedaniela sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT schmutzerthomas sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT grothmarco sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT feldermarius sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT petzoldandreas sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT scholzuwe sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT mayerklausfx sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT steinnils sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies
AT platzermatthias sequencingofbacpoolsbydifferentnextgenerationsequencingplatformsandstrategies