Cargando…

High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies

BACKGROUND: Massively parallel DNA sequencing instruments are enabling the decoding of whole genomes at significantly lower cost and higher throughput than classical Sanger technology. Each of these technologies have been estimated to yield assemblies with more problematic features than the standard...

Descripción completa

Detalles Bibliográficos
Autores principales: Aury, Jean-Marc, Cruaud, Corinne, Barbe, Valérie, Rogier, Odile, Mangenot, Sophie, Samson, Gaelle, Poulain, Julie, Anthouard, Véronique, Scarpelli, Claude, Artiguenave, François, Wincker, Patrick
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2625371/
https://www.ncbi.nlm.nih.gov/pubmed/19087275
http://dx.doi.org/10.1186/1471-2164-9-603
_version_ 1782163437110951936
author Aury, Jean-Marc
Cruaud, Corinne
Barbe, Valérie
Rogier, Odile
Mangenot, Sophie
Samson, Gaelle
Poulain, Julie
Anthouard, Véronique
Scarpelli, Claude
Artiguenave, François
Wincker, Patrick
author_facet Aury, Jean-Marc
Cruaud, Corinne
Barbe, Valérie
Rogier, Odile
Mangenot, Sophie
Samson, Gaelle
Poulain, Julie
Anthouard, Véronique
Scarpelli, Claude
Artiguenave, François
Wincker, Patrick
author_sort Aury, Jean-Marc
collection PubMed
description BACKGROUND: Massively parallel DNA sequencing instruments are enabling the decoding of whole genomes at significantly lower cost and higher throughput than classical Sanger technology. Each of these technologies have been estimated to yield assemblies with more problematic features than the standard method. These problems are of a different nature depending on the techniques used. So, an appropriate mix of technologies may help resolve most difficulties, and eventually provide assemblies of high quality without requiring any Sanger-based input. RESULTS: We compared assemblies obtained using Sanger data with those from different inputs from New Sequencing Technologies. The assemblies were systematically compared with a reference finished sequence. We found that the 454 GSFLX can efficiently produce high continuity when used at high coverage. The potential to enhance continuity by scaffolding was tested using 454 sequences from circularized genomic fragments. Finally, we explore the use of Solexa-Illumina short reads to polish the genome draft by implementing a technique to correct 454 consensus errors. CONCLUSION: High quality drafts can be produced for small genomes without any Sanger data input. We found that 454 GSFLX and Solexa/Illumina show great complementarity in producing large contigs and supercontigs with a low error rate.
format Text
id pubmed-2625371
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26253712009-01-14 High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies Aury, Jean-Marc Cruaud, Corinne Barbe, Valérie Rogier, Odile Mangenot, Sophie Samson, Gaelle Poulain, Julie Anthouard, Véronique Scarpelli, Claude Artiguenave, François Wincker, Patrick BMC Genomics Research Article BACKGROUND: Massively parallel DNA sequencing instruments are enabling the decoding of whole genomes at significantly lower cost and higher throughput than classical Sanger technology. Each of these technologies have been estimated to yield assemblies with more problematic features than the standard method. These problems are of a different nature depending on the techniques used. So, an appropriate mix of technologies may help resolve most difficulties, and eventually provide assemblies of high quality without requiring any Sanger-based input. RESULTS: We compared assemblies obtained using Sanger data with those from different inputs from New Sequencing Technologies. The assemblies were systematically compared with a reference finished sequence. We found that the 454 GSFLX can efficiently produce high continuity when used at high coverage. The potential to enhance continuity by scaffolding was tested using 454 sequences from circularized genomic fragments. Finally, we explore the use of Solexa-Illumina short reads to polish the genome draft by implementing a technique to correct 454 consensus errors. CONCLUSION: High quality drafts can be produced for small genomes without any Sanger data input. We found that 454 GSFLX and Solexa/Illumina show great complementarity in producing large contigs and supercontigs with a low error rate. BioMed Central 2008-12-16 /pmc/articles/PMC2625371/ /pubmed/19087275 http://dx.doi.org/10.1186/1471-2164-9-603 Text en Copyright © 2008 Aury et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Aury, Jean-Marc
Cruaud, Corinne
Barbe, Valérie
Rogier, Odile
Mangenot, Sophie
Samson, Gaelle
Poulain, Julie
Anthouard, Véronique
Scarpelli, Claude
Artiguenave, François
Wincker, Patrick
High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies
title High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies
title_full High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies
title_fullStr High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies
title_full_unstemmed High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies
title_short High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies
title_sort high quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2625371/
https://www.ncbi.nlm.nih.gov/pubmed/19087275
http://dx.doi.org/10.1186/1471-2164-9-603
work_keys_str_mv AT auryjeanmarc highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT cruaudcorinne highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT barbevalerie highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT rogierodile highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT mangenotsophie highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT samsongaelle highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT poulainjulie highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT anthouardveronique highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT scarpelliclaude highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT artiguenavefrancois highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies
AT winckerpatrick highqualitydraftsequencesforprokaryoticgenomesusingamixofnewsequencingtechnologies