Cargando…

Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences

Motivation: To assess the potential of different types of sequence data combined with de novo and hybrid assembly approaches to improve existing draft genome sequences. Results: Illumina, 454 and PacBio sequencing technologies were used to generate de novo and hybrid genome assemblies for four diffe...

Descripción completa

Detalles Bibliográficos
Autores principales: Utturkar, Sagar M., Klingeman, Dawn M., Land, Miriam L., Schadt, Christopher W., Doktycz, Mitchel J., Pelletier, Dale A., Brown, Steven D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4173024/
https://www.ncbi.nlm.nih.gov/pubmed/24930142
http://dx.doi.org/10.1093/bioinformatics/btu391
_version_ 1782336123468513280
author Utturkar, Sagar M.
Klingeman, Dawn M.
Land, Miriam L.
Schadt, Christopher W.
Doktycz, Mitchel J.
Pelletier, Dale A.
Brown, Steven D.
author_facet Utturkar, Sagar M.
Klingeman, Dawn M.
Land, Miriam L.
Schadt, Christopher W.
Doktycz, Mitchel J.
Pelletier, Dale A.
Brown, Steven D.
author_sort Utturkar, Sagar M.
collection PubMed
description Motivation: To assess the potential of different types of sequence data combined with de novo and hybrid assembly approaches to improve existing draft genome sequences. Results: Illumina, 454 and PacBio sequencing technologies were used to generate de novo and hybrid genome assemblies for four different bacteria, which were assessed for quality using summary statistics (e.g. number of contigs, N50) and in silico evaluation tools. Differences in predictions of multiple copies of rDNA operons for each respective bacterium were evaluated by PCR and Sanger sequencing, and then the validated results were applied as an additional criterion to rank assemblies. In general, assemblies using longer PacBio reads were better able to resolve repetitive regions. In this study, the combination of Illumina and PacBio sequence data assembled through the ALLPATHS-LG algorithm gave the best summary statistics and most accurate rDNA operon number predictions. This study will aid others looking to improve existing draft genome assemblies. Availability and implementation: All assembly tools except CLC Genomics Workbench are freely available under GNU General Public License. Contact: brownsd@ornl.gov Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4173024
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41730242014-09-25 Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences Utturkar, Sagar M. Klingeman, Dawn M. Land, Miriam L. Schadt, Christopher W. Doktycz, Mitchel J. Pelletier, Dale A. Brown, Steven D. Bioinformatics Original Papers Motivation: To assess the potential of different types of sequence data combined with de novo and hybrid assembly approaches to improve existing draft genome sequences. Results: Illumina, 454 and PacBio sequencing technologies were used to generate de novo and hybrid genome assemblies for four different bacteria, which were assessed for quality using summary statistics (e.g. number of contigs, N50) and in silico evaluation tools. Differences in predictions of multiple copies of rDNA operons for each respective bacterium were evaluated by PCR and Sanger sequencing, and then the validated results were applied as an additional criterion to rank assemblies. In general, assemblies using longer PacBio reads were better able to resolve repetitive regions. In this study, the combination of Illumina and PacBio sequence data assembled through the ALLPATHS-LG algorithm gave the best summary statistics and most accurate rDNA operon number predictions. This study will aid others looking to improve existing draft genome assemblies. Availability and implementation: All assembly tools except CLC Genomics Workbench are freely available under GNU General Public License. Contact: brownsd@ornl.gov Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-10 2014-06-14 /pmc/articles/PMC4173024/ /pubmed/24930142 http://dx.doi.org/10.1093/bioinformatics/btu391 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Utturkar, Sagar M.
Klingeman, Dawn M.
Land, Miriam L.
Schadt, Christopher W.
Doktycz, Mitchel J.
Pelletier, Dale A.
Brown, Steven D.
Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences
title Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences
title_full Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences
title_fullStr Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences
title_full_unstemmed Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences
title_short Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences
title_sort evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4173024/
https://www.ncbi.nlm.nih.gov/pubmed/24930142
http://dx.doi.org/10.1093/bioinformatics/btu391
work_keys_str_mv AT utturkarsagarm evaluationandvalidationofdenovoandhybridassemblytechniquestoderivehighqualitygenomesequences
AT klingemandawnm evaluationandvalidationofdenovoandhybridassemblytechniquestoderivehighqualitygenomesequences
AT landmiriaml evaluationandvalidationofdenovoandhybridassemblytechniquestoderivehighqualitygenomesequences
AT schadtchristopherw evaluationandvalidationofdenovoandhybridassemblytechniquestoderivehighqualitygenomesequences
AT doktyczmitchelj evaluationandvalidationofdenovoandhybridassemblytechniquestoderivehighqualitygenomesequences
AT pelletierdalea evaluationandvalidationofdenovoandhybridassemblytechniquestoderivehighqualitygenomesequences
AT brownstevend evaluationandvalidationofdenovoandhybridassemblytechniquestoderivehighqualitygenomesequences