Cargando…

A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were anal...

Descripción completa

Detalles Bibliográficos
Autores principales: Utturkar, Sagar M., Klingeman, Dawn M., Hurt, Richard A., Brown, Steven D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5513972/
https://www.ncbi.nlm.nih.gov/pubmed/28769883
http://dx.doi.org/10.3389/fmicb.2017.01272
_version_ 1783250750511513600
author Utturkar, Sagar M.
Klingeman, Dawn M.
Hurt, Richard A.
Brown, Steven D.
author_facet Utturkar, Sagar M.
Klingeman, Dawn M.
Hurt, Richard A.
Brown, Steven D.
author_sort Utturkar, Sagar M.
collection PubMed
description This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
format Online
Article
Text
id pubmed-5513972
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-55139722017-08-02 A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies Utturkar, Sagar M. Klingeman, Dawn M. Hurt, Richard A. Brown, Steven D. Front Microbiol Microbiology This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. Frontiers Media S.A. 2017-07-18 /pmc/articles/PMC5513972/ /pubmed/28769883 http://dx.doi.org/10.3389/fmicb.2017.01272 Text en Copyright © 2017 Utturkar, Klingeman, Hurt and Brown. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Utturkar, Sagar M.
Klingeman, Dawn M.
Hurt, Richard A.
Brown, Steven D.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
title A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
title_full A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
title_fullStr A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
title_full_unstemmed A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
title_short A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
title_sort case study into microbial genome assembly gap sequences and finishing strategies
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5513972/
https://www.ncbi.nlm.nih.gov/pubmed/28769883
http://dx.doi.org/10.3389/fmicb.2017.01272
work_keys_str_mv AT utturkarsagarm acasestudyintomicrobialgenomeassemblygapsequencesandfinishingstrategies
AT klingemandawnm acasestudyintomicrobialgenomeassemblygapsequencesandfinishingstrategies
AT hurtricharda acasestudyintomicrobialgenomeassemblygapsequencesandfinishingstrategies
AT brownstevend acasestudyintomicrobialgenomeassemblygapsequencesandfinishingstrategies
AT utturkarsagarm casestudyintomicrobialgenomeassemblygapsequencesandfinishingstrategies
AT klingemandawnm casestudyintomicrobialgenomeassemblygapsequencesandfinishingstrategies
AT hurtricharda casestudyintomicrobialgenomeassemblygapsequencesandfinishingstrategies
AT brownstevend casestudyintomicrobialgenomeassemblygapsequencesandfinishingstrategies