Cargando…

A Computational Method to Quantify the Effects of Slipped Strand Mispairing on Bacterial Tetranucleotide Repeats

The virulence and pathogenicity of bacterial pathogens are related to their adaptability to changing environments. One process enabling adaptation is based on minor changes in genome sequence, as small as a few base pairs, within segments of genome called simple sequence repeats (SSRs) that consist...

Descripción completa

Detalles Bibliográficos
Autores principales: Harhay, Gregory P., Harhay, Dayna M., Bono, James L., Capik, Sarah F., DeDonder, Keith D., Apley, Michael D., Lubbers, Brian V., White, Bradley J., Larson, Robert L., Smith, Timothy P. L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889271/
https://www.ncbi.nlm.nih.gov/pubmed/31792233
http://dx.doi.org/10.1038/s41598-019-53866-z
_version_ 1783475380114423808
author Harhay, Gregory P.
Harhay, Dayna M.
Bono, James L.
Capik, Sarah F.
DeDonder, Keith D.
Apley, Michael D.
Lubbers, Brian V.
White, Bradley J.
Larson, Robert L.
Smith, Timothy P. L.
author_facet Harhay, Gregory P.
Harhay, Dayna M.
Bono, James L.
Capik, Sarah F.
DeDonder, Keith D.
Apley, Michael D.
Lubbers, Brian V.
White, Bradley J.
Larson, Robert L.
Smith, Timothy P. L.
author_sort Harhay, Gregory P.
collection PubMed
description The virulence and pathogenicity of bacterial pathogens are related to their adaptability to changing environments. One process enabling adaptation is based on minor changes in genome sequence, as small as a few base pairs, within segments of genome called simple sequence repeats (SSRs) that consist of multiple copies of a short sequence (from one to several nucleotides), repeated in series. SSRs are found in eukaryotes as well as prokaryotes, and length variation in them occurs at frequencies up to a million-fold higher than bacterial point mutations through the process of slipped strand mispairing (SSM) by DNA polymerase during replication. The characterization of SSR length by standard sequencing methods is complicated by the appearance of length variation introduced during the sequencing process that obscures the lower abundance repeat number variants in a population. Here we report a computational approach to correct for sequencing process-induced artifacts, validated for tetranucleotide repeats by use of synthetic constructs of fixed, known length. We apply this method to a laboratory culture of Histophilus somni, prepared from a single colony, and demonstrate that the culture consists of populations of distinct sequence phase and length variants at individual tetranucleotide SSR loci.
format Online
Article
Text
id pubmed-6889271
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-68892712019-12-10 A Computational Method to Quantify the Effects of Slipped Strand Mispairing on Bacterial Tetranucleotide Repeats Harhay, Gregory P. Harhay, Dayna M. Bono, James L. Capik, Sarah F. DeDonder, Keith D. Apley, Michael D. Lubbers, Brian V. White, Bradley J. Larson, Robert L. Smith, Timothy P. L. Sci Rep Article The virulence and pathogenicity of bacterial pathogens are related to their adaptability to changing environments. One process enabling adaptation is based on minor changes in genome sequence, as small as a few base pairs, within segments of genome called simple sequence repeats (SSRs) that consist of multiple copies of a short sequence (from one to several nucleotides), repeated in series. SSRs are found in eukaryotes as well as prokaryotes, and length variation in them occurs at frequencies up to a million-fold higher than bacterial point mutations through the process of slipped strand mispairing (SSM) by DNA polymerase during replication. The characterization of SSR length by standard sequencing methods is complicated by the appearance of length variation introduced during the sequencing process that obscures the lower abundance repeat number variants in a population. Here we report a computational approach to correct for sequencing process-induced artifacts, validated for tetranucleotide repeats by use of synthetic constructs of fixed, known length. We apply this method to a laboratory culture of Histophilus somni, prepared from a single colony, and demonstrate that the culture consists of populations of distinct sequence phase and length variants at individual tetranucleotide SSR loci. Nature Publishing Group UK 2019-12-02 /pmc/articles/PMC6889271/ /pubmed/31792233 http://dx.doi.org/10.1038/s41598-019-53866-z Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Harhay, Gregory P.
Harhay, Dayna M.
Bono, James L.
Capik, Sarah F.
DeDonder, Keith D.
Apley, Michael D.
Lubbers, Brian V.
White, Bradley J.
Larson, Robert L.
Smith, Timothy P. L.
A Computational Method to Quantify the Effects of Slipped Strand Mispairing on Bacterial Tetranucleotide Repeats
title A Computational Method to Quantify the Effects of Slipped Strand Mispairing on Bacterial Tetranucleotide Repeats
title_full A Computational Method to Quantify the Effects of Slipped Strand Mispairing on Bacterial Tetranucleotide Repeats
title_fullStr A Computational Method to Quantify the Effects of Slipped Strand Mispairing on Bacterial Tetranucleotide Repeats
title_full_unstemmed A Computational Method to Quantify the Effects of Slipped Strand Mispairing on Bacterial Tetranucleotide Repeats
title_short A Computational Method to Quantify the Effects of Slipped Strand Mispairing on Bacterial Tetranucleotide Repeats
title_sort computational method to quantify the effects of slipped strand mispairing on bacterial tetranucleotide repeats
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889271/
https://www.ncbi.nlm.nih.gov/pubmed/31792233
http://dx.doi.org/10.1038/s41598-019-53866-z
work_keys_str_mv AT harhaygregoryp acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT harhaydaynam acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT bonojamesl acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT capiksarahf acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT dedonderkeithd acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT apleymichaeld acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT lubbersbrianv acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT whitebradleyj acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT larsonrobertl acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT smithtimothypl acomputationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT harhaygregoryp computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT harhaydaynam computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT bonojamesl computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT capiksarahf computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT dedonderkeithd computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT apleymichaeld computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT lubbersbrianv computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT whitebradleyj computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT larsonrobertl computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats
AT smithtimothypl computationalmethodtoquantifytheeffectsofslippedstrandmispairingonbacterialtetranucleotiderepeats