Cargando…

Detecting Key Structural Features within Highly Recombined Genes

Many microorganisms exhibit high levels of intragenic recombination following horizontal gene transfer events. Furthermore, many microbial genes are subject to strong diversifying selection as part of the pathogenic process. A multiple sequence alignment is an essential starting point for many of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Wertz, John E, McGregor, Karen F, Bessen, Debra E
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1782043/
https://www.ncbi.nlm.nih.gov/pubmed/17257051
http://dx.doi.org/10.1371/journal.pcbi.0030014
_version_ 1782132005946785792
author Wertz, John E
McGregor, Karen F
Bessen, Debra E
author_facet Wertz, John E
McGregor, Karen F
Bessen, Debra E
author_sort Wertz, John E
collection PubMed
description Many microorganisms exhibit high levels of intragenic recombination following horizontal gene transfer events. Furthermore, many microbial genes are subject to strong diversifying selection as part of the pathogenic process. A multiple sequence alignment is an essential starting point for many of the tools that provide fundamental insights on gene structure and evolution, such as phylogenetics; however, an accurate alignment is not always possible to attain. In this study, a new analytic approach was developed in order to better quantify the genetic organization of highly diversified genes whose alleles do not align. This BLAST-based method, denoted BLAST Miner, employs an iterative process that places short segments of highly similar sequence into discrete datasets that are designated “modules.” The relative positions of modules along the length of the genes, and their frequency of occurrence, are used to identify sequence duplications, insertions, and rearrangements. Partial alleles of sof from Streptococcus pyogenes, encoding a surface protein under host immune selection, were analyzed for module content. High-frequency Modules 6 and 13 were identified and examined in depth. Nucleotide sequences corresponding to both modules contain numerous duplications and inverted repeats, whereby many codons form palindromic pairs. Combined with evidence for a strong codon usage bias, data suggest that Module 6 and 13 sequences are under selection to preserve their nucleic acid secondary structure. The concentration of overlapping tandem and inverted repeats within a small region of DNA is highly suggestive of a mechanistic role for Module 6 and 13 sequences in promoting aberrant recombination. Analysis of pbp2X alleles from Streptococcus pneumoniae, encoding cell wall enzymes that confer antibiotic resistance, supports the broad applicability of this tool in deciphering the genetic organization of highly recombined genes. BLAST Miner shares with phylogenetics the important predictive quality that leads to the generation of testable hypotheses based on sequence data.
format Text
id pubmed-1782043
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-17820432007-01-27 Detecting Key Structural Features within Highly Recombined Genes Wertz, John E McGregor, Karen F Bessen, Debra E PLoS Comput Biol Research Article Many microorganisms exhibit high levels of intragenic recombination following horizontal gene transfer events. Furthermore, many microbial genes are subject to strong diversifying selection as part of the pathogenic process. A multiple sequence alignment is an essential starting point for many of the tools that provide fundamental insights on gene structure and evolution, such as phylogenetics; however, an accurate alignment is not always possible to attain. In this study, a new analytic approach was developed in order to better quantify the genetic organization of highly diversified genes whose alleles do not align. This BLAST-based method, denoted BLAST Miner, employs an iterative process that places short segments of highly similar sequence into discrete datasets that are designated “modules.” The relative positions of modules along the length of the genes, and their frequency of occurrence, are used to identify sequence duplications, insertions, and rearrangements. Partial alleles of sof from Streptococcus pyogenes, encoding a surface protein under host immune selection, were analyzed for module content. High-frequency Modules 6 and 13 were identified and examined in depth. Nucleotide sequences corresponding to both modules contain numerous duplications and inverted repeats, whereby many codons form palindromic pairs. Combined with evidence for a strong codon usage bias, data suggest that Module 6 and 13 sequences are under selection to preserve their nucleic acid secondary structure. The concentration of overlapping tandem and inverted repeats within a small region of DNA is highly suggestive of a mechanistic role for Module 6 and 13 sequences in promoting aberrant recombination. Analysis of pbp2X alleles from Streptococcus pneumoniae, encoding cell wall enzymes that confer antibiotic resistance, supports the broad applicability of this tool in deciphering the genetic organization of highly recombined genes. BLAST Miner shares with phylogenetics the important predictive quality that leads to the generation of testable hypotheses based on sequence data. Public Library of Science 2007-01 2007-01-26 /pmc/articles/PMC1782043/ /pubmed/17257051 http://dx.doi.org/10.1371/journal.pcbi.0030014 Text en © 2007 Wertz et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Wertz, John E
McGregor, Karen F
Bessen, Debra E
Detecting Key Structural Features within Highly Recombined Genes
title Detecting Key Structural Features within Highly Recombined Genes
title_full Detecting Key Structural Features within Highly Recombined Genes
title_fullStr Detecting Key Structural Features within Highly Recombined Genes
title_full_unstemmed Detecting Key Structural Features within Highly Recombined Genes
title_short Detecting Key Structural Features within Highly Recombined Genes
title_sort detecting key structural features within highly recombined genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1782043/
https://www.ncbi.nlm.nih.gov/pubmed/17257051
http://dx.doi.org/10.1371/journal.pcbi.0030014
work_keys_str_mv AT wertzjohne detectingkeystructuralfeatureswithinhighlyrecombinedgenes
AT mcgregorkarenf detectingkeystructuralfeatureswithinhighlyrecombinedgenes
AT bessendebrae detectingkeystructuralfeatureswithinhighlyrecombinedgenes