Cargando…

Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli

A majority of large-scale bacterial genome rearrangements involve mobile genetic elements such as insertion sequence (IS) elements. Here we report novel insertions and excisions of IS elements and recombination between homologous IS elements identified in a large collection of Escherichia coli mutat...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Heewook, Doak, Thomas G., Popodi, Ellen, Foster, Patricia L., Tang, Haixu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009759/
https://www.ncbi.nlm.nih.gov/pubmed/27431326
http://dx.doi.org/10.1093/nar/gkw647
_version_ 1782451577611616256
author Lee, Heewook
Doak, Thomas G.
Popodi, Ellen
Foster, Patricia L.
Tang, Haixu
author_facet Lee, Heewook
Doak, Thomas G.
Popodi, Ellen
Foster, Patricia L.
Tang, Haixu
author_sort Lee, Heewook
collection PubMed
description A majority of large-scale bacterial genome rearrangements involve mobile genetic elements such as insertion sequence (IS) elements. Here we report novel insertions and excisions of IS elements and recombination between homologous IS elements identified in a large collection of Escherichia coli mutation accumulation lines by analysis of whole genome shotgun sequencing data. Based on 857 identified events (758 IS insertions, 98 recombinations and 1 excision), we estimate that the rate of IS insertion is 3.5 × 10(−4) insertions per genome per generation and the rate of IS homologous recombination is 4.5 × 10(−5) recombinations per genome per generation. These events are mostly contributed by the IS elements IS1, IS2, IS5 and IS186. Spatial analysis of new insertions suggest that transposition is biased to proximal insertions, and the length spectrum of IS-caused deletions is largely explained by local hopping. For any of the ISs studied there is no region of the circular genome that is favored or disfavored for new insertions but there are notable hotspots for deletions. Some elements have preferences for non-coding sequence or for the beginning and end of coding regions, largely explained by target site motifs. Interestingly, transposition and deletion rates remain constant across the wild-type and 12 mutant E. coli lines, each deficient in a distinct DNA repair pathway. Finally, we characterized the target sites of four IS families, confirming previous results and characterizing a highly specific pattern at IS186 target-sites, 5′-GGGG(N6/N7)CCCC-3′. We also detected 48 long deletions not involving IS elements.
format Online
Article
Text
id pubmed-5009759
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-50097592016-09-07 Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli Lee, Heewook Doak, Thomas G. Popodi, Ellen Foster, Patricia L. Tang, Haixu Nucleic Acids Res Computational Biology A majority of large-scale bacterial genome rearrangements involve mobile genetic elements such as insertion sequence (IS) elements. Here we report novel insertions and excisions of IS elements and recombination between homologous IS elements identified in a large collection of Escherichia coli mutation accumulation lines by analysis of whole genome shotgun sequencing data. Based on 857 identified events (758 IS insertions, 98 recombinations and 1 excision), we estimate that the rate of IS insertion is 3.5 × 10(−4) insertions per genome per generation and the rate of IS homologous recombination is 4.5 × 10(−5) recombinations per genome per generation. These events are mostly contributed by the IS elements IS1, IS2, IS5 and IS186. Spatial analysis of new insertions suggest that transposition is biased to proximal insertions, and the length spectrum of IS-caused deletions is largely explained by local hopping. For any of the ISs studied there is no region of the circular genome that is favored or disfavored for new insertions but there are notable hotspots for deletions. Some elements have preferences for non-coding sequence or for the beginning and end of coding regions, largely explained by target site motifs. Interestingly, transposition and deletion rates remain constant across the wild-type and 12 mutant E. coli lines, each deficient in a distinct DNA repair pathway. Finally, we characterized the target sites of four IS families, confirming previous results and characterizing a highly specific pattern at IS186 target-sites, 5′-GGGG(N6/N7)CCCC-3′. We also detected 48 long deletions not involving IS elements. Oxford University Press 2016-09-06 2016-07-18 /pmc/articles/PMC5009759/ /pubmed/27431326 http://dx.doi.org/10.1093/nar/gkw647 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Lee, Heewook
Doak, Thomas G.
Popodi, Ellen
Foster, Patricia L.
Tang, Haixu
Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli
title Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli
title_full Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli
title_fullStr Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli
title_full_unstemmed Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli
title_short Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli
title_sort insertion sequence-caused large-scale rearrangements in the genome of escherichia coli
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009759/
https://www.ncbi.nlm.nih.gov/pubmed/27431326
http://dx.doi.org/10.1093/nar/gkw647
work_keys_str_mv AT leeheewook insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli
AT doakthomasg insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli
AT popodiellen insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli
AT fosterpatricial insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli
AT tanghaixu insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli