Cargando…
Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli
A majority of large-scale bacterial genome rearrangements involve mobile genetic elements such as insertion sequence (IS) elements. Here we report novel insertions and excisions of IS elements and recombination between homologous IS elements identified in a large collection of Escherichia coli mutat...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009759/ https://www.ncbi.nlm.nih.gov/pubmed/27431326 http://dx.doi.org/10.1093/nar/gkw647 |
_version_ | 1782451577611616256 |
---|---|
author | Lee, Heewook Doak, Thomas G. Popodi, Ellen Foster, Patricia L. Tang, Haixu |
author_facet | Lee, Heewook Doak, Thomas G. Popodi, Ellen Foster, Patricia L. Tang, Haixu |
author_sort | Lee, Heewook |
collection | PubMed |
description | A majority of large-scale bacterial genome rearrangements involve mobile genetic elements such as insertion sequence (IS) elements. Here we report novel insertions and excisions of IS elements and recombination between homologous IS elements identified in a large collection of Escherichia coli mutation accumulation lines by analysis of whole genome shotgun sequencing data. Based on 857 identified events (758 IS insertions, 98 recombinations and 1 excision), we estimate that the rate of IS insertion is 3.5 × 10(−4) insertions per genome per generation and the rate of IS homologous recombination is 4.5 × 10(−5) recombinations per genome per generation. These events are mostly contributed by the IS elements IS1, IS2, IS5 and IS186. Spatial analysis of new insertions suggest that transposition is biased to proximal insertions, and the length spectrum of IS-caused deletions is largely explained by local hopping. For any of the ISs studied there is no region of the circular genome that is favored or disfavored for new insertions but there are notable hotspots for deletions. Some elements have preferences for non-coding sequence or for the beginning and end of coding regions, largely explained by target site motifs. Interestingly, transposition and deletion rates remain constant across the wild-type and 12 mutant E. coli lines, each deficient in a distinct DNA repair pathway. Finally, we characterized the target sites of four IS families, confirming previous results and characterizing a highly specific pattern at IS186 target-sites, 5′-GGGG(N6/N7)CCCC-3′. We also detected 48 long deletions not involving IS elements. |
format | Online Article Text |
id | pubmed-5009759 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-50097592016-09-07 Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli Lee, Heewook Doak, Thomas G. Popodi, Ellen Foster, Patricia L. Tang, Haixu Nucleic Acids Res Computational Biology A majority of large-scale bacterial genome rearrangements involve mobile genetic elements such as insertion sequence (IS) elements. Here we report novel insertions and excisions of IS elements and recombination between homologous IS elements identified in a large collection of Escherichia coli mutation accumulation lines by analysis of whole genome shotgun sequencing data. Based on 857 identified events (758 IS insertions, 98 recombinations and 1 excision), we estimate that the rate of IS insertion is 3.5 × 10(−4) insertions per genome per generation and the rate of IS homologous recombination is 4.5 × 10(−5) recombinations per genome per generation. These events are mostly contributed by the IS elements IS1, IS2, IS5 and IS186. Spatial analysis of new insertions suggest that transposition is biased to proximal insertions, and the length spectrum of IS-caused deletions is largely explained by local hopping. For any of the ISs studied there is no region of the circular genome that is favored or disfavored for new insertions but there are notable hotspots for deletions. Some elements have preferences for non-coding sequence or for the beginning and end of coding regions, largely explained by target site motifs. Interestingly, transposition and deletion rates remain constant across the wild-type and 12 mutant E. coli lines, each deficient in a distinct DNA repair pathway. Finally, we characterized the target sites of four IS families, confirming previous results and characterizing a highly specific pattern at IS186 target-sites, 5′-GGGG(N6/N7)CCCC-3′. We also detected 48 long deletions not involving IS elements. Oxford University Press 2016-09-06 2016-07-18 /pmc/articles/PMC5009759/ /pubmed/27431326 http://dx.doi.org/10.1093/nar/gkw647 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Lee, Heewook Doak, Thomas G. Popodi, Ellen Foster, Patricia L. Tang, Haixu Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli |
title | Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli |
title_full | Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli |
title_fullStr | Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli |
title_full_unstemmed | Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli |
title_short | Insertion sequence-caused large-scale rearrangements in the genome of Escherichia coli |
title_sort | insertion sequence-caused large-scale rearrangements in the genome of escherichia coli |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009759/ https://www.ncbi.nlm.nih.gov/pubmed/27431326 http://dx.doi.org/10.1093/nar/gkw647 |
work_keys_str_mv | AT leeheewook insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli AT doakthomasg insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli AT popodiellen insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli AT fosterpatricial insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli AT tanghaixu insertionsequencecausedlargescalerearrangementsinthegenomeofescherichiacoli |