Cargando…
Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases
BACKGROUND: Some mobile genetic elements target the lagging strand template during DNA replication. Bacterial examples are insertion sequences IS608 and ISDra2 (IS200/IS605 family members). They use obligatory single-stranded circular DNA intermediates for excision and insertion and encode a transpo...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6006769/ https://www.ncbi.nlm.nih.gov/pubmed/29914351 http://dx.doi.org/10.1186/s12864-018-4836-1 |
_version_ | 1783332905284534272 |
---|---|
author | Quentin, Yves Siguier, Patricia Chandler, Mick Fichant, Gwennaele |
author_facet | Quentin, Yves Siguier, Patricia Chandler, Mick Fichant, Gwennaele |
author_sort | Quentin, Yves |
collection | PubMed |
description | BACKGROUND: Some mobile genetic elements target the lagging strand template during DNA replication. Bacterial examples are insertion sequences IS608 and ISDra2 (IS200/IS605 family members). They use obligatory single-stranded circular DNA intermediates for excision and insertion and encode a transposase, TnpA(IS200), which recognizes subterminal secondary structures at the insertion sequence ends. Similar secondary structures, Repeated Extragenic Palindromes (REP), are present in many bacterial genomes. TnpA(IS200)-related proteins, TnpA(REP), have been identified and could be responsible for REP sequence proliferation. These proteins share a conserved HuH/Tyrosine core domain responsible for catalysis and are involved in processes of ssDNA cleavage and ligation. Our goal is to characterize the diversity of these proteins collectively referred as the TnpA(Y1) family. RESULTS: A genome-wide analysis of sequences similar to TnpA(IS200) and TnpA(REP) in prokaryotes revealed a large number of family members with a wide taxonomic distribution. These can be arranged into three distinct classes and 12 subclasses based on sequence similarity. One subclass includes sequences similar to TnpA(IS200). Proteins from other subclasses are not associated with typical insertion sequence features. These are characterized by specific additional domains possibly involved in protein/DNA or protein/protein interactions. Their genes are found in more than 25% of species analyzed. They exhibit a patchy taxonomic distribution consistent with dissemination by horizontal gene transfers followed by loss. The tnpA(REP) genes of five subclasses are flanked by typical REP sequences in a REPtron-like arrangement. Four distinct REP types were characterized with a subclass specific distribution. Other subclasses are not associated with REP sequences but have a large conserved domain located in C-terminal end of their sequence. This unexpected diversity suggests that, while most likely involved in processing single-strand DNA, proteins from different subfamilies may play a number of different roles. CONCLUSIONS: We established a detailed classification of TnpA(Y1) proteins, consolidated by the analysis of the conserved core domains and the characterization of additional domains. The data obtained illustrate the unexpected diversity of the TnpA(Y1) family and provide a strong framework for future evolutionary and functional studies. By their potential function in ssDNA editing, they may confer adaptive responses to host cell physiology and metabolism. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4836-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6006769 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-60067692018-06-26 Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases Quentin, Yves Siguier, Patricia Chandler, Mick Fichant, Gwennaele BMC Genomics Research Article BACKGROUND: Some mobile genetic elements target the lagging strand template during DNA replication. Bacterial examples are insertion sequences IS608 and ISDra2 (IS200/IS605 family members). They use obligatory single-stranded circular DNA intermediates for excision and insertion and encode a transposase, TnpA(IS200), which recognizes subterminal secondary structures at the insertion sequence ends. Similar secondary structures, Repeated Extragenic Palindromes (REP), are present in many bacterial genomes. TnpA(IS200)-related proteins, TnpA(REP), have been identified and could be responsible for REP sequence proliferation. These proteins share a conserved HuH/Tyrosine core domain responsible for catalysis and are involved in processes of ssDNA cleavage and ligation. Our goal is to characterize the diversity of these proteins collectively referred as the TnpA(Y1) family. RESULTS: A genome-wide analysis of sequences similar to TnpA(IS200) and TnpA(REP) in prokaryotes revealed a large number of family members with a wide taxonomic distribution. These can be arranged into three distinct classes and 12 subclasses based on sequence similarity. One subclass includes sequences similar to TnpA(IS200). Proteins from other subclasses are not associated with typical insertion sequence features. These are characterized by specific additional domains possibly involved in protein/DNA or protein/protein interactions. Their genes are found in more than 25% of species analyzed. They exhibit a patchy taxonomic distribution consistent with dissemination by horizontal gene transfers followed by loss. The tnpA(REP) genes of five subclasses are flanked by typical REP sequences in a REPtron-like arrangement. Four distinct REP types were characterized with a subclass specific distribution. Other subclasses are not associated with REP sequences but have a large conserved domain located in C-terminal end of their sequence. This unexpected diversity suggests that, while most likely involved in processing single-strand DNA, proteins from different subfamilies may play a number of different roles. CONCLUSIONS: We established a detailed classification of TnpA(Y1) proteins, consolidated by the analysis of the conserved core domains and the characterization of additional domains. The data obtained illustrate the unexpected diversity of the TnpA(Y1) family and provide a strong framework for future evolutionary and functional studies. By their potential function in ssDNA editing, they may confer adaptive responses to host cell physiology and metabolism. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4836-1) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-19 /pmc/articles/PMC6006769/ /pubmed/29914351 http://dx.doi.org/10.1186/s12864-018-4836-1 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Quentin, Yves Siguier, Patricia Chandler, Mick Fichant, Gwennaele Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases |
title | Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases |
title_full | Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases |
title_fullStr | Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases |
title_full_unstemmed | Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases |
title_short | Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases |
title_sort | single-strand dna processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic huh endonucleases |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6006769/ https://www.ncbi.nlm.nih.gov/pubmed/29914351 http://dx.doi.org/10.1186/s12864-018-4836-1 |
work_keys_str_mv | AT quentinyves singlestranddnaprocessingphylogenomicsandsequencediversityofasuperfamilyofpotentialprokaryotichuhendonucleases AT siguierpatricia singlestranddnaprocessingphylogenomicsandsequencediversityofasuperfamilyofpotentialprokaryotichuhendonucleases AT chandlermick singlestranddnaprocessingphylogenomicsandsequencediversityofasuperfamilyofpotentialprokaryotichuhendonucleases AT fichantgwennaele singlestranddnaprocessingphylogenomicsandsequencediversityofasuperfamilyofpotentialprokaryotichuhendonucleases |