Cargando…
Comprehensive classification of the PIN domain-like superfamily
PIN-like domains constitute a widespread superfamily of nucleases, diverse in terms of the reaction mechanism, substrate specificity, biological function and taxonomic distribution. Proteins with PIN-like domains are involved in central cellular processes, such as DNA replication and repair, mRNA de...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499597/ https://www.ncbi.nlm.nih.gov/pubmed/28575517 http://dx.doi.org/10.1093/nar/gkx494 |
_version_ | 1783248500768636928 |
---|---|
author | Matelska, Dorota Steczkiewicz, Kamil Ginalski, Krzysztof |
author_facet | Matelska, Dorota Steczkiewicz, Kamil Ginalski, Krzysztof |
author_sort | Matelska, Dorota |
collection | PubMed |
description | PIN-like domains constitute a widespread superfamily of nucleases, diverse in terms of the reaction mechanism, substrate specificity, biological function and taxonomic distribution. Proteins with PIN-like domains are involved in central cellular processes, such as DNA replication and repair, mRNA degradation, transcription regulation and ncRNA maturation. In this work, we identify and classify the most complete set of PIN-like domains to provide the first comprehensive analysis of sequence–structure–function relationships within the whole PIN domain-like superfamily. Transitive sequence searches using highly sensitive methods for remote homology detection led to the identification of several new families, including representatives of Pfam (DUF1308, DUF4935) and CDD (COG2454), and 23 other families not classified in the public domain databases. Further sequence clustering revealed relationships between individual sequence clusters and showed heterogeneity within some families, suggesting a possible functional divergence. With five structural groups, 70 defined clusters, over 100,000 proteins, and broad biological functions, the PIN domain-like superfamily constitutes one of the largest and most diverse nuclease superfamilies. Detailed analyses of sequences and structures, domain architectures, and genomic contexts allowed us to predict biological function of several new families, including new toxin-antitoxin components, proteins involved in tRNA/rRNA maturation and transcription/translation regulation. |
format | Online Article Text |
id | pubmed-5499597 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54995972017-07-10 Comprehensive classification of the PIN domain-like superfamily Matelska, Dorota Steczkiewicz, Kamil Ginalski, Krzysztof Nucleic Acids Res Survey and Summary PIN-like domains constitute a widespread superfamily of nucleases, diverse in terms of the reaction mechanism, substrate specificity, biological function and taxonomic distribution. Proteins with PIN-like domains are involved in central cellular processes, such as DNA replication and repair, mRNA degradation, transcription regulation and ncRNA maturation. In this work, we identify and classify the most complete set of PIN-like domains to provide the first comprehensive analysis of sequence–structure–function relationships within the whole PIN domain-like superfamily. Transitive sequence searches using highly sensitive methods for remote homology detection led to the identification of several new families, including representatives of Pfam (DUF1308, DUF4935) and CDD (COG2454), and 23 other families not classified in the public domain databases. Further sequence clustering revealed relationships between individual sequence clusters and showed heterogeneity within some families, suggesting a possible functional divergence. With five structural groups, 70 defined clusters, over 100,000 proteins, and broad biological functions, the PIN domain-like superfamily constitutes one of the largest and most diverse nuclease superfamilies. Detailed analyses of sequences and structures, domain architectures, and genomic contexts allowed us to predict biological function of several new families, including new toxin-antitoxin components, proteins involved in tRNA/rRNA maturation and transcription/translation regulation. Oxford University Press 2017-07-07 2017-05-31 /pmc/articles/PMC5499597/ /pubmed/28575517 http://dx.doi.org/10.1093/nar/gkx494 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Survey and Summary Matelska, Dorota Steczkiewicz, Kamil Ginalski, Krzysztof Comprehensive classification of the PIN domain-like superfamily |
title | Comprehensive classification of the PIN domain-like superfamily |
title_full | Comprehensive classification of the PIN domain-like superfamily |
title_fullStr | Comprehensive classification of the PIN domain-like superfamily |
title_full_unstemmed | Comprehensive classification of the PIN domain-like superfamily |
title_short | Comprehensive classification of the PIN domain-like superfamily |
title_sort | comprehensive classification of the pin domain-like superfamily |
topic | Survey and Summary |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499597/ https://www.ncbi.nlm.nih.gov/pubmed/28575517 http://dx.doi.org/10.1093/nar/gkx494 |
work_keys_str_mv | AT matelskadorota comprehensiveclassificationofthepindomainlikesuperfamily AT steczkiewiczkamil comprehensiveclassificationofthepindomainlikesuperfamily AT ginalskikrzysztof comprehensiveclassificationofthepindomainlikesuperfamily |