Cargando…

Comprehensive classification of the PIN domain-like superfamily

PIN-like domains constitute a widespread superfamily of nucleases, diverse in terms of the reaction mechanism, substrate specificity, biological function and taxonomic distribution. Proteins with PIN-like domains are involved in central cellular processes, such as DNA replication and repair, mRNA de...

Descripción completa

Detalles Bibliográficos
Autores principales: Matelska, Dorota, Steczkiewicz, Kamil, Ginalski, Krzysztof
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499597/
https://www.ncbi.nlm.nih.gov/pubmed/28575517
http://dx.doi.org/10.1093/nar/gkx494
_version_ 1783248500768636928
author Matelska, Dorota
Steczkiewicz, Kamil
Ginalski, Krzysztof
author_facet Matelska, Dorota
Steczkiewicz, Kamil
Ginalski, Krzysztof
author_sort Matelska, Dorota
collection PubMed
description PIN-like domains constitute a widespread superfamily of nucleases, diverse in terms of the reaction mechanism, substrate specificity, biological function and taxonomic distribution. Proteins with PIN-like domains are involved in central cellular processes, such as DNA replication and repair, mRNA degradation, transcription regulation and ncRNA maturation. In this work, we identify and classify the most complete set of PIN-like domains to provide the first comprehensive analysis of sequence–structure–function relationships within the whole PIN domain-like superfamily. Transitive sequence searches using highly sensitive methods for remote homology detection led to the identification of several new families, including representatives of Pfam (DUF1308, DUF4935) and CDD (COG2454), and 23 other families not classified in the public domain databases. Further sequence clustering revealed relationships between individual sequence clusters and showed heterogeneity within some families, suggesting a possible functional divergence. With five structural groups, 70 defined clusters, over 100,000 proteins, and broad biological functions, the PIN domain-like superfamily constitutes one of the largest and most diverse nuclease superfamilies. Detailed analyses of sequences and structures, domain architectures, and genomic contexts allowed us to predict biological function of several new families, including new toxin-antitoxin components, proteins involved in tRNA/rRNA maturation and transcription/translation regulation.
format Online
Article
Text
id pubmed-5499597
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-54995972017-07-10 Comprehensive classification of the PIN domain-like superfamily Matelska, Dorota Steczkiewicz, Kamil Ginalski, Krzysztof Nucleic Acids Res Survey and Summary PIN-like domains constitute a widespread superfamily of nucleases, diverse in terms of the reaction mechanism, substrate specificity, biological function and taxonomic distribution. Proteins with PIN-like domains are involved in central cellular processes, such as DNA replication and repair, mRNA degradation, transcription regulation and ncRNA maturation. In this work, we identify and classify the most complete set of PIN-like domains to provide the first comprehensive analysis of sequence–structure–function relationships within the whole PIN domain-like superfamily. Transitive sequence searches using highly sensitive methods for remote homology detection led to the identification of several new families, including representatives of Pfam (DUF1308, DUF4935) and CDD (COG2454), and 23 other families not classified in the public domain databases. Further sequence clustering revealed relationships between individual sequence clusters and showed heterogeneity within some families, suggesting a possible functional divergence. With five structural groups, 70 defined clusters, over 100,000 proteins, and broad biological functions, the PIN domain-like superfamily constitutes one of the largest and most diverse nuclease superfamilies. Detailed analyses of sequences and structures, domain architectures, and genomic contexts allowed us to predict biological function of several new families, including new toxin-antitoxin components, proteins involved in tRNA/rRNA maturation and transcription/translation regulation. Oxford University Press 2017-07-07 2017-05-31 /pmc/articles/PMC5499597/ /pubmed/28575517 http://dx.doi.org/10.1093/nar/gkx494 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Survey and Summary
Matelska, Dorota
Steczkiewicz, Kamil
Ginalski, Krzysztof
Comprehensive classification of the PIN domain-like superfamily
title Comprehensive classification of the PIN domain-like superfamily
title_full Comprehensive classification of the PIN domain-like superfamily
title_fullStr Comprehensive classification of the PIN domain-like superfamily
title_full_unstemmed Comprehensive classification of the PIN domain-like superfamily
title_short Comprehensive classification of the PIN domain-like superfamily
title_sort comprehensive classification of the pin domain-like superfamily
topic Survey and Summary
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499597/
https://www.ncbi.nlm.nih.gov/pubmed/28575517
http://dx.doi.org/10.1093/nar/gkx494
work_keys_str_mv AT matelskadorota comprehensiveclassificationofthepindomainlikesuperfamily
AT steczkiewiczkamil comprehensiveclassificationofthepindomainlikesuperfamily
AT ginalskikrzysztof comprehensiveclassificationofthepindomainlikesuperfamily