Cargando…

A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters

Proteome-scale bioinformatics research is increasingly conducted as the number of completely sequenced genomes increases, but analysis of protein domains (PDs) usually relies on similarity in their amino acid sequences and/or three-dimensional structures. Here, we present results from a bi-clusterin...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Chia-Tsen, Chiang, Austin W. T., Hwang, Ming-Jing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5660162/
https://www.ncbi.nlm.nih.gov/pubmed/29079755
http://dx.doi.org/10.1038/s41598-017-13297-0
_version_ 1783274243985768448
author Sun, Chia-Tsen
Chiang, Austin W. T.
Hwang, Ming-Jing
author_facet Sun, Chia-Tsen
Chiang, Austin W. T.
Hwang, Ming-Jing
author_sort Sun, Chia-Tsen
collection PubMed
description Proteome-scale bioinformatics research is increasingly conducted as the number of completely sequenced genomes increases, but analysis of protein domains (PDs) usually relies on similarity in their amino acid sequences and/or three-dimensional structures. Here, we present results from a bi-clustering analysis on presence/absence data for 6,580 unique PDs in 2,134 species with a sequenced genome, thus covering a complete set of proteins, for the three superkingdoms of life, Bacteria, Archaea, and Eukarya. Our analysis revealed eight distinctive PD clusters, which, following an analysis of enrichment of Gene Ontology functions and CATH classification of protein structures, were shown to exhibit structural and functional properties that are taxa-characteristic. For examples, the largest cluster is ubiquitous in all three superkingdoms, constituting a set of 1,472 persistent domains created early in evolution and retained in living organisms and characterized by basic cellular functions and ancient structural architectures, while an Archaea and Eukarya bi-superkingdom cluster suggests its PDs may have existed in the ancestor of the two superkingdoms, and others are single superkingdom- or taxa (e.g. Fungi)-specific. These results contribute to increase our appreciation of PD diversity and our knowledge of how PDs are used in species, yielding implications on species evolution.
format Online
Article
Text
id pubmed-5660162
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-56601622017-11-01 A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters Sun, Chia-Tsen Chiang, Austin W. T. Hwang, Ming-Jing Sci Rep Article Proteome-scale bioinformatics research is increasingly conducted as the number of completely sequenced genomes increases, but analysis of protein domains (PDs) usually relies on similarity in their amino acid sequences and/or three-dimensional structures. Here, we present results from a bi-clustering analysis on presence/absence data for 6,580 unique PDs in 2,134 species with a sequenced genome, thus covering a complete set of proteins, for the three superkingdoms of life, Bacteria, Archaea, and Eukarya. Our analysis revealed eight distinctive PD clusters, which, following an analysis of enrichment of Gene Ontology functions and CATH classification of protein structures, were shown to exhibit structural and functional properties that are taxa-characteristic. For examples, the largest cluster is ubiquitous in all three superkingdoms, constituting a set of 1,472 persistent domains created early in evolution and retained in living organisms and characterized by basic cellular functions and ancient structural architectures, while an Archaea and Eukarya bi-superkingdom cluster suggests its PDs may have existed in the ancestor of the two superkingdoms, and others are single superkingdom- or taxa (e.g. Fungi)-specific. These results contribute to increase our appreciation of PD diversity and our knowledge of how PDs are used in species, yielding implications on species evolution. Nature Publishing Group UK 2017-10-27 /pmc/articles/PMC5660162/ /pubmed/29079755 http://dx.doi.org/10.1038/s41598-017-13297-0 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Sun, Chia-Tsen
Chiang, Austin W. T.
Hwang, Ming-Jing
A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters
title A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters
title_full A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters
title_fullStr A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters
title_full_unstemmed A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters
title_short A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters
title_sort proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5660162/
https://www.ncbi.nlm.nih.gov/pubmed/29079755
http://dx.doi.org/10.1038/s41598-017-13297-0
work_keys_str_mv AT sunchiatsen aproteomeviewofstructuralfunctionalandtaxonomiccharacteristicsofmajorproteindomainclusters
AT chiangaustinwt aproteomeviewofstructuralfunctionalandtaxonomiccharacteristicsofmajorproteindomainclusters
AT hwangmingjing aproteomeviewofstructuralfunctionalandtaxonomiccharacteristicsofmajorproteindomainclusters
AT sunchiatsen proteomeviewofstructuralfunctionalandtaxonomiccharacteristicsofmajorproteindomainclusters
AT chiangaustinwt proteomeviewofstructuralfunctionalandtaxonomiccharacteristicsofmajorproteindomainclusters
AT hwangmingjing proteomeviewofstructuralfunctionalandtaxonomiccharacteristicsofmajorproteindomainclusters