Cargando…
Exploring the dark foldable proteome by considering hydrophobic amino acids topology
The protein universe corresponds to the set of all proteins found in all organisms. A way to explore it is by taking into account the domain content of the proteins. However, some part of sequences and many entire sequences remain un-annotated despite a converging number of domain families. The un-a...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5278394/ https://www.ncbi.nlm.nih.gov/pubmed/28134276 http://dx.doi.org/10.1038/srep41425 |
_version_ | 1782502641535811584 |
---|---|
author | Bitard-Feildel, Tristan Callebaut, Isabelle |
author_facet | Bitard-Feildel, Tristan Callebaut, Isabelle |
author_sort | Bitard-Feildel, Tristan |
collection | PubMed |
description | The protein universe corresponds to the set of all proteins found in all organisms. A way to explore it is by taking into account the domain content of the proteins. However, some part of sequences and many entire sequences remain un-annotated despite a converging number of domain families. The un-annotated part of the protein universe is referred to as the dark proteome and remains poorly characterized. In this study, we quantify the amount of foldable domains within the dark proteome by using the hydrophobic cluster analysis methodology. These un-annotated foldable domains were grouped using a combination of remote homology searches and domain annotations, leading to define different levels of darkness. The dark foldable domains were analyzed to understand what make them different from domains stored in databases and thus difficult to annotate. The un-annotated domains of the dark proteome universe display specific features relative to database domains: shorter length, non-canonical content and particular topology in hydrophobic residues, higher propensity for disorder, and a higher energy. These features make them hard to relate to known families. Based on these observations, we emphasize that domain annotation methodologies can still be improved to fully apprehend and decipher the molecular evolution of the protein universe. |
format | Online Article Text |
id | pubmed-5278394 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-52783942017-02-03 Exploring the dark foldable proteome by considering hydrophobic amino acids topology Bitard-Feildel, Tristan Callebaut, Isabelle Sci Rep Article The protein universe corresponds to the set of all proteins found in all organisms. A way to explore it is by taking into account the domain content of the proteins. However, some part of sequences and many entire sequences remain un-annotated despite a converging number of domain families. The un-annotated part of the protein universe is referred to as the dark proteome and remains poorly characterized. In this study, we quantify the amount of foldable domains within the dark proteome by using the hydrophobic cluster analysis methodology. These un-annotated foldable domains were grouped using a combination of remote homology searches and domain annotations, leading to define different levels of darkness. The dark foldable domains were analyzed to understand what make them different from domains stored in databases and thus difficult to annotate. The un-annotated domains of the dark proteome universe display specific features relative to database domains: shorter length, non-canonical content and particular topology in hydrophobic residues, higher propensity for disorder, and a higher energy. These features make them hard to relate to known families. Based on these observations, we emphasize that domain annotation methodologies can still be improved to fully apprehend and decipher the molecular evolution of the protein universe. Nature Publishing Group 2017-01-30 /pmc/articles/PMC5278394/ /pubmed/28134276 http://dx.doi.org/10.1038/srep41425 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Bitard-Feildel, Tristan Callebaut, Isabelle Exploring the dark foldable proteome by considering hydrophobic amino acids topology |
title | Exploring the dark foldable proteome by considering hydrophobic amino acids topology |
title_full | Exploring the dark foldable proteome by considering hydrophobic amino acids topology |
title_fullStr | Exploring the dark foldable proteome by considering hydrophobic amino acids topology |
title_full_unstemmed | Exploring the dark foldable proteome by considering hydrophobic amino acids topology |
title_short | Exploring the dark foldable proteome by considering hydrophobic amino acids topology |
title_sort | exploring the dark foldable proteome by considering hydrophobic amino acids topology |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5278394/ https://www.ncbi.nlm.nih.gov/pubmed/28134276 http://dx.doi.org/10.1038/srep41425 |
work_keys_str_mv | AT bitardfeildeltristan exploringthedarkfoldableproteomebyconsideringhydrophobicaminoacidstopology AT callebautisabelle exploringthedarkfoldableproteomebyconsideringhydrophobicaminoacidstopology |