Cargando…

An ambiguity principle for assigning protein structural domains

Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in...

Descripción completa

Detalles Bibliográficos
Autores principales: Postic, Guillaume, Ghouzam, Yassine, Chebrek, Romain, Gelly, Jean-Christophe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5235333/
https://www.ncbi.nlm.nih.gov/pubmed/28097215
http://dx.doi.org/10.1126/sciadv.1600552
_version_ 1782495139774595072
author Postic, Guillaume
Ghouzam, Yassine
Chebrek, Romain
Gelly, Jean-Christophe
author_facet Postic, Guillaume
Ghouzam, Yassine
Chebrek, Romain
Gelly, Jean-Christophe
author_sort Postic, Guillaume
collection PubMed
description Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object—in this case, a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our “multipartitioning” approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules.
format Online
Article
Text
id pubmed-5235333
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher American Association for the Advancement of Science
record_format MEDLINE/PubMed
spelling pubmed-52353332017-01-17 An ambiguity principle for assigning protein structural domains Postic, Guillaume Ghouzam, Yassine Chebrek, Romain Gelly, Jean-Christophe Sci Adv Research Articles Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object—in this case, a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our “multipartitioning” approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules. American Association for the Advancement of Science 2017-01-13 /pmc/articles/PMC5235333/ /pubmed/28097215 http://dx.doi.org/10.1126/sciadv.1600552 Text en Copyright © 2017, The Authors http://creativecommons.org/licenses/by-nc/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license (http://creativecommons.org/licenses/by-nc/4.0/) , which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.
spellingShingle Research Articles
Postic, Guillaume
Ghouzam, Yassine
Chebrek, Romain
Gelly, Jean-Christophe
An ambiguity principle for assigning protein structural domains
title An ambiguity principle for assigning protein structural domains
title_full An ambiguity principle for assigning protein structural domains
title_fullStr An ambiguity principle for assigning protein structural domains
title_full_unstemmed An ambiguity principle for assigning protein structural domains
title_short An ambiguity principle for assigning protein structural domains
title_sort ambiguity principle for assigning protein structural domains
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5235333/
https://www.ncbi.nlm.nih.gov/pubmed/28097215
http://dx.doi.org/10.1126/sciadv.1600552
work_keys_str_mv AT posticguillaume anambiguityprincipleforassigningproteinstructuraldomains
AT ghouzamyassine anambiguityprincipleforassigningproteinstructuraldomains
AT chebrekromain anambiguityprincipleforassigningproteinstructuraldomains
AT gellyjeanchristophe anambiguityprincipleforassigningproteinstructuraldomains
AT posticguillaume ambiguityprincipleforassigningproteinstructuraldomains
AT ghouzamyassine ambiguityprincipleforassigningproteinstructuraldomains
AT chebrekromain ambiguityprincipleforassigningproteinstructuraldomains
AT gellyjeanchristophe ambiguityprincipleforassigningproteinstructuraldomains