Cargando…

SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information

In this paper, we describe SAFlex (Structural Alphabet Flexibility), an extension of an existing structural alphabet (HMM-SA), to better explore increasing protein three dimensional structure information by encoding conformations of proteins in case of missing residues or uncertainties. An SA aims t...

Descripción completa

Detalles Bibliográficos
Autores principales: Allam, Ikram, Flatters, Delphine, Caumes, Géraldine, Regad, Leslie, Delos, Vincent, Nuel, Gregory, Camproux, Anne-Claude
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6033379/
https://www.ncbi.nlm.nih.gov/pubmed/29975698
http://dx.doi.org/10.1371/journal.pone.0198854
_version_ 1783337690594279424
author Allam, Ikram
Flatters, Delphine
Caumes, Géraldine
Regad, Leslie
Delos, Vincent
Nuel, Gregory
Camproux, Anne-Claude
author_facet Allam, Ikram
Flatters, Delphine
Caumes, Géraldine
Regad, Leslie
Delos, Vincent
Nuel, Gregory
Camproux, Anne-Claude
author_sort Allam, Ikram
collection PubMed
description In this paper, we describe SAFlex (Structural Alphabet Flexibility), an extension of an existing structural alphabet (HMM-SA), to better explore increasing protein three dimensional structure information by encoding conformations of proteins in case of missing residues or uncertainties. An SA aims to reduce three dimensional conformations of proteins as well as their analysis and comparison complexity by simplifying any conformation in a series of structural letters. Our methodology presents several novelties. Firstly, it can account for the encoding uncertainty by providing a wide range of encoding options: the maximum a posteriori, the marginal posterior distribution, and the effective number of letters at each given position. Secondly, our new algorithm deals with the missing data in the protein structure files (concerning more than 75% of the proteins from the Protein Data Bank) in a rigorous probabilistic framework. Thirdly, SAFlex is able to encode and to build a consensus encoding from different replicates of a single protein such as several homomer chains. This allows localizing structural differences between different chains and detecting structural variability, which is essential for protein flexibility identification. These improvements are illustrated on different proteins, such as the crystal structure of an eukaryotic small heat shock protein. They are promising to explore increasing protein redundancy data and obtain useful quantification of their flexibility.
format Online
Article
Text
id pubmed-6033379
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60333792018-07-19 SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information Allam, Ikram Flatters, Delphine Caumes, Géraldine Regad, Leslie Delos, Vincent Nuel, Gregory Camproux, Anne-Claude PLoS One Research Article In this paper, we describe SAFlex (Structural Alphabet Flexibility), an extension of an existing structural alphabet (HMM-SA), to better explore increasing protein three dimensional structure information by encoding conformations of proteins in case of missing residues or uncertainties. An SA aims to reduce three dimensional conformations of proteins as well as their analysis and comparison complexity by simplifying any conformation in a series of structural letters. Our methodology presents several novelties. Firstly, it can account for the encoding uncertainty by providing a wide range of encoding options: the maximum a posteriori, the marginal posterior distribution, and the effective number of letters at each given position. Secondly, our new algorithm deals with the missing data in the protein structure files (concerning more than 75% of the proteins from the Protein Data Bank) in a rigorous probabilistic framework. Thirdly, SAFlex is able to encode and to build a consensus encoding from different replicates of a single protein such as several homomer chains. This allows localizing structural differences between different chains and detecting structural variability, which is essential for protein flexibility identification. These improvements are illustrated on different proteins, such as the crystal structure of an eukaryotic small heat shock protein. They are promising to explore increasing protein redundancy data and obtain useful quantification of their flexibility. Public Library of Science 2018-07-05 /pmc/articles/PMC6033379/ /pubmed/29975698 http://dx.doi.org/10.1371/journal.pone.0198854 Text en © 2018 Allam et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Allam, Ikram
Flatters, Delphine
Caumes, Géraldine
Regad, Leslie
Delos, Vincent
Nuel, Gregory
Camproux, Anne-Claude
SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information
title SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information
title_full SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information
title_fullStr SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information
title_full_unstemmed SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information
title_short SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information
title_sort saflex: a structural alphabet extension to integrate protein structural flexibility and missing data information
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6033379/
https://www.ncbi.nlm.nih.gov/pubmed/29975698
http://dx.doi.org/10.1371/journal.pone.0198854
work_keys_str_mv AT allamikram saflexastructuralalphabetextensiontointegrateproteinstructuralflexibilityandmissingdatainformation
AT flattersdelphine saflexastructuralalphabetextensiontointegrateproteinstructuralflexibilityandmissingdatainformation
AT caumesgeraldine saflexastructuralalphabetextensiontointegrateproteinstructuralflexibilityandmissingdatainformation
AT regadleslie saflexastructuralalphabetextensiontointegrateproteinstructuralflexibilityandmissingdatainformation
AT delosvincent saflexastructuralalphabetextensiontointegrateproteinstructuralflexibilityandmissingdatainformation
AT nuelgregory saflexastructuralalphabetextensiontointegrateproteinstructuralflexibilityandmissingdatainformation
AT camprouxanneclaude saflexastructuralalphabetextensiontointegrateproteinstructuralflexibilityandmissingdatainformation