Cargando…

Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting

Extracting structural information from sequence co-variation has become a common computational biology practice in the recent years, mainly due to the availability of large sequence alignments of protein families. However, identifying features that are specific to sub-classes and not shared by all m...

Descripción completa

Detalles Bibliográficos
Autores principales: Malinverni, Duccio, Barducci, Alessandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992422/
https://www.ncbi.nlm.nih.gov/pubmed/32002010
http://dx.doi.org/10.3390/e21111127
_version_ 1783492835494854656
author Malinverni, Duccio
Barducci, Alessandro
author_facet Malinverni, Duccio
Barducci, Alessandro
author_sort Malinverni, Duccio
collection PubMed
description Extracting structural information from sequence co-variation has become a common computational biology practice in the recent years, mainly due to the availability of large sequence alignments of protein families. However, identifying features that are specific to sub-classes and not shared by all members of the family using sequence-based approaches has remained an elusive problem. We here present a coevolutionary-based method to differentially analyze subfamily specific structural features by a continuous sequence reweighting (SR) approach. We introduce the underlying principles and test its predictive capabilities on the Response Regulator family, whose subfamilies have been previously shown to display distinct, specific homo-dimerization patterns. Our results show that this reweighting scheme is effective in assigning structural features known a priori to subfamilies, even when sequence data is relatively scarce. Furthermore, sequence reweighting allows assessing if individual structural contacts pertain to specific subfamilies and it thus paves the way for the identification specificity-determining contacts from sequence variation data.
format Online
Article
Text
id pubmed-6992422
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-69924222020-01-30 Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting Malinverni, Duccio Barducci, Alessandro Entropy (Basel) Article Extracting structural information from sequence co-variation has become a common computational biology practice in the recent years, mainly due to the availability of large sequence alignments of protein families. However, identifying features that are specific to sub-classes and not shared by all members of the family using sequence-based approaches has remained an elusive problem. We here present a coevolutionary-based method to differentially analyze subfamily specific structural features by a continuous sequence reweighting (SR) approach. We introduce the underlying principles and test its predictive capabilities on the Response Regulator family, whose subfamilies have been previously shown to display distinct, specific homo-dimerization patterns. Our results show that this reweighting scheme is effective in assigning structural features known a priori to subfamilies, even when sequence data is relatively scarce. Furthermore, sequence reweighting allows assessing if individual structural contacts pertain to specific subfamilies and it thus paves the way for the identification specificity-determining contacts from sequence variation data. MDPI 2019-11-16 /pmc/articles/PMC6992422/ /pubmed/32002010 http://dx.doi.org/10.3390/e21111127 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Malinverni, Duccio
Barducci, Alessandro
Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting
title Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting
title_full Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting
title_fullStr Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting
title_full_unstemmed Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting
title_short Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting
title_sort coevolutionary analysis of protein subfamilies by sequence reweighting
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992422/
https://www.ncbi.nlm.nih.gov/pubmed/32002010
http://dx.doi.org/10.3390/e21111127
work_keys_str_mv AT malinverniduccio coevolutionaryanalysisofproteinsubfamiliesbysequencereweighting
AT barduccialessandro coevolutionaryanalysisofproteinsubfamiliesbysequencereweighting