Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting
Extracting structural information from sequence co-variation has become common practice in computational biology in recent years, mainly due to the availability of large sequence alignments of protein families. However, identifying features that are specific to sub-classes and not shared by all m...
Main authors: | Malinverni, Duccio; Barducci, Alessandro |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2019 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992422/ https://www.ncbi.nlm.nih.gov/pubmed/32002010 http://dx.doi.org/10.3390/e21111127 |
_version_ | 1783492835494854656 |
---|---|
author | Malinverni, Duccio; Barducci, Alessandro |
author_facet | Malinverni, Duccio; Barducci, Alessandro |
author_sort | Malinverni, Duccio |
collection | PubMed |
description | Extracting structural information from sequence co-variation has become common practice in computational biology in recent years, mainly due to the availability of large sequence alignments of protein families. However, using sequence-based approaches to identify features that are specific to sub-classes, and not shared by all members of the family, has remained an elusive problem. Here we present a coevolution-based method to differentially analyze subfamily-specific structural features by a continuous sequence reweighting (SR) approach. We introduce the underlying principles and test the method's predictive capabilities on the Response Regulator family, whose subfamilies have previously been shown to display distinct, specific homo-dimerization patterns. Our results show that this reweighting scheme is effective in assigning structural features known a priori to subfamilies, even when sequence data are relatively scarce. Furthermore, sequence reweighting makes it possible to assess whether individual structural contacts pertain to specific subfamilies, and it thus paves the way for the identification of specificity-determining contacts from sequence variation data. |
format | Online Article Text |
id | pubmed-6992422 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-6992422 2020-01-30 Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting Malinverni, Duccio; Barducci, Alessandro Entropy (Basel) Article Extracting structural information from sequence co-variation has become common practice in computational biology in recent years, mainly due to the availability of large sequence alignments of protein families. However, using sequence-based approaches to identify features that are specific to sub-classes, and not shared by all members of the family, has remained an elusive problem. Here we present a coevolution-based method to differentially analyze subfamily-specific structural features by a continuous sequence reweighting (SR) approach. We introduce the underlying principles and test the method's predictive capabilities on the Response Regulator family, whose subfamilies have previously been shown to display distinct, specific homo-dimerization patterns. Our results show that this reweighting scheme is effective in assigning structural features known a priori to subfamilies, even when sequence data are relatively scarce. Furthermore, sequence reweighting makes it possible to assess whether individual structural contacts pertain to specific subfamilies, and it thus paves the way for the identification of specificity-determining contacts from sequence variation data. MDPI 2019-11-16 /pmc/articles/PMC6992422/ /pubmed/32002010 http://dx.doi.org/10.3390/e21111127 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Malinverni, Duccio Barducci, Alessandro Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting |
title | Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting |
title_full | Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting |
title_fullStr | Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting |
title_full_unstemmed | Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting |
title_short | Coevolutionary Analysis of Protein Subfamilies by Sequence Reweighting |
title_sort | coevolutionary analysis of protein subfamilies by sequence reweighting |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992422/ https://www.ncbi.nlm.nih.gov/pubmed/32002010 http://dx.doi.org/10.3390/e21111127 |
work_keys_str_mv | AT malinverniduccio coevolutionaryanalysisofproteinsubfamiliesbysequencereweighting AT barduccialessandro coevolutionaryanalysisofproteinsubfamiliesbysequencereweighting |