Cargando…
Classification of protein–protein association rates based on biophysical informatics
BACKGROUND: Proteins form various complexes to carry out their versatile functions in cells. The dynamic properties of protein complex formation are mainly characterized by the association rates which measures how fast these complexes can be formed. It was experimentally observed that the associatio...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8371850/ https://www.ncbi.nlm.nih.gov/pubmed/34404340 http://dx.doi.org/10.1186/s12859-021-04323-0 |
_version_ | 1783739723773116416 |
---|---|
author | Dhusia, Kalyani Wu, Yinghao |
author_facet | Dhusia, Kalyani Wu, Yinghao |
author_sort | Dhusia, Kalyani |
collection | PubMed |
description | BACKGROUND: Proteins form various complexes to carry out their versatile functions in cells. The dynamic properties of protein complex formation are mainly characterized by the association rates which measures how fast these complexes can be formed. It was experimentally observed that the association rates span an extremely wide range with over ten orders of magnitudes. Identification of association rates within this spectrum for specific protein complexes is therefore essential for us to understand their functional roles. RESULTS: To tackle this problem, we integrate physics-based coarse-grained simulations into a neural-network-based classification model to estimate the range of association rates for protein complexes in a large-scale benchmark set. The cross-validation results show that, when an optimal threshold was selected, we can reach the best performance with specificity, precision, sensitivity and overall accuracy all higher than 70%. The quality of our cross-validation data has also been testified by further statistical analysis. Additionally, given an independent testing set, we can successfully predict the group of association rates for eight protein complexes out of ten. Finally, the analysis of failed cases suggests the future implementation of conformational dynamics into simulation can further improve model. CONCLUSIONS: In summary, this study demonstrated that a new modeling framework that combines biophysical simulations with bioinformatics approaches is able to identify protein–protein interactions with low association rates from those with higher association rates. This method thereby can serve as a useful addition to a collection of existing experimental approaches that measure biomolecular recognition. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04323-0. |
format | Online Article Text |
id | pubmed-8371850 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-83718502021-08-18 Classification of protein–protein association rates based on biophysical informatics Dhusia, Kalyani Wu, Yinghao BMC Bioinformatics Research Article BACKGROUND: Proteins form various complexes to carry out their versatile functions in cells. The dynamic properties of protein complex formation are mainly characterized by the association rates which measures how fast these complexes can be formed. It was experimentally observed that the association rates span an extremely wide range with over ten orders of magnitudes. Identification of association rates within this spectrum for specific protein complexes is therefore essential for us to understand their functional roles. RESULTS: To tackle this problem, we integrate physics-based coarse-grained simulations into a neural-network-based classification model to estimate the range of association rates for protein complexes in a large-scale benchmark set. The cross-validation results show that, when an optimal threshold was selected, we can reach the best performance with specificity, precision, sensitivity and overall accuracy all higher than 70%. The quality of our cross-validation data has also been testified by further statistical analysis. Additionally, given an independent testing set, we can successfully predict the group of association rates for eight protein complexes out of ten. Finally, the analysis of failed cases suggests the future implementation of conformational dynamics into simulation can further improve model. CONCLUSIONS: In summary, this study demonstrated that a new modeling framework that combines biophysical simulations with bioinformatics approaches is able to identify protein–protein interactions with low association rates from those with higher association rates. This method thereby can serve as a useful addition to a collection of existing experimental approaches that measure biomolecular recognition. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04323-0. BioMed Central 2021-08-17 /pmc/articles/PMC8371850/ /pubmed/34404340 http://dx.doi.org/10.1186/s12859-021-04323-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Dhusia, Kalyani Wu, Yinghao Classification of protein–protein association rates based on biophysical informatics |
title | Classification of protein–protein association rates based on biophysical informatics |
title_full | Classification of protein–protein association rates based on biophysical informatics |
title_fullStr | Classification of protein–protein association rates based on biophysical informatics |
title_full_unstemmed | Classification of protein–protein association rates based on biophysical informatics |
title_short | Classification of protein–protein association rates based on biophysical informatics |
title_sort | classification of protein–protein association rates based on biophysical informatics |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8371850/ https://www.ncbi.nlm.nih.gov/pubmed/34404340 http://dx.doi.org/10.1186/s12859-021-04323-0 |
work_keys_str_mv | AT dhusiakalyani classificationofproteinproteinassociationratesbasedonbiophysicalinformatics AT wuyinghao classificationofproteinproteinassociationratesbasedonbiophysicalinformatics |