Cargando…
Prediction of Interactions between Cell Surface Proteins by Machine Learning
Cells detect changes of external environments or communicate with each other through proteins on their surfaces. These cell surface proteins form a complicated network of interactions in order to fulfill their functions. The interactions between cell surface proteins are highly dynamic and thus chal...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10515853/ https://www.ncbi.nlm.nih.gov/pubmed/37745607 http://dx.doi.org/10.1101/2023.09.12.557337 |
_version_ | 1785109032785149952 |
---|---|
author | Su, Zhaoqian Griffin, Brian Emmons, Scott Wu, Yinghao |
author_facet | Su, Zhaoqian Griffin, Brian Emmons, Scott Wu, Yinghao |
author_sort | Su, Zhaoqian |
collection | PubMed |
description | Cells detect changes of external environments or communicate with each other through proteins on their surfaces. These cell surface proteins form a complicated network of interactions in order to fulfill their functions. The interactions between cell surface proteins are highly dynamic and thus challenging to detect using traditional experimental techniques. Here we tackle this challenge by a computational framework. The primary focus of the framework is to develop new tools to identify interactions between domains in immunoglobulin (Ig) fold, which is the most abundant domain family in cell surface proteins. These interactions could be formed between ligands and receptors from different cells, or between proteins on the same cell surface. In practice, we collected all structural data of Ig domain interactions and transformed them into an interface fragment pair library. A high dimensional profile can be then constructed from the library for a given pair of query protein sequences. Multiple machine learning models were used to read this profile, so that the probability of interaction between the query proteins can be predicted. We tested our models to an experimentally derived dataset which contains 564 cell surface proteins in human. The cross-validation results show that we can achieve higher than 70% accuracy in identifying the PPIs within this dataset. We then applied this method to a group of 46 cell surface proteins in C elegans. We screened every possible interaction between these proteins. Many interactions recognized by our machine learning classifiers have been experimentally confirmed in the literatures. In conclusion, our computational platform serves a useful tool to help identifying potential new interactions between cell surface proteins in addition to current state-of-the-art experimental techniques. The tool is freely accessible for use by the scientific community. Moreover, the general framework of the machine learning classification can also be extended to study interactions of proteins in other domain superfamilies. |
format | Online Article Text |
id | pubmed-10515853 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-105158532023-09-23 Prediction of Interactions between Cell Surface Proteins by Machine Learning Su, Zhaoqian Griffin, Brian Emmons, Scott Wu, Yinghao bioRxiv Article Cells detect changes of external environments or communicate with each other through proteins on their surfaces. These cell surface proteins form a complicated network of interactions in order to fulfill their functions. The interactions between cell surface proteins are highly dynamic and thus challenging to detect using traditional experimental techniques. Here we tackle this challenge by a computational framework. The primary focus of the framework is to develop new tools to identify interactions between domains in immunoglobulin (Ig) fold, which is the most abundant domain family in cell surface proteins. These interactions could be formed between ligands and receptors from different cells, or between proteins on the same cell surface. In practice, we collected all structural data of Ig domain interactions and transformed them into an interface fragment pair library. A high dimensional profile can be then constructed from the library for a given pair of query protein sequences. Multiple machine learning models were used to read this profile, so that the probability of interaction between the query proteins can be predicted. We tested our models to an experimentally derived dataset which contains 564 cell surface proteins in human. The cross-validation results show that we can achieve higher than 70% accuracy in identifying the PPIs within this dataset. We then applied this method to a group of 46 cell surface proteins in C elegans. We screened every possible interaction between these proteins. Many interactions recognized by our machine learning classifiers have been experimentally confirmed in the literatures. In conclusion, our computational platform serves a useful tool to help identifying potential new interactions between cell surface proteins in addition to current state-of-the-art experimental techniques. The tool is freely accessible for use by the scientific community. Moreover, the general framework of the machine learning classification can also be extended to study interactions of proteins in other domain superfamilies. Cold Spring Harbor Laboratory 2023-09-14 /pmc/articles/PMC10515853/ /pubmed/37745607 http://dx.doi.org/10.1101/2023.09.12.557337 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Su, Zhaoqian Griffin, Brian Emmons, Scott Wu, Yinghao Prediction of Interactions between Cell Surface Proteins by Machine Learning |
title | Prediction of Interactions between Cell Surface Proteins by Machine Learning |
title_full | Prediction of Interactions between Cell Surface Proteins by Machine Learning |
title_fullStr | Prediction of Interactions between Cell Surface Proteins by Machine Learning |
title_full_unstemmed | Prediction of Interactions between Cell Surface Proteins by Machine Learning |
title_short | Prediction of Interactions between Cell Surface Proteins by Machine Learning |
title_sort | prediction of interactions between cell surface proteins by machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10515853/ https://www.ncbi.nlm.nih.gov/pubmed/37745607 http://dx.doi.org/10.1101/2023.09.12.557337 |
work_keys_str_mv | AT suzhaoqian predictionofinteractionsbetweencellsurfaceproteinsbymachinelearning AT griffinbrian predictionofinteractionsbetweencellsurfaceproteinsbymachinelearning AT emmonsscott predictionofinteractionsbetweencellsurfaceproteinsbymachinelearning AT wuyinghao predictionofinteractionsbetweencellsurfaceproteinsbymachinelearning |