Cargando…

MpbPPI: a multi-task pre-training-based equivariant approach for the prediction of the effect of amino acid mutations on protein–protein interactions

The accurate prediction of the effect of amino acid mutations for protein–protein interactions (PPI [Formula: see text]) is a crucial task in protein engineering, as it provides insight into the relevant biological processes underpinning protein binding and provides a basis for further drug discover...

Descripción completa

Detalles Bibliográficos
Autores principales: Yue, Yang, Li, Shu, Wang, Lingling, Liu, Huanxiang, Tong, Henry H Y, He, Shan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516393/
https://www.ncbi.nlm.nih.gov/pubmed/37651610
http://dx.doi.org/10.1093/bib/bbad310
_version_ 1785109120566689792
author Yue, Yang
Li, Shu
Wang, Lingling
Liu, Huanxiang
Tong, Henry H Y
He, Shan
author_facet Yue, Yang
Li, Shu
Wang, Lingling
Liu, Huanxiang
Tong, Henry H Y
He, Shan
author_sort Yue, Yang
collection PubMed
description The accurate prediction of the effect of amino acid mutations for protein–protein interactions (PPI [Formula: see text]) is a crucial task in protein engineering, as it provides insight into the relevant biological processes underpinning protein binding and provides a basis for further drug discovery. In this study, we propose MpbPPI, a novel multi-task pre-training-based geometric equivariance-preserving framework to predict PPI   [Formula: see text]. Pre-training on a strictly screened pre-training dataset is employed to address the scarcity of protein–protein complex structures annotated with PPI [Formula: see text] values. MpbPPI employs a multi-task pre-training technique, forcing the framework to learn comprehensive backbone and side chain geometric regulations of protein–protein complexes at different scales. After pre-training, MpbPPI can generate high-quality representations capturing the effective geometric characteristics of labeled protein–protein complexes for downstream [Formula: see text] predictions. MpbPPI serves as a scalable framework supporting different sources of mutant-type (MT) protein–protein complexes for flexible application. Experimental results on four benchmark datasets demonstrate that MpbPPI is a state-of-the-art framework for PPI [Formula: see text] predictions. The data and source code are available at https://github.com/arantir123/MpbPPI.
format Online
Article
Text
id pubmed-10516393
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105163932023-09-23 MpbPPI: a multi-task pre-training-based equivariant approach for the prediction of the effect of amino acid mutations on protein–protein interactions Yue, Yang Li, Shu Wang, Lingling Liu, Huanxiang Tong, Henry H Y He, Shan Brief Bioinform Problem Solving Protocol The accurate prediction of the effect of amino acid mutations for protein–protein interactions (PPI [Formula: see text]) is a crucial task in protein engineering, as it provides insight into the relevant biological processes underpinning protein binding and provides a basis for further drug discovery. In this study, we propose MpbPPI, a novel multi-task pre-training-based geometric equivariance-preserving framework to predict PPI   [Formula: see text]. Pre-training on a strictly screened pre-training dataset is employed to address the scarcity of protein–protein complex structures annotated with PPI [Formula: see text] values. MpbPPI employs a multi-task pre-training technique, forcing the framework to learn comprehensive backbone and side chain geometric regulations of protein–protein complexes at different scales. After pre-training, MpbPPI can generate high-quality representations capturing the effective geometric characteristics of labeled protein–protein complexes for downstream [Formula: see text] predictions. MpbPPI serves as a scalable framework supporting different sources of mutant-type (MT) protein–protein complexes for flexible application. Experimental results on four benchmark datasets demonstrate that MpbPPI is a state-of-the-art framework for PPI [Formula: see text] predictions. The data and source code are available at https://github.com/arantir123/MpbPPI. Oxford University Press 2023-08-31 /pmc/articles/PMC10516393/ /pubmed/37651610 http://dx.doi.org/10.1093/bib/bbad310 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Yue, Yang
Li, Shu
Wang, Lingling
Liu, Huanxiang
Tong, Henry H Y
He, Shan
MpbPPI: a multi-task pre-training-based equivariant approach for the prediction of the effect of amino acid mutations on protein–protein interactions
title MpbPPI: a multi-task pre-training-based equivariant approach for the prediction of the effect of amino acid mutations on protein–protein interactions
title_full MpbPPI: a multi-task pre-training-based equivariant approach for the prediction of the effect of amino acid mutations on protein–protein interactions
title_fullStr MpbPPI: a multi-task pre-training-based equivariant approach for the prediction of the effect of amino acid mutations on protein–protein interactions
title_full_unstemmed MpbPPI: a multi-task pre-training-based equivariant approach for the prediction of the effect of amino acid mutations on protein–protein interactions
title_short MpbPPI: a multi-task pre-training-based equivariant approach for the prediction of the effect of amino acid mutations on protein–protein interactions
title_sort mpbppi: a multi-task pre-training-based equivariant approach for the prediction of the effect of amino acid mutations on protein–protein interactions
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516393/
https://www.ncbi.nlm.nih.gov/pubmed/37651610
http://dx.doi.org/10.1093/bib/bbad310
work_keys_str_mv AT yueyang mpbppiamultitaskpretrainingbasedequivariantapproachforthepredictionoftheeffectofaminoacidmutationsonproteinproteininteractions
AT lishu mpbppiamultitaskpretrainingbasedequivariantapproachforthepredictionoftheeffectofaminoacidmutationsonproteinproteininteractions
AT wanglingling mpbppiamultitaskpretrainingbasedequivariantapproachforthepredictionoftheeffectofaminoacidmutationsonproteinproteininteractions
AT liuhuanxiang mpbppiamultitaskpretrainingbasedequivariantapproachforthepredictionoftheeffectofaminoacidmutationsonproteinproteininteractions
AT tonghenryhy mpbppiamultitaskpretrainingbasedequivariantapproachforthepredictionoftheeffectofaminoacidmutationsonproteinproteininteractions
AT heshan mpbppiamultitaskpretrainingbasedequivariantapproachforthepredictionoftheeffectofaminoacidmutationsonproteinproteininteractions