Cargando…
Learning protein binding affinity using privileged information
BACKGROUND: Determining protein-protein interactions and their binding affinity are important in understanding cellular biological processes, discovery and design of novel therapeutics, protein engineering, and mutagenesis studies. Due to the time and effort required in wet lab experiments, computat...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6238365/ https://www.ncbi.nlm.nih.gov/pubmed/30442086 http://dx.doi.org/10.1186/s12859-018-2448-z |
_version_ | 1783371363171434496 |
---|---|
author | Abbasi, Wajid Arshad Asif, Amina Ben-Hur, Asa Minhas, Fayyaz ul Amir Afsar |
author_facet | Abbasi, Wajid Arshad Asif, Amina Ben-Hur, Asa Minhas, Fayyaz ul Amir Afsar |
author_sort | Abbasi, Wajid Arshad |
collection | PubMed |
description | BACKGROUND: Determining protein-protein interactions and their binding affinity are important in understanding cellular biological processes, discovery and design of novel therapeutics, protein engineering, and mutagenesis studies. Due to the time and effort required in wet lab experiments, computational prediction of binding affinity from sequence or structure is an important area of research. Structure-based methods, though more accurate than sequence-based techniques, are limited in their applicability due to limited availability of protein structure data. RESULTS: In this study, we propose a novel machine learning method for predicting binding affinity that uses protein 3D structure as privileged information at training time while expecting only protein sequence information during testing. Using the method, which is based on the framework of learning using privileged information (LUPI), we have achieved improved performance over corresponding sequence-based binding affinity prediction methods that do not have access to privileged information during training. Our experiments show that with the proposed framework which uses structure only during training, it is possible to achieve classification performance comparable to that which is obtained using structure-based features. Evaluation on an independent test set shows improved performance over the PPA-Pred2 method as well. CONCLUSIONS: The proposed method outperforms several baseline learners and a state-of-the-art binding affinity predictor not only in cross-validation, but also on an additional validation dataset, demonstrating the utility of the LUPI framework for problems that would benefit from classification using structure-based features. The implementation of LUPI developed for this work is expected to be useful in other areas of bioinformatics as well. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2448-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6238365 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62383652018-11-26 Learning protein binding affinity using privileged information Abbasi, Wajid Arshad Asif, Amina Ben-Hur, Asa Minhas, Fayyaz ul Amir Afsar BMC Bioinformatics Methodology Article BACKGROUND: Determining protein-protein interactions and their binding affinity are important in understanding cellular biological processes, discovery and design of novel therapeutics, protein engineering, and mutagenesis studies. Due to the time and effort required in wet lab experiments, computational prediction of binding affinity from sequence or structure is an important area of research. Structure-based methods, though more accurate than sequence-based techniques, are limited in their applicability due to limited availability of protein structure data. RESULTS: In this study, we propose a novel machine learning method for predicting binding affinity that uses protein 3D structure as privileged information at training time while expecting only protein sequence information during testing. Using the method, which is based on the framework of learning using privileged information (LUPI), we have achieved improved performance over corresponding sequence-based binding affinity prediction methods that do not have access to privileged information during training. Our experiments show that with the proposed framework which uses structure only during training, it is possible to achieve classification performance comparable to that which is obtained using structure-based features. Evaluation on an independent test set shows improved performance over the PPA-Pred2 method as well. CONCLUSIONS: The proposed method outperforms several baseline learners and a state-of-the-art binding affinity predictor not only in cross-validation, but also on an additional validation dataset, demonstrating the utility of the LUPI framework for problems that would benefit from classification using structure-based features. The implementation of LUPI developed for this work is expected to be useful in other areas of bioinformatics as well. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2448-z) contains supplementary material, which is available to authorized users. BioMed Central 2018-11-15 /pmc/articles/PMC6238365/ /pubmed/30442086 http://dx.doi.org/10.1186/s12859-018-2448-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Abbasi, Wajid Arshad Asif, Amina Ben-Hur, Asa Minhas, Fayyaz ul Amir Afsar Learning protein binding affinity using privileged information |
title | Learning protein binding affinity using privileged information |
title_full | Learning protein binding affinity using privileged information |
title_fullStr | Learning protein binding affinity using privileged information |
title_full_unstemmed | Learning protein binding affinity using privileged information |
title_short | Learning protein binding affinity using privileged information |
title_sort | learning protein binding affinity using privileged information |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6238365/ https://www.ncbi.nlm.nih.gov/pubmed/30442086 http://dx.doi.org/10.1186/s12859-018-2448-z |
work_keys_str_mv | AT abbasiwajidarshad learningproteinbindingaffinityusingprivilegedinformation AT asifamina learningproteinbindingaffinityusingprivilegedinformation AT benhurasa learningproteinbindingaffinityusingprivilegedinformation AT minhasfayyazulamirafsar learningproteinbindingaffinityusingprivilegedinformation |