Cargando…
Protein quality assessment with a loss function designed for high-quality decoys
Motivation: The prediction of a protein 3D structure is essential for understanding protein function, drug discovery, and disease mechanisms; with the advent of methods like AlphaFold that are capable of producing very high-quality decoys, ensuring the quality of those decoys can provide further con...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10616882/ https://www.ncbi.nlm.nih.gov/pubmed/37915563 http://dx.doi.org/10.3389/fbinf.2023.1198218 |
_version_ | 1785129491832504320 |
---|---|
author | Roy, Soumyadip Ben-Hur, Asa |
author_facet | Roy, Soumyadip Ben-Hur, Asa |
author_sort | Roy, Soumyadip |
collection | PubMed |
description | Motivation: The prediction of a protein 3D structure is essential for understanding protein function, drug discovery, and disease mechanisms; with the advent of methods like AlphaFold that are capable of producing very high-quality decoys, ensuring the quality of those decoys can provide further confidence in the accuracy of their predictions. Results: In this work, we describe Q( ϵ ), a graph convolutional network (GCN) that utilizes a minimal set of atom and residue features as inputs to predict the global distance test total score (GDTTS) and local distance difference test (lDDT) score of a decoy. To improve the model’s performance, we introduce a novel loss function based on the ϵ-insensitive loss function used for SVM regression. This loss function is specifically designed for evaluating the characteristics of the quality assessment problem and provides predictions with improved accuracy over standard loss functions used for this task. Despite using only a minimal set of features, it matches the performance of recent state-of-the-art methods like DeepUMQA. Availability: The code for Q( ϵ ) is available at https://github.com/soumyadip1997/qepsilon. |
format | Online Article Text |
id | pubmed-10616882 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-106168822023-11-01 Protein quality assessment with a loss function designed for high-quality decoys Roy, Soumyadip Ben-Hur, Asa Front Bioinform Bioinformatics Motivation: The prediction of a protein 3D structure is essential for understanding protein function, drug discovery, and disease mechanisms; with the advent of methods like AlphaFold that are capable of producing very high-quality decoys, ensuring the quality of those decoys can provide further confidence in the accuracy of their predictions. Results: In this work, we describe Q( ϵ ), a graph convolutional network (GCN) that utilizes a minimal set of atom and residue features as inputs to predict the global distance test total score (GDTTS) and local distance difference test (lDDT) score of a decoy. To improve the model’s performance, we introduce a novel loss function based on the ϵ-insensitive loss function used for SVM regression. This loss function is specifically designed for evaluating the characteristics of the quality assessment problem and provides predictions with improved accuracy over standard loss functions used for this task. Despite using only a minimal set of features, it matches the performance of recent state-of-the-art methods like DeepUMQA. Availability: The code for Q( ϵ ) is available at https://github.com/soumyadip1997/qepsilon. Frontiers Media S.A. 2023-10-17 /pmc/articles/PMC10616882/ /pubmed/37915563 http://dx.doi.org/10.3389/fbinf.2023.1198218 Text en Copyright © 2023 Roy and Ben-Hur. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioinformatics Roy, Soumyadip Ben-Hur, Asa Protein quality assessment with a loss function designed for high-quality decoys |
title | Protein quality assessment with a loss function designed for high-quality decoys |
title_full | Protein quality assessment with a loss function designed for high-quality decoys |
title_fullStr | Protein quality assessment with a loss function designed for high-quality decoys |
title_full_unstemmed | Protein quality assessment with a loss function designed for high-quality decoys |
title_short | Protein quality assessment with a loss function designed for high-quality decoys |
title_sort | protein quality assessment with a loss function designed for high-quality decoys |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10616882/ https://www.ncbi.nlm.nih.gov/pubmed/37915563 http://dx.doi.org/10.3389/fbinf.2023.1198218 |
work_keys_str_mv | AT roysoumyadip proteinqualityassessmentwithalossfunctiondesignedforhighqualitydecoys AT benhurasa proteinqualityassessmentwithalossfunctiondesignedforhighqualitydecoys |