Cargando…

Protein quality assessment with a loss function designed for high-quality decoys

Motivation: The prediction of a protein 3D structure is essential for understanding protein function, drug discovery, and disease mechanisms; with the advent of methods like AlphaFold that are capable of producing very high-quality decoys, ensuring the quality of those decoys can provide further con...

Descripción completa

Detalles Bibliográficos
Autores principales: Roy, Soumyadip, Ben-Hur, Asa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10616882/
https://www.ncbi.nlm.nih.gov/pubmed/37915563
http://dx.doi.org/10.3389/fbinf.2023.1198218
_version_ 1785129491832504320
author Roy, Soumyadip
Ben-Hur, Asa
author_facet Roy, Soumyadip
Ben-Hur, Asa
author_sort Roy, Soumyadip
collection PubMed
description Motivation: The prediction of a protein 3D structure is essential for understanding protein function, drug discovery, and disease mechanisms; with the advent of methods like AlphaFold that are capable of producing very high-quality decoys, ensuring the quality of those decoys can provide further confidence in the accuracy of their predictions. Results: In this work, we describe Q( ϵ ), a graph convolutional network (GCN) that utilizes a minimal set of atom and residue features as inputs to predict the global distance test total score (GDTTS) and local distance difference test (lDDT) score of a decoy. To improve the model’s performance, we introduce a novel loss function based on the ϵ-insensitive loss function used for SVM regression. This loss function is specifically designed for evaluating the characteristics of the quality assessment problem and provides predictions with improved accuracy over standard loss functions used for this task. Despite using only a minimal set of features, it matches the performance of recent state-of-the-art methods like DeepUMQA. Availability: The code for Q( ϵ ) is available at https://github.com/soumyadip1997/qepsilon.
format Online
Article
Text
id pubmed-10616882
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-106168822023-11-01 Protein quality assessment with a loss function designed for high-quality decoys Roy, Soumyadip Ben-Hur, Asa Front Bioinform Bioinformatics Motivation: The prediction of a protein 3D structure is essential for understanding protein function, drug discovery, and disease mechanisms; with the advent of methods like AlphaFold that are capable of producing very high-quality decoys, ensuring the quality of those decoys can provide further confidence in the accuracy of their predictions. Results: In this work, we describe Q( ϵ ), a graph convolutional network (GCN) that utilizes a minimal set of atom and residue features as inputs to predict the global distance test total score (GDTTS) and local distance difference test (lDDT) score of a decoy. To improve the model’s performance, we introduce a novel loss function based on the ϵ-insensitive loss function used for SVM regression. This loss function is specifically designed for evaluating the characteristics of the quality assessment problem and provides predictions with improved accuracy over standard loss functions used for this task. Despite using only a minimal set of features, it matches the performance of recent state-of-the-art methods like DeepUMQA. Availability: The code for Q( ϵ ) is available at https://github.com/soumyadip1997/qepsilon. Frontiers Media S.A. 2023-10-17 /pmc/articles/PMC10616882/ /pubmed/37915563 http://dx.doi.org/10.3389/fbinf.2023.1198218 Text en Copyright © 2023 Roy and Ben-Hur. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Roy, Soumyadip
Ben-Hur, Asa
Protein quality assessment with a loss function designed for high-quality decoys
title Protein quality assessment with a loss function designed for high-quality decoys
title_full Protein quality assessment with a loss function designed for high-quality decoys
title_fullStr Protein quality assessment with a loss function designed for high-quality decoys
title_full_unstemmed Protein quality assessment with a loss function designed for high-quality decoys
title_short Protein quality assessment with a loss function designed for high-quality decoys
title_sort protein quality assessment with a loss function designed for high-quality decoys
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10616882/
https://www.ncbi.nlm.nih.gov/pubmed/37915563
http://dx.doi.org/10.3389/fbinf.2023.1198218
work_keys_str_mv AT roysoumyadip proteinqualityassessmentwithalossfunctiondesignedforhighqualitydecoys
AT benhurasa proteinqualityassessmentwithalossfunctiondesignedforhighqualitydecoys