Cargando…

PDBench: evaluating computational methods for protein-sequence design

SUMMARY: Ever increasing amounts of protein structure data, combined with advances in machine learning, have led to the rapid proliferation of methods available for protein-sequence design. In order to utilize a design method effectively, it is important to understand the nuances of its performance...

Descripción completa

Detalles Bibliográficos
Autores principales: Castorina, Leonardo V, Petrenas, Rokas, Subr, Kartic, Wood, Christopher W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9869650/
https://www.ncbi.nlm.nih.gov/pubmed/36637198
http://dx.doi.org/10.1093/bioinformatics/btad027
_version_ 1784876812965249024
author Castorina, Leonardo V
Petrenas, Rokas
Subr, Kartic
Wood, Christopher W
author_facet Castorina, Leonardo V
Petrenas, Rokas
Subr, Kartic
Wood, Christopher W
author_sort Castorina, Leonardo V
collection PubMed
description SUMMARY: Ever increasing amounts of protein structure data, combined with advances in machine learning, have led to the rapid proliferation of methods available for protein-sequence design. In order to utilize a design method effectively, it is important to understand the nuances of its performance and how it varies by design target. Here, we present PDBench, a set of proteins and a number of standard tests for assessing the performance of sequence-design methods. PDBench aims to maximize the structural diversity of the benchmark, compared with previous benchmarking sets, in order to provide useful biological insight into the behaviour of sequence-design methods, which is essential for evaluating their performance and practical utility. We believe that these tools are useful for guiding the development of novel sequence design algorithms and will enable users to choose a method that best suits their design target. AVAILABILITY AND IMPLEMENTATION: https://github.com/wells-wood-research/PDBench SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9869650
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98696502023-01-23 PDBench: evaluating computational methods for protein-sequence design Castorina, Leonardo V Petrenas, Rokas Subr, Kartic Wood, Christopher W Bioinformatics Applications Note SUMMARY: Ever increasing amounts of protein structure data, combined with advances in machine learning, have led to the rapid proliferation of methods available for protein-sequence design. In order to utilize a design method effectively, it is important to understand the nuances of its performance and how it varies by design target. Here, we present PDBench, a set of proteins and a number of standard tests for assessing the performance of sequence-design methods. PDBench aims to maximize the structural diversity of the benchmark, compared with previous benchmarking sets, in order to provide useful biological insight into the behaviour of sequence-design methods, which is essential for evaluating their performance and practical utility. We believe that these tools are useful for guiding the development of novel sequence design algorithms and will enable users to choose a method that best suits their design target. AVAILABILITY AND IMPLEMENTATION: https://github.com/wells-wood-research/PDBench SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-01-13 /pmc/articles/PMC9869650/ /pubmed/36637198 http://dx.doi.org/10.1093/bioinformatics/btad027 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Castorina, Leonardo V
Petrenas, Rokas
Subr, Kartic
Wood, Christopher W
PDBench: evaluating computational methods for protein-sequence design
title PDBench: evaluating computational methods for protein-sequence design
title_full PDBench: evaluating computational methods for protein-sequence design
title_fullStr PDBench: evaluating computational methods for protein-sequence design
title_full_unstemmed PDBench: evaluating computational methods for protein-sequence design
title_short PDBench: evaluating computational methods for protein-sequence design
title_sort pdbench: evaluating computational methods for protein-sequence design
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9869650/
https://www.ncbi.nlm.nih.gov/pubmed/36637198
http://dx.doi.org/10.1093/bioinformatics/btad027
work_keys_str_mv AT castorinaleonardov pdbenchevaluatingcomputationalmethodsforproteinsequencedesign
AT petrenasrokas pdbenchevaluatingcomputationalmethodsforproteinsequencedesign
AT subrkartic pdbenchevaluatingcomputationalmethodsforproteinsequencedesign
AT woodchristopherw pdbenchevaluatingcomputationalmethodsforproteinsequencedesign