Cargando…

Limits and potential of combined folding and docking

MOTIVATION: In the last decade, de novo protein structure prediction accuracy for individual proteins has improved significantly by utilising deep learning (DL) methods for harvesting the co-evolution information from large multiple sequence alignments (MSAs). The same approach can, in principle, al...

Descripción completa

Detalles Bibliográficos
Autores principales: Pozzati, Gabriele, Zhu, Wensi, Bassot, Claudio, Lamb, John, Kundrotas, Petras, Elofsson, Arne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8796369/
https://www.ncbi.nlm.nih.gov/pubmed/34788800
http://dx.doi.org/10.1093/bioinformatics/btab760
_version_ 1784641292332957696
author Pozzati, Gabriele
Zhu, Wensi
Bassot, Claudio
Lamb, John
Kundrotas, Petras
Elofsson, Arne
author_facet Pozzati, Gabriele
Zhu, Wensi
Bassot, Claudio
Lamb, John
Kundrotas, Petras
Elofsson, Arne
author_sort Pozzati, Gabriele
collection PubMed
description MOTIVATION: In the last decade, de novo protein structure prediction accuracy for individual proteins has improved significantly by utilising deep learning (DL) methods for harvesting the co-evolution information from large multiple sequence alignments (MSAs). The same approach can, in principle, also be used to extract information about evolutionary-based contacts across protein–protein interfaces. However, most earlier studies have not used the latest DL methods for inter-chain contact distance prediction. This article introduces a fold-and-dock method based on predicted residue-residue distances with trRosetta. RESULTS: The method can simultaneously predict the tertiary and quaternary structure of a protein pair, even when the structures of the monomers are not known. The straightforward application of this method to a standard dataset for protein–protein docking yielded limited success. However, using alternative methods for generating MSAs allowed us to dock accurately significantly more proteins. We also introduced a novel scoring function, PconsDock, that accurately separates 98% of correctly and incorrectly folded and docked proteins. The average performance of the method is comparable to the use of traditional, template-based or ab initio shape-complementarity-only docking methods. Moreover, the results of conventional and fold-and-dock approaches are complementary, and thus a combined docking pipeline could increase overall docking success significantly. This methodology contributed to the best model for one of the CASP14 oligomeric targets, H1065. AVAILABILITY AND IMPLEMENTATION: All scripts for predictions and analysis are available from https://github.com/ElofssonLab/bioinfo-toolbox/ and https://gitlab.com/ElofssonLab/benchmark5/. All models joined alignments, and evaluation results are available from the following figshare repository https://doi.org/10.6084/m9.figshare.14654886.v2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8796369
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-87963692022-01-31 Limits and potential of combined folding and docking Pozzati, Gabriele Zhu, Wensi Bassot, Claudio Lamb, John Kundrotas, Petras Elofsson, Arne Bioinformatics Original Papers MOTIVATION: In the last decade, de novo protein structure prediction accuracy for individual proteins has improved significantly by utilising deep learning (DL) methods for harvesting the co-evolution information from large multiple sequence alignments (MSAs). The same approach can, in principle, also be used to extract information about evolutionary-based contacts across protein–protein interfaces. However, most earlier studies have not used the latest DL methods for inter-chain contact distance prediction. This article introduces a fold-and-dock method based on predicted residue-residue distances with trRosetta. RESULTS: The method can simultaneously predict the tertiary and quaternary structure of a protein pair, even when the structures of the monomers are not known. The straightforward application of this method to a standard dataset for protein–protein docking yielded limited success. However, using alternative methods for generating MSAs allowed us to dock accurately significantly more proteins. We also introduced a novel scoring function, PconsDock, that accurately separates 98% of correctly and incorrectly folded and docked proteins. The average performance of the method is comparable to the use of traditional, template-based or ab initio shape-complementarity-only docking methods. Moreover, the results of conventional and fold-and-dock approaches are complementary, and thus a combined docking pipeline could increase overall docking success significantly. This methodology contributed to the best model for one of the CASP14 oligomeric targets, H1065. AVAILABILITY AND IMPLEMENTATION: All scripts for predictions and analysis are available from https://github.com/ElofssonLab/bioinfo-toolbox/ and https://gitlab.com/ElofssonLab/benchmark5/. All models joined alignments, and evaluation results are available from the following figshare repository https://doi.org/10.6084/m9.figshare.14654886.v2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-11-12 /pmc/articles/PMC8796369/ /pubmed/34788800 http://dx.doi.org/10.1093/bioinformatics/btab760 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Pozzati, Gabriele
Zhu, Wensi
Bassot, Claudio
Lamb, John
Kundrotas, Petras
Elofsson, Arne
Limits and potential of combined folding and docking
title Limits and potential of combined folding and docking
title_full Limits and potential of combined folding and docking
title_fullStr Limits and potential of combined folding and docking
title_full_unstemmed Limits and potential of combined folding and docking
title_short Limits and potential of combined folding and docking
title_sort limits and potential of combined folding and docking
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8796369/
https://www.ncbi.nlm.nih.gov/pubmed/34788800
http://dx.doi.org/10.1093/bioinformatics/btab760
work_keys_str_mv AT pozzatigabriele limitsandpotentialofcombinedfoldinganddocking
AT zhuwensi limitsandpotentialofcombinedfoldinganddocking
AT bassotclaudio limitsandpotentialofcombinedfoldinganddocking
AT lambjohn limitsandpotentialofcombinedfoldinganddocking
AT kundrotaspetras limitsandpotentialofcombinedfoldinganddocking
AT elofssonarne limitsandpotentialofcombinedfoldinganddocking