Cargando…
Limits and potential of combined folding and docking
MOTIVATION: In the last decade, de novo protein structure prediction accuracy for individual proteins has improved significantly by utilising deep learning (DL) methods for harvesting the co-evolution information from large multiple sequence alignments (MSAs). The same approach can, in principle, al...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8796369/ https://www.ncbi.nlm.nih.gov/pubmed/34788800 http://dx.doi.org/10.1093/bioinformatics/btab760 |
_version_ | 1784641292332957696 |
---|---|
author | Pozzati, Gabriele Zhu, Wensi Bassot, Claudio Lamb, John Kundrotas, Petras Elofsson, Arne |
author_facet | Pozzati, Gabriele Zhu, Wensi Bassot, Claudio Lamb, John Kundrotas, Petras Elofsson, Arne |
author_sort | Pozzati, Gabriele |
collection | PubMed |
description | MOTIVATION: In the last decade, de novo protein structure prediction accuracy for individual proteins has improved significantly by utilising deep learning (DL) methods for harvesting the co-evolution information from large multiple sequence alignments (MSAs). The same approach can, in principle, also be used to extract information about evolutionary-based contacts across protein–protein interfaces. However, most earlier studies have not used the latest DL methods for inter-chain contact distance prediction. This article introduces a fold-and-dock method based on predicted residue-residue distances with trRosetta. RESULTS: The method can simultaneously predict the tertiary and quaternary structure of a protein pair, even when the structures of the monomers are not known. The straightforward application of this method to a standard dataset for protein–protein docking yielded limited success. However, using alternative methods for generating MSAs allowed us to dock accurately significantly more proteins. We also introduced a novel scoring function, PconsDock, that accurately separates 98% of correctly and incorrectly folded and docked proteins. The average performance of the method is comparable to the use of traditional, template-based or ab initio shape-complementarity-only docking methods. Moreover, the results of conventional and fold-and-dock approaches are complementary, and thus a combined docking pipeline could increase overall docking success significantly. This methodology contributed to the best model for one of the CASP14 oligomeric targets, H1065. AVAILABILITY AND IMPLEMENTATION: All scripts for predictions and analysis are available from https://github.com/ElofssonLab/bioinfo-toolbox/ and https://gitlab.com/ElofssonLab/benchmark5/. All models joined alignments, and evaluation results are available from the following figshare repository https://doi.org/10.6084/m9.figshare.14654886.v2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8796369 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-87963692022-01-31 Limits and potential of combined folding and docking Pozzati, Gabriele Zhu, Wensi Bassot, Claudio Lamb, John Kundrotas, Petras Elofsson, Arne Bioinformatics Original Papers MOTIVATION: In the last decade, de novo protein structure prediction accuracy for individual proteins has improved significantly by utilising deep learning (DL) methods for harvesting the co-evolution information from large multiple sequence alignments (MSAs). The same approach can, in principle, also be used to extract information about evolutionary-based contacts across protein–protein interfaces. However, most earlier studies have not used the latest DL methods for inter-chain contact distance prediction. This article introduces a fold-and-dock method based on predicted residue-residue distances with trRosetta. RESULTS: The method can simultaneously predict the tertiary and quaternary structure of a protein pair, even when the structures of the monomers are not known. The straightforward application of this method to a standard dataset for protein–protein docking yielded limited success. However, using alternative methods for generating MSAs allowed us to dock accurately significantly more proteins. We also introduced a novel scoring function, PconsDock, that accurately separates 98% of correctly and incorrectly folded and docked proteins. The average performance of the method is comparable to the use of traditional, template-based or ab initio shape-complementarity-only docking methods. Moreover, the results of conventional and fold-and-dock approaches are complementary, and thus a combined docking pipeline could increase overall docking success significantly. This methodology contributed to the best model for one of the CASP14 oligomeric targets, H1065. AVAILABILITY AND IMPLEMENTATION: All scripts for predictions and analysis are available from https://github.com/ElofssonLab/bioinfo-toolbox/ and https://gitlab.com/ElofssonLab/benchmark5/. All models joined alignments, and evaluation results are available from the following figshare repository https://doi.org/10.6084/m9.figshare.14654886.v2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-11-12 /pmc/articles/PMC8796369/ /pubmed/34788800 http://dx.doi.org/10.1093/bioinformatics/btab760 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Pozzati, Gabriele Zhu, Wensi Bassot, Claudio Lamb, John Kundrotas, Petras Elofsson, Arne Limits and potential of combined folding and docking |
title | Limits and potential of combined folding and docking |
title_full | Limits and potential of combined folding and docking |
title_fullStr | Limits and potential of combined folding and docking |
title_full_unstemmed | Limits and potential of combined folding and docking |
title_short | Limits and potential of combined folding and docking |
title_sort | limits and potential of combined folding and docking |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8796369/ https://www.ncbi.nlm.nih.gov/pubmed/34788800 http://dx.doi.org/10.1093/bioinformatics/btab760 |
work_keys_str_mv | AT pozzatigabriele limitsandpotentialofcombinedfoldinganddocking AT zhuwensi limitsandpotentialofcombinedfoldinganddocking AT bassotclaudio limitsandpotentialofcombinedfoldinganddocking AT lambjohn limitsandpotentialofcombinedfoldinganddocking AT kundrotaspetras limitsandpotentialofcombinedfoldinganddocking AT elofssonarne limitsandpotentialofcombinedfoldinganddocking |