FALCON2: a web server for high-quality prediction of protein tertiary structures
BACKGROUND: Accurate prediction of protein tertiary structures is highly desired as the knowledge of protein structures provides invaluable insights into protein functions. We have designed two approaches to protein structure prediction, including a template-based modeling approach (called ProALIGN)...
Main authors: | Kong, Lupeng; Ju, Fusong; Zhang, Haicang; Sun, Shiwei; Bu, Dongbo |
---|---|
Format: | Online Article Text |
Language: | English |
Published: |
BioMed Central
2021
|
Subjects: | Software |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8444573/ https://www.ncbi.nlm.nih.gov/pubmed/34525939 http://dx.doi.org/10.1186/s12859-021-04353-8 |
_version_ | 1784568524187893760 |
---|---|
author | Kong, Lupeng Ju, Fusong Zhang, Haicang Sun, Shiwei Bu, Dongbo |
author_facet | Kong, Lupeng Ju, Fusong Zhang, Haicang Sun, Shiwei Bu, Dongbo |
author_sort | Kong, Lupeng |
collection | PubMed |
description | BACKGROUND: Accurate prediction of protein tertiary structures is highly desired as the knowledge of protein structures provides invaluable insights into protein functions. We have designed two approaches to protein structure prediction, including a template-based modeling approach (called ProALIGN) and an ab initio prediction approach (called ProFOLD). Briefly speaking, ProALIGN aligns a target protein with templates through exploiting the patterns of context-specific alignment motifs and then builds the final structure with reference to the homologous templates. In contrast, ProFOLD uses an end-to-end neural network to estimate inter-residue distances of target proteins and builds structures that satisfy these distance constraints. These two approaches emphasize different characteristics of target proteins: ProALIGN exploits structure information of homologous templates of target proteins while ProFOLD exploits the co-evolutionary information carried by homologous protein sequences. Recent progress has shown that the combination of template-based modeling and ab initio approaches is promising. RESULTS: In the study, we present FALCON2, a web server that integrates ProALIGN and ProFOLD to provide high-quality protein structure prediction service. For a target protein, FALCON2 executes ProALIGN and ProFOLD simultaneously to predict possible structures and selects the most likely one as the final prediction result. We evaluated FALCON2 on widely-used benchmarks, including 104 CASP13 (the 13th Critical Assessment of protein Structure Prediction) targets and 91 CASP14 targets. In-depth examination suggests that when high-quality templates are available, ProALIGN is superior to ProFOLD and in other cases, ProFOLD shows better performance. By integrating these two approaches with different emphasis, FALCON2 server outperforms the two individual approaches and also achieves state-of-the-art performance compared with existing approaches. 
CONCLUSIONS: By integrating template-based modeling and ab initio approaches, FALCON2 provides an easy-to-use and high-quality protein structure prediction service for the community and we expect it to enable insights into a deep understanding of protein functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04353-8. |
format | Online Article Text |
id | pubmed-8444573 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-84445732021-09-17 FALCON2: a web server for high-quality prediction of protein tertiary structures Kong, Lupeng Ju, Fusong Zhang, Haicang Sun, Shiwei Bu, Dongbo BMC Bioinformatics Software BACKGROUND: Accurate prediction of protein tertiary structures is highly desired as the knowledge of protein structures provides invaluable insights into protein functions. We have designed two approaches to protein structure prediction, including a template-based modeling approach (called ProALIGN) and an ab initio prediction approach (called ProFOLD). Briefly speaking, ProALIGN aligns a target protein with templates through exploiting the patterns of context-specific alignment motifs and then builds the final structure with reference to the homologous templates. In contrast, ProFOLD uses an end-to-end neural network to estimate inter-residue distances of target proteins and builds structures that satisfy these distance constraints. These two approaches emphasize different characteristics of target proteins: ProALIGN exploits structure information of homologous templates of target proteins while ProFOLD exploits the co-evolutionary information carried by homologous protein sequences. Recent progress has shown that the combination of template-based modeling and ab initio approaches is promising. RESULTS: In the study, we present FALCON2, a web server that integrates ProALIGN and ProFOLD to provide high-quality protein structure prediction service. For a target protein, FALCON2 executes ProALIGN and ProFOLD simultaneously to predict possible structures and selects the most likely one as the final prediction result. We evaluated FALCON2 on widely-used benchmarks, including 104 CASP13 (the 13th Critical Assessment of protein Structure Prediction) targets and 91 CASP14 targets. In-depth examination suggests that when high-quality templates are available, ProALIGN is superior to ProFOLD and in other cases, ProFOLD shows better performance. 
By integrating these two approaches with different emphasis, FALCON2 server outperforms the two individual approaches and also achieves state-of-the-art performance compared with existing approaches. CONCLUSIONS: By integrating template-based modeling and ab initio approaches, FALCON2 provides an easy-to-use and high-quality protein structure prediction service for the community and we expect it to enable insights into a deep understanding of protein functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04353-8. BioMed Central 2021-09-15 /pmc/articles/PMC8444573/ /pubmed/34525939 http://dx.doi.org/10.1186/s12859-021-04353-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Kong, Lupeng Ju, Fusong Zhang, Haicang Sun, Shiwei Bu, Dongbo FALCON2: a web server for high-quality prediction of protein tertiary structures |
title | FALCON2: a web server for high-quality prediction of protein tertiary structures |
title_full | FALCON2: a web server for high-quality prediction of protein tertiary structures |
title_fullStr | FALCON2: a web server for high-quality prediction of protein tertiary structures |
title_full_unstemmed | FALCON2: a web server for high-quality prediction of protein tertiary structures |
title_short | FALCON2: a web server for high-quality prediction of protein tertiary structures |
title_sort | falcon2: a web server for high-quality prediction of protein tertiary structures |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8444573/ https://www.ncbi.nlm.nih.gov/pubmed/34525939 http://dx.doi.org/10.1186/s12859-021-04353-8 |
work_keys_str_mv | AT konglupeng falcon2awebserverforhighqualitypredictionofproteintertiarystructures AT jufusong falcon2awebserverforhighqualitypredictionofproteintertiarystructures AT zhanghaicang falcon2awebserverforhighqualitypredictionofproteintertiarystructures AT sunshiwei falcon2awebserverforhighqualitypredictionofproteintertiarystructures AT budongbo falcon2awebserverforhighqualitypredictionofproteintertiarystructures |
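
The abstract states that FALCON2 executes ProALIGN and ProFOLD simultaneously and selects the most likely structure as the final prediction. The record does not say how "most likely" is measured, so the following is only a minimal sketch of that run-both-then-select pattern. The `Model` class, its `score` field, the two stand-in predictor functions, and the numeric scores are all hypothetical and are not part of FALCON2's actual interface.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Model:
    method: str   # which predictor produced this model
    score: float  # hypothetical quality estimate; higher is assumed better


def predict_and_select(sequence: str,
                       predictors: List[Callable[[str], Model]]) -> Model:
    """Run every predictor on the same sequence concurrently and
    return the model with the highest (assumed) quality score."""
    with ThreadPoolExecutor(max_workers=len(predictors)) as pool:
        models = list(pool.map(lambda predict: predict(sequence), predictors))
    return max(models, key=lambda m: m.score)


# Stand-in predictors with made-up scores; real predictors would build
# 3D structures rather than return constants.
def template_based(sequence: str) -> Model:
    return Model(method="template-based", score=0.62)


def ab_initio(sequence: str) -> Model:
    return Model(method="ab initio", score=0.71)


best = predict_and_select("MKTAYIAKQR", [template_based, ab_initio])
print(best.method)
```

With these made-up scores the ab initio stand-in is selected; when a good template yields a higher score, the template-based model would be chosen instead, mirroring the behaviour the abstract reports for targets with high-quality templates.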