Cargando…
NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning
Recent advances in machine learning and natural language processing have made it possible to profoundly advance our ability to accurately predict protein structures and their functions. While such improvements are significantly impacting the fields of biology and biotechnology at large, such methods...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252760/ https://www.ncbi.nlm.nih.gov/pubmed/35648435 http://dx.doi.org/10.1093/nar/gkac439 |
_version_ | 1784740341738373120 |
---|---|
author | Høie, Magnus Haraldson Kiehl, Erik Nicolas Petersen, Bent Nielsen, Morten Winther, Ole Nielsen, Henrik Hallgren, Jeppe Marcatili, Paolo |
author_facet | Høie, Magnus Haraldson Kiehl, Erik Nicolas Petersen, Bent Nielsen, Morten Winther, Ole Nielsen, Henrik Hallgren, Jeppe Marcatili, Paolo |
author_sort | Høie, Magnus Haraldson |
collection | PubMed |
description | Recent advances in machine learning and natural language processing have made it possible to profoundly advance our ability to accurately predict protein structures and their functions. While such improvements are significantly impacting the fields of biology and biotechnology at large, such methods have the downside of high demands in terms of computing power and runtime, hampering their applicability to large datasets. Here, we present NetSurfP-3.0, a tool for predicting solvent accessibility, secondary structure, structural disorder and backbone dihedral angles for each residue of an amino acid sequence. This NetSurfP update exploits recent advances in pre-trained protein language models to drastically improve the runtime of its predecessor by two orders of magnitude, while displaying similar prediction performance. We assessed the accuracy of NetSurfP-3.0 on several independent test datasets and found it to consistently produce state-of-the-art predictions for each of its output features, with a runtime that is up to to 600 times faster than the most commonly available methods performing the same tasks. The tool is freely available as a web server with a user-friendly interface to navigate the results, as well as a standalone downloadable package. |
format | Online Article Text |
id | pubmed-9252760 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-92527602022-07-05 NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning Høie, Magnus Haraldson Kiehl, Erik Nicolas Petersen, Bent Nielsen, Morten Winther, Ole Nielsen, Henrik Hallgren, Jeppe Marcatili, Paolo Nucleic Acids Res Web Server Issue Recent advances in machine learning and natural language processing have made it possible to profoundly advance our ability to accurately predict protein structures and their functions. While such improvements are significantly impacting the fields of biology and biotechnology at large, such methods have the downside of high demands in terms of computing power and runtime, hampering their applicability to large datasets. Here, we present NetSurfP-3.0, a tool for predicting solvent accessibility, secondary structure, structural disorder and backbone dihedral angles for each residue of an amino acid sequence. This NetSurfP update exploits recent advances in pre-trained protein language models to drastically improve the runtime of its predecessor by two orders of magnitude, while displaying similar prediction performance. We assessed the accuracy of NetSurfP-3.0 on several independent test datasets and found it to consistently produce state-of-the-art predictions for each of its output features, with a runtime that is up to to 600 times faster than the most commonly available methods performing the same tasks. The tool is freely available as a web server with a user-friendly interface to navigate the results, as well as a standalone downloadable package. Oxford University Press 2022-06-01 /pmc/articles/PMC9252760/ /pubmed/35648435 http://dx.doi.org/10.1093/nar/gkac439 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Web Server Issue Høie, Magnus Haraldson Kiehl, Erik Nicolas Petersen, Bent Nielsen, Morten Winther, Ole Nielsen, Henrik Hallgren, Jeppe Marcatili, Paolo NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning |
title | NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning |
title_full | NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning |
title_fullStr | NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning |
title_full_unstemmed | NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning |
title_short | NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning |
title_sort | netsurfp-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning |
topic | Web Server Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252760/ https://www.ncbi.nlm.nih.gov/pubmed/35648435 http://dx.doi.org/10.1093/nar/gkac439 |
work_keys_str_mv | AT høiemagnusharaldson netsurfp30accurateandfastpredictionofproteinstructuralfeaturesbyproteinlanguagemodelsanddeeplearning AT kiehleriknicolas netsurfp30accurateandfastpredictionofproteinstructuralfeaturesbyproteinlanguagemodelsanddeeplearning AT petersenbent netsurfp30accurateandfastpredictionofproteinstructuralfeaturesbyproteinlanguagemodelsanddeeplearning AT nielsenmorten netsurfp30accurateandfastpredictionofproteinstructuralfeaturesbyproteinlanguagemodelsanddeeplearning AT wintherole netsurfp30accurateandfastpredictionofproteinstructuralfeaturesbyproteinlanguagemodelsanddeeplearning AT nielsenhenrik netsurfp30accurateandfastpredictionofproteinstructuralfeaturesbyproteinlanguagemodelsanddeeplearning AT hallgrenjeppe netsurfp30accurateandfastpredictionofproteinstructuralfeaturesbyproteinlanguagemodelsanddeeplearning AT marcatilipaolo netsurfp30accurateandfastpredictionofproteinstructuralfeaturesbyproteinlanguagemodelsanddeeplearning |