Cargando…
Structure-based protein function prediction using graph convolutional networks
The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features ext...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8155034/ https://www.ncbi.nlm.nih.gov/pubmed/34039967 http://dx.doi.org/10.1038/s41467-021-23303-9 |
_version_ | 1783699123803783168 |
---|---|
author | Gligorijević, Vladimir Renfrew, P. Douglas Kosciolek, Tomasz Leman, Julia Koehler Berenberg, Daniel Vatanen, Tommi Chandler, Chris Taylor, Bryn C. Fisk, Ian M. Vlamakis, Hera Xavier, Ramnik J. Knight, Rob Cho, Kyunghyun Bonneau, Richard |
author_facet | Gligorijević, Vladimir Renfrew, P. Douglas Kosciolek, Tomasz Leman, Julia Koehler Berenberg, Daniel Vatanen, Tommi Chandler, Chris Taylor, Bryn C. Fisk, Ian M. Vlamakis, Hera Xavier, Ramnik J. Knight, Rob Cho, Kyunghyun Bonneau, Richard |
author_sort | Gligorijević, Vladimir |
collection | PubMed |
description | The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features extracted from a protein language model and protein structures. It outperforms current leading methods and sequence-based Convolutional Neural Networks and scales to the size of current sequence repositories. Augmenting the training set of experimental structures with homology models allows us to significantly expand the number of predictable functions. DeepFRI has significant de-noising capability, with only a minor drop in performance when experimental structures are replaced by protein models. Class activation mapping allows function predictions at an unprecedented resolution, allowing site-specific annotations at the residue-level in an automated manner. We show the utility and high performance of our method by annotating structures from the PDB and SWISS-MODEL, making several new confident function predictions. DeepFRI is available as a webserver at https://beta.deepfri.flatironinstitute.org/. |
format | Online Article Text |
id | pubmed-8155034 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-81550342021-06-11 Structure-based protein function prediction using graph convolutional networks Gligorijević, Vladimir Renfrew, P. Douglas Kosciolek, Tomasz Leman, Julia Koehler Berenberg, Daniel Vatanen, Tommi Chandler, Chris Taylor, Bryn C. Fisk, Ian M. Vlamakis, Hera Xavier, Ramnik J. Knight, Rob Cho, Kyunghyun Bonneau, Richard Nat Commun Article The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features extracted from a protein language model and protein structures. It outperforms current leading methods and sequence-based Convolutional Neural Networks and scales to the size of current sequence repositories. Augmenting the training set of experimental structures with homology models allows us to significantly expand the number of predictable functions. DeepFRI has significant de-noising capability, with only a minor drop in performance when experimental structures are replaced by protein models. Class activation mapping allows function predictions at an unprecedented resolution, allowing site-specific annotations at the residue-level in an automated manner. We show the utility and high performance of our method by annotating structures from the PDB and SWISS-MODEL, making several new confident function predictions. DeepFRI is available as a webserver at https://beta.deepfri.flatironinstitute.org/. Nature Publishing Group UK 2021-05-26 /pmc/articles/PMC8155034/ /pubmed/34039967 http://dx.doi.org/10.1038/s41467-021-23303-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Gligorijević, Vladimir Renfrew, P. Douglas Kosciolek, Tomasz Leman, Julia Koehler Berenberg, Daniel Vatanen, Tommi Chandler, Chris Taylor, Bryn C. Fisk, Ian M. Vlamakis, Hera Xavier, Ramnik J. Knight, Rob Cho, Kyunghyun Bonneau, Richard Structure-based protein function prediction using graph convolutional networks |
title | Structure-based protein function prediction using graph convolutional networks |
title_full | Structure-based protein function prediction using graph convolutional networks |
title_fullStr | Structure-based protein function prediction using graph convolutional networks |
title_full_unstemmed | Structure-based protein function prediction using graph convolutional networks |
title_short | Structure-based protein function prediction using graph convolutional networks |
title_sort | structure-based protein function prediction using graph convolutional networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8155034/ https://www.ncbi.nlm.nih.gov/pubmed/34039967 http://dx.doi.org/10.1038/s41467-021-23303-9 |
work_keys_str_mv | AT gligorijevicvladimir structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT renfrewpdouglas structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT kosciolektomasz structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT lemanjuliakoehler structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT berenbergdaniel structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT vatanentommi structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT chandlerchris structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT taylorbrync structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT fiskianm structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT vlamakishera structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT xavierramnikj structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT knightrob structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT chokyunghyun structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks AT bonneaurichard structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks |