Cargando…

Structure-based protein function prediction using graph convolutional networks

The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features ext...

Descripción completa

Detalles Bibliográficos
Autores principales: Gligorijević, Vladimir, Renfrew, P. Douglas, Kosciolek, Tomasz, Leman, Julia Koehler, Berenberg, Daniel, Vatanen, Tommi, Chandler, Chris, Taylor, Bryn C., Fisk, Ian M., Vlamakis, Hera, Xavier, Ramnik J., Knight, Rob, Cho, Kyunghyun, Bonneau, Richard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8155034/
https://www.ncbi.nlm.nih.gov/pubmed/34039967
http://dx.doi.org/10.1038/s41467-021-23303-9
_version_ 1783699123803783168
author Gligorijević, Vladimir
Renfrew, P. Douglas
Kosciolek, Tomasz
Leman, Julia Koehler
Berenberg, Daniel
Vatanen, Tommi
Chandler, Chris
Taylor, Bryn C.
Fisk, Ian M.
Vlamakis, Hera
Xavier, Ramnik J.
Knight, Rob
Cho, Kyunghyun
Bonneau, Richard
author_facet Gligorijević, Vladimir
Renfrew, P. Douglas
Kosciolek, Tomasz
Leman, Julia Koehler
Berenberg, Daniel
Vatanen, Tommi
Chandler, Chris
Taylor, Bryn C.
Fisk, Ian M.
Vlamakis, Hera
Xavier, Ramnik J.
Knight, Rob
Cho, Kyunghyun
Bonneau, Richard
author_sort Gligorijević, Vladimir
collection PubMed
description The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features extracted from a protein language model and protein structures. It outperforms current leading methods and sequence-based Convolutional Neural Networks and scales to the size of current sequence repositories. Augmenting the training set of experimental structures with homology models allows us to significantly expand the number of predictable functions. DeepFRI has significant de-noising capability, with only a minor drop in performance when experimental structures are replaced by protein models. Class activation mapping allows function predictions at an unprecedented resolution, allowing site-specific annotations at the residue-level in an automated manner. We show the utility and high performance of our method by annotating structures from the PDB and SWISS-MODEL, making several new confident function predictions. DeepFRI is available as a webserver at https://beta.deepfri.flatironinstitute.org/.
format Online
Article
Text
id pubmed-8155034
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81550342021-06-11 Structure-based protein function prediction using graph convolutional networks Gligorijević, Vladimir Renfrew, P. Douglas Kosciolek, Tomasz Leman, Julia Koehler Berenberg, Daniel Vatanen, Tommi Chandler, Chris Taylor, Bryn C. Fisk, Ian M. Vlamakis, Hera Xavier, Ramnik J. Knight, Rob Cho, Kyunghyun Bonneau, Richard Nat Commun Article The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features extracted from a protein language model and protein structures. It outperforms current leading methods and sequence-based Convolutional Neural Networks and scales to the size of current sequence repositories. Augmenting the training set of experimental structures with homology models allows us to significantly expand the number of predictable functions. DeepFRI has significant de-noising capability, with only a minor drop in performance when experimental structures are replaced by protein models. Class activation mapping allows function predictions at an unprecedented resolution, allowing site-specific annotations at the residue-level in an automated manner. We show the utility and high performance of our method by annotating structures from the PDB and SWISS-MODEL, making several new confident function predictions. DeepFRI is available as a webserver at https://beta.deepfri.flatironinstitute.org/. Nature Publishing Group UK 2021-05-26 /pmc/articles/PMC8155034/ /pubmed/34039967 http://dx.doi.org/10.1038/s41467-021-23303-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Gligorijević, Vladimir
Renfrew, P. Douglas
Kosciolek, Tomasz
Leman, Julia Koehler
Berenberg, Daniel
Vatanen, Tommi
Chandler, Chris
Taylor, Bryn C.
Fisk, Ian M.
Vlamakis, Hera
Xavier, Ramnik J.
Knight, Rob
Cho, Kyunghyun
Bonneau, Richard
Structure-based protein function prediction using graph convolutional networks
title Structure-based protein function prediction using graph convolutional networks
title_full Structure-based protein function prediction using graph convolutional networks
title_fullStr Structure-based protein function prediction using graph convolutional networks
title_full_unstemmed Structure-based protein function prediction using graph convolutional networks
title_short Structure-based protein function prediction using graph convolutional networks
title_sort structure-based protein function prediction using graph convolutional networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8155034/
https://www.ncbi.nlm.nih.gov/pubmed/34039967
http://dx.doi.org/10.1038/s41467-021-23303-9
work_keys_str_mv AT gligorijevicvladimir structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT renfrewpdouglas structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT kosciolektomasz structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT lemanjuliakoehler structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT berenbergdaniel structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT vatanentommi structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT chandlerchris structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT taylorbrync structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT fiskianm structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT vlamakishera structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT xavierramnikj structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT knightrob structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT chokyunghyun structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks
AT bonneaurichard structurebasedproteinfunctionpredictionusinggraphconvolutionalnetworks