Cargando…

Improving Protein Fold Recognition by Deep Learning Networks

For accurate recognition of protein folds, a deep learning network method (DN-Fold) was developed to predict if a given query-template protein pair belongs to the same structural fold. The input used stemmed from the protein sequence and structural features extracted from the protein pair. We evalua...

Descripción completa

Detalles Bibliográficos
Autores principales: Jo, Taeho, Hou, Jie, Eickholt, Jesse, Cheng, Jianlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4669437/
https://www.ncbi.nlm.nih.gov/pubmed/26634993
http://dx.doi.org/10.1038/srep17573
_version_ 1782404101966921728
author Jo, Taeho
Hou, Jie
Eickholt, Jesse
Cheng, Jianlin
author_facet Jo, Taeho
Hou, Jie
Eickholt, Jesse
Cheng, Jianlin
author_sort Jo, Taeho
collection PubMed
description For accurate recognition of protein folds, a deep learning network method (DN-Fold) was developed to predict if a given query-template protein pair belongs to the same structural fold. The input used stemmed from the protein sequence and structural features extracted from the protein pair. We evaluated the performance of DN-Fold along with 18 different methods on Lindahl’s benchmark dataset and on a large benchmark set extracted from SCOP 1.75 consisting of about one million protein pairs, at three different levels of fold recognition (i.e., protein family, superfamily, and fold) depending on the evolutionary distance between protein sequences. The correct recognition rate of ensembled DN-Fold for Top 1 predictions is 84.5%, 61.5%, and 33.6% and for Top 5 is 91.2%, 76.5%, and 60.7% at family, superfamily, and fold levels, respectively. We also evaluated the performance of single DN-Fold (DN-FoldS), which showed the comparable results at the level of family and superfamily, compared to ensemble DN-Fold. Finally, we extended the binary classification problem of fold recognition to real-value regression task, which also show a promising performance. DN-Fold is freely available through a web server at http://iris.rnet.missouri.edu/dnfold.
format Online
Article
Text
id pubmed-4669437
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-46694372015-12-09 Improving Protein Fold Recognition by Deep Learning Networks Jo, Taeho Hou, Jie Eickholt, Jesse Cheng, Jianlin Sci Rep Article For accurate recognition of protein folds, a deep learning network method (DN-Fold) was developed to predict if a given query-template protein pair belongs to the same structural fold. The input used stemmed from the protein sequence and structural features extracted from the protein pair. We evaluated the performance of DN-Fold along with 18 different methods on Lindahl’s benchmark dataset and on a large benchmark set extracted from SCOP 1.75 consisting of about one million protein pairs, at three different levels of fold recognition (i.e., protein family, superfamily, and fold) depending on the evolutionary distance between protein sequences. The correct recognition rate of ensembled DN-Fold for Top 1 predictions is 84.5%, 61.5%, and 33.6% and for Top 5 is 91.2%, 76.5%, and 60.7% at family, superfamily, and fold levels, respectively. We also evaluated the performance of single DN-Fold (DN-FoldS), which showed the comparable results at the level of family and superfamily, compared to ensemble DN-Fold. Finally, we extended the binary classification problem of fold recognition to real-value regression task, which also show a promising performance. DN-Fold is freely available through a web server at http://iris.rnet.missouri.edu/dnfold. Nature Publishing Group 2015-12-04 /pmc/articles/PMC4669437/ /pubmed/26634993 http://dx.doi.org/10.1038/srep17573 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Jo, Taeho
Hou, Jie
Eickholt, Jesse
Cheng, Jianlin
Improving Protein Fold Recognition by Deep Learning Networks
title Improving Protein Fold Recognition by Deep Learning Networks
title_full Improving Protein Fold Recognition by Deep Learning Networks
title_fullStr Improving Protein Fold Recognition by Deep Learning Networks
title_full_unstemmed Improving Protein Fold Recognition by Deep Learning Networks
title_short Improving Protein Fold Recognition by Deep Learning Networks
title_sort improving protein fold recognition by deep learning networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4669437/
https://www.ncbi.nlm.nih.gov/pubmed/26634993
http://dx.doi.org/10.1038/srep17573
work_keys_str_mv AT jotaeho improvingproteinfoldrecognitionbydeeplearningnetworks
AT houjie improvingproteinfoldrecognitionbydeeplearningnetworks
AT eickholtjesse improvingproteinfoldrecognitionbydeeplearningnetworks
AT chengjianlin improvingproteinfoldrecognitionbydeeplearningnetworks