Cargando…

Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13

Predicting residue‐residue distance relationships (eg, contacts) has become the key direction to advance protein structure prediction since 2014 CASP11 experiment, while deep learning has revolutionized the technology for contact and distance distribution prediction since its debut in 2012 CASP10 ex...

Descripción completa

Detalles Bibliográficos
Autores principales: Hou, Jie, Wu, Tianqi, Cao, Renzhi, Cheng, Jianlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6800999/
https://www.ncbi.nlm.nih.gov/pubmed/30985027
http://dx.doi.org/10.1002/prot.25697
_version_ 1783460515180183552
author Hou, Jie
Wu, Tianqi
Cao, Renzhi
Cheng, Jianlin
author_facet Hou, Jie
Wu, Tianqi
Cao, Renzhi
Cheng, Jianlin
author_sort Hou, Jie
collection PubMed
description Predicting residue‐residue distance relationships (eg, contacts) has become the key direction to advance protein structure prediction since 2014 CASP11 experiment, while deep learning has revolutionized the technology for contact and distance distribution prediction since its debut in 2012 CASP10 experiment. During 2018 CASP13 experiment, we enhanced our MULTICOM protein structure prediction system with three major components: contact distance prediction based on deep convolutional neural networks, distance‐driven template‐free (ab initio) modeling, and protein model ranking empowered by deep learning and contact prediction. Our experiment demonstrates that contact distance prediction and deep learning methods are the key reasons that MULTICOM was ranked 3rd out of all 98 predictors in both template‐free and template‐based structure modeling in CASP13. Deep convolutional neural network can utilize global information in pairwise residue‐residue features such as coevolution scores to substantially improve contact distance prediction, which played a decisive role in correctly folding some free modeling and hard template‐based modeling targets. Deep learning also successfully integrated one‐dimensional structural features, two‐dimensional contact information, and three‐dimensional structural quality scores to improve protein model quality assessment, where the contact prediction was demonstrated to consistently enhance ranking of protein models for the first time. The success of MULTICOM system clearly shows that protein contact distance prediction and model selection driven by deep learning holds the key of solving protein structure prediction problem. However, there are still challenges in accurately predicting protein contact distance when there are few homologous sequences, folding proteins from noisy contact distances, and ranking models of hard targets.
format Online
Article
Text
id pubmed-6800999
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-68009992019-12-19 Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13 Hou, Jie Wu, Tianqi Cao, Renzhi Cheng, Jianlin Proteins 3d Structure Modeling Predicting residue‐residue distance relationships (eg, contacts) has become the key direction to advance protein structure prediction since 2014 CASP11 experiment, while deep learning has revolutionized the technology for contact and distance distribution prediction since its debut in 2012 CASP10 experiment. During 2018 CASP13 experiment, we enhanced our MULTICOM protein structure prediction system with three major components: contact distance prediction based on deep convolutional neural networks, distance‐driven template‐free (ab initio) modeling, and protein model ranking empowered by deep learning and contact prediction. Our experiment demonstrates that contact distance prediction and deep learning methods are the key reasons that MULTICOM was ranked 3rd out of all 98 predictors in both template‐free and template‐based structure modeling in CASP13. Deep convolutional neural network can utilize global information in pairwise residue‐residue features such as coevolution scores to substantially improve contact distance prediction, which played a decisive role in correctly folding some free modeling and hard template‐based modeling targets. Deep learning also successfully integrated one‐dimensional structural features, two‐dimensional contact information, and three‐dimensional structural quality scores to improve protein model quality assessment, where the contact prediction was demonstrated to consistently enhance ranking of protein models for the first time. The success of MULTICOM system clearly shows that protein contact distance prediction and model selection driven by deep learning holds the key of solving protein structure prediction problem. However, there are still challenges in accurately predicting protein contact distance when there are few homologous sequences, folding proteins from noisy contact distances, and ranking models of hard targets. John Wiley & Sons, Inc. 2019-04-25 2019-12 /pmc/articles/PMC6800999/ /pubmed/30985027 http://dx.doi.org/10.1002/prot.25697 Text en © 2019 The Authors. Proteins: Structure, Function, and Bioinformatics published by Wiley Periodicals, Inc. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle 3d Structure Modeling
Hou, Jie
Wu, Tianqi
Cao, Renzhi
Cheng, Jianlin
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13
title Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13
title_full Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13
title_fullStr Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13
title_full_unstemmed Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13
title_short Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13
title_sort protein tertiary structure modeling driven by deep learning and contact distance prediction in casp13
topic 3d Structure Modeling
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6800999/
https://www.ncbi.nlm.nih.gov/pubmed/30985027
http://dx.doi.org/10.1002/prot.25697
work_keys_str_mv AT houjie proteintertiarystructuremodelingdrivenbydeeplearningandcontactdistancepredictionincasp13
AT wutianqi proteintertiarystructuremodelingdrivenbydeeplearningandcontactdistancepredictionincasp13
AT caorenzhi proteintertiarystructuremodelingdrivenbydeeplearningandcontactdistancepredictionincasp13
AT chengjianlin proteintertiarystructuremodelingdrivenbydeeplearningandcontactdistancepredictionincasp13