Cargando…

Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14

The trRosetta structure prediction method employs deep learning to generate predicted residue‐residue distance and orientation distributions from which 3D models are built. We sought to improve the method by incorporating as inputs (in addition to sequence information) both language model embeddings...

Descripción completa

Detalles Bibliográficos
Autores principales:	Anishchenko, Ivan, Baek, Minkyung, Park, Hahnbeom, Hiranuma, Naozumi, Kim, David E., Dauparas, Justas, Mansoor, Sanaa, Humphreys, Ian R., Baker, David
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley & Sons, Inc. 2021
Materias:	Research Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8616808/ https://www.ncbi.nlm.nih.gov/pubmed/34331359 http://dx.doi.org/10.1002/prot.26194

_version_	1784604418575958016
author	Anishchenko, Ivan Baek, Minkyung Park, Hahnbeom Hiranuma, Naozumi Kim, David E. Dauparas, Justas Mansoor, Sanaa Humphreys, Ian R. Baker, David
author_facet	Anishchenko, Ivan Baek, Minkyung Park, Hahnbeom Hiranuma, Naozumi Kim, David E. Dauparas, Justas Mansoor, Sanaa Humphreys, Ian R. Baker, David
author_sort	Anishchenko, Ivan
collection	PubMed
description	The trRosetta structure prediction method employs deep learning to generate predicted residue‐residue distance and orientation distributions from which 3D models are built. We sought to improve the method by incorporating as inputs (in addition to sequence information) both language model embeddings and template information weighted by sequence similarity to the target. We also developed a refinement pipeline that recombines models generated by template‐free and template utilizing versions of trRosetta guided by the DeepAccNet accuracy predictor. Both benchmark tests and CASP results show that the new pipeline is a considerable improvement over the original trRosetta, and it is faster and requires less computing resources, completing the entire modeling process in a median < 3 h in CASP14. Our human group improved results with this pipeline primarily by identifying additional homologous sequences for input into the network. We also used the DeepAccNet accuracy predictor to guide Rosetta high‐resolution refinement for submissions in the regular and refinement categories; although performance was quite good on a CASP relative scale, the overall improvements were rather modest in part due to missing inter‐domain or inter‐chain contacts.
format	Online Article Text
id	pubmed-8616808
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	John Wiley & Sons, Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-86168082022-10-14 Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14 Anishchenko, Ivan Baek, Minkyung Park, Hahnbeom Hiranuma, Naozumi Kim, David E. Dauparas, Justas Mansoor, Sanaa Humphreys, Ian R. Baker, David Proteins Research Articles The trRosetta structure prediction method employs deep learning to generate predicted residue‐residue distance and orientation distributions from which 3D models are built. We sought to improve the method by incorporating as inputs (in addition to sequence information) both language model embeddings and template information weighted by sequence similarity to the target. We also developed a refinement pipeline that recombines models generated by template‐free and template utilizing versions of trRosetta guided by the DeepAccNet accuracy predictor. Both benchmark tests and CASP results show that the new pipeline is a considerable improvement over the original trRosetta, and it is faster and requires less computing resources, completing the entire modeling process in a median < 3 h in CASP14. Our human group improved results with this pipeline primarily by identifying additional homologous sequences for input into the network. We also used the DeepAccNet accuracy predictor to guide Rosetta high‐resolution refinement for submissions in the regular and refinement categories; although performance was quite good on a CASP relative scale, the overall improvements were rather modest in part due to missing inter‐domain or inter‐chain contacts. John Wiley & Sons, Inc. 2021-08-17 2021-12 /pmc/articles/PMC8616808/ /pubmed/34331359 http://dx.doi.org/10.1002/prot.26194 Text en © 2021 The Authors. Proteins: Structure, Function, and Bioinformatics published by Wiley Periodicals LLC. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle	Research Articles Anishchenko, Ivan Baek, Minkyung Park, Hahnbeom Hiranuma, Naozumi Kim, David E. Dauparas, Justas Mansoor, Sanaa Humphreys, Ian R. Baker, David Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14
title	Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14
title_full	Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14
title_fullStr	Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14
title_full_unstemmed	Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14
title_short	Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14
title_sort	protein tertiary structure prediction and refinement using deep learning and rosetta in casp14
topic	Research Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8616808/ https://www.ncbi.nlm.nih.gov/pubmed/34331359 http://dx.doi.org/10.1002/prot.26194
work_keys_str_mv	AT anishchenkoivan proteintertiarystructurepredictionandrefinementusingdeeplearningandrosettaincasp14 AT baekminkyung proteintertiarystructurepredictionandrefinementusingdeeplearningandrosettaincasp14 AT parkhahnbeom proteintertiarystructurepredictionandrefinementusingdeeplearningandrosettaincasp14 AT hiranumanaozumi proteintertiarystructurepredictionandrefinementusingdeeplearningandrosettaincasp14 AT kimdavide proteintertiarystructurepredictionandrefinementusingdeeplearningandrosettaincasp14 AT dauparasjustas proteintertiarystructurepredictionandrefinementusingdeeplearningandrosettaincasp14 AT mansoorsanaa proteintertiarystructurepredictionandrefinementusingdeeplearningandrosettaincasp14 AT humphreysianr proteintertiarystructurepredictionandrefinementusingdeeplearningandrosettaincasp14 AT bakerdavid proteintertiarystructurepredictionandrefinementusingdeeplearningandrosettaincasp14

Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14

Ejemplares similares