Cargando…

Large-scale structure prediction by improved contact predictions and model quality assessment

MOTIVATION: Accurate contact predictions can be used for predicting the structure of proteins. Until recently these methods were limited to very big protein families, decreasing their utility. However, recent progress by combining direct coupling analysis with machine learning methods has made it po...

Descripción completa

Detalles Bibliográficos
Autores principales: Michel, Mirco, Menéndez Hurtado, David, Uziela, Karolis, Elofsson, Arne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870574/
https://www.ncbi.nlm.nih.gov/pubmed/28881974
http://dx.doi.org/10.1093/bioinformatics/btx239
_version_ 1783309512517615616
author Michel, Mirco
Menéndez Hurtado, David
Uziela, Karolis
Elofsson, Arne
author_facet Michel, Mirco
Menéndez Hurtado, David
Uziela, Karolis
Elofsson, Arne
author_sort Michel, Mirco
collection PubMed
description MOTIVATION: Accurate contact predictions can be used for predicting the structure of proteins. Until recently these methods were limited to very big protein families, decreasing their utility. However, recent progress by combining direct coupling analysis with machine learning methods has made it possible to predict accurate contact maps for smaller families. To what extent these predictions can be used to produce accurate models of the families is not known. RESULTS: We present the PconsFold2 pipeline that uses contact predictions from PconsC3, the CONFOLD folding algorithm and model quality estimations to predict the structure of a protein. We show that the model quality estimation significantly increases the number of models that reliably can be identified. Finally, we apply PconsFold2 to 6379 Pfam families of unknown structure and find that PconsFold2 can, with an estimated 90% specificity, predict the structure of up to 558 Pfam families of unknown structure. Out of these, 415 have not been reported before. AVAILABILITY AND IMPLEMENTATION: Datasets as well as models of all the 558 Pfam families are available at http://c3.pcons.net/. All programs used here are freely available.
format Online
Article
Text
id pubmed-5870574
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58705742018-04-05 Large-scale structure prediction by improved contact predictions and model quality assessment Michel, Mirco Menéndez Hurtado, David Uziela, Karolis Elofsson, Arne Bioinformatics Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017 MOTIVATION: Accurate contact predictions can be used for predicting the structure of proteins. Until recently these methods were limited to very big protein families, decreasing their utility. However, recent progress by combining direct coupling analysis with machine learning methods has made it possible to predict accurate contact maps for smaller families. To what extent these predictions can be used to produce accurate models of the families is not known. RESULTS: We present the PconsFold2 pipeline that uses contact predictions from PconsC3, the CONFOLD folding algorithm and model quality estimations to predict the structure of a protein. We show that the model quality estimation significantly increases the number of models that reliably can be identified. Finally, we apply PconsFold2 to 6379 Pfam families of unknown structure and find that PconsFold2 can, with an estimated 90% specificity, predict the structure of up to 558 Pfam families of unknown structure. Out of these, 415 have not been reported before. AVAILABILITY AND IMPLEMENTATION: Datasets as well as models of all the 558 Pfam families are available at http://c3.pcons.net/. All programs used here are freely available. Oxford University Press 2017-07-15 2017-07-12 /pmc/articles/PMC5870574/ /pubmed/28881974 http://dx.doi.org/10.1093/bioinformatics/btx239 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017
Michel, Mirco
Menéndez Hurtado, David
Uziela, Karolis
Elofsson, Arne
Large-scale structure prediction by improved contact predictions and model quality assessment
title Large-scale structure prediction by improved contact predictions and model quality assessment
title_full Large-scale structure prediction by improved contact predictions and model quality assessment
title_fullStr Large-scale structure prediction by improved contact predictions and model quality assessment
title_full_unstemmed Large-scale structure prediction by improved contact predictions and model quality assessment
title_short Large-scale structure prediction by improved contact predictions and model quality assessment
title_sort large-scale structure prediction by improved contact predictions and model quality assessment
topic Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870574/
https://www.ncbi.nlm.nih.gov/pubmed/28881974
http://dx.doi.org/10.1093/bioinformatics/btx239
work_keys_str_mv AT michelmirco largescalestructurepredictionbyimprovedcontactpredictionsandmodelqualityassessment
AT menendezhurtadodavid largescalestructurepredictionbyimprovedcontactpredictionsandmodelqualityassessment
AT uzielakarolis largescalestructurepredictionbyimprovedcontactpredictionsandmodelqualityassessment
AT elofssonarne largescalestructurepredictionbyimprovedcontactpredictionsandmodelqualityassessment