Cargando…
Large-scale structure prediction by improved contact predictions and model quality assessment
MOTIVATION: Accurate contact predictions can be used for predicting the structure of proteins. Until recently these methods were limited to very big protein families, decreasing their utility. However, recent progress by combining direct coupling analysis with machine learning methods has made it po...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870574/ https://www.ncbi.nlm.nih.gov/pubmed/28881974 http://dx.doi.org/10.1093/bioinformatics/btx239 |
_version_ | 1783309512517615616 |
---|---|
author | Michel, Mirco Menéndez Hurtado, David Uziela, Karolis Elofsson, Arne |
author_facet | Michel, Mirco Menéndez Hurtado, David Uziela, Karolis Elofsson, Arne |
author_sort | Michel, Mirco |
collection | PubMed |
description | MOTIVATION: Accurate contact predictions can be used for predicting the structure of proteins. Until recently these methods were limited to very big protein families, decreasing their utility. However, recent progress by combining direct coupling analysis with machine learning methods has made it possible to predict accurate contact maps for smaller families. To what extent these predictions can be used to produce accurate models of the families is not known. RESULTS: We present the PconsFold2 pipeline that uses contact predictions from PconsC3, the CONFOLD folding algorithm and model quality estimations to predict the structure of a protein. We show that the model quality estimation significantly increases the number of models that reliably can be identified. Finally, we apply PconsFold2 to 6379 Pfam families of unknown structure and find that PconsFold2 can, with an estimated 90% specificity, predict the structure of up to 558 Pfam families of unknown structure. Out of these, 415 have not been reported before. AVAILABILITY AND IMPLEMENTATION: Datasets as well as models of all the 558 Pfam families are available at http://c3.pcons.net/. All programs used here are freely available. |
format | Online Article Text |
id | pubmed-5870574 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58705742018-04-05 Large-scale structure prediction by improved contact predictions and model quality assessment Michel, Mirco Menéndez Hurtado, David Uziela, Karolis Elofsson, Arne Bioinformatics Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017 MOTIVATION: Accurate contact predictions can be used for predicting the structure of proteins. Until recently these methods were limited to very big protein families, decreasing their utility. However, recent progress by combining direct coupling analysis with machine learning methods has made it possible to predict accurate contact maps for smaller families. To what extent these predictions can be used to produce accurate models of the families is not known. RESULTS: We present the PconsFold2 pipeline that uses contact predictions from PconsC3, the CONFOLD folding algorithm and model quality estimations to predict the structure of a protein. We show that the model quality estimation significantly increases the number of models that reliably can be identified. Finally, we apply PconsFold2 to 6379 Pfam families of unknown structure and find that PconsFold2 can, with an estimated 90% specificity, predict the structure of up to 558 Pfam families of unknown structure. Out of these, 415 have not been reported before. AVAILABILITY AND IMPLEMENTATION: Datasets as well as models of all the 558 Pfam families are available at http://c3.pcons.net/. All programs used here are freely available. Oxford University Press 2017-07-15 2017-07-12 /pmc/articles/PMC5870574/ /pubmed/28881974 http://dx.doi.org/10.1093/bioinformatics/btx239 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017 Michel, Mirco Menéndez Hurtado, David Uziela, Karolis Elofsson, Arne Large-scale structure prediction by improved contact predictions and model quality assessment |
title | Large-scale structure prediction by improved contact predictions and model quality assessment |
title_full | Large-scale structure prediction by improved contact predictions and model quality assessment |
title_fullStr | Large-scale structure prediction by improved contact predictions and model quality assessment |
title_full_unstemmed | Large-scale structure prediction by improved contact predictions and model quality assessment |
title_short | Large-scale structure prediction by improved contact predictions and model quality assessment |
title_sort | large-scale structure prediction by improved contact predictions and model quality assessment |
topic | Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017 |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870574/ https://www.ncbi.nlm.nih.gov/pubmed/28881974 http://dx.doi.org/10.1093/bioinformatics/btx239 |
work_keys_str_mv | AT michelmirco largescalestructurepredictionbyimprovedcontactpredictionsandmodelqualityassessment AT menendezhurtadodavid largescalestructurepredictionbyimprovedcontactpredictionsandmodelqualityassessment AT uzielakarolis largescalestructurepredictionbyimprovedcontactpredictionsandmodelqualityassessment AT elofssonarne largescalestructurepredictionbyimprovedcontactpredictionsandmodelqualityassessment |