Cargando…
Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
MOTIVATION: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotransl...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030820/ https://www.ncbi.nlm.nih.gov/pubmed/29136098 http://dx.doi.org/10.1093/bioinformatics/btx722 |
_version_ | 1783337201322426368 |
---|---|
author | de Oliveira, Saulo H P Law, Eleanor C Shi, Jiye Deane, Charlotte M |
author_facet | de Oliveira, Saulo H P Law, Eleanor C Shi, Jiye Deane, Charlotte M |
author_sort | de Oliveira, Saulo H P |
collection | PubMed |
description | MOTIVATION: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally. RESULTS: We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5–2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy. AVAILABILITY AND IMPLEMENTATION: Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6030820 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-60308202018-07-10 Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction de Oliveira, Saulo H P Law, Eleanor C Shi, Jiye Deane, Charlotte M Bioinformatics Original Papers MOTIVATION: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally. RESULTS: We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5–2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy. AVAILABILITY AND IMPLEMENTATION: Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-04-01 2017-11-09 /pmc/articles/PMC6030820/ /pubmed/29136098 http://dx.doi.org/10.1093/bioinformatics/btx722 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers de Oliveira, Saulo H P Law, Eleanor C Shi, Jiye Deane, Charlotte M Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction |
title | Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction |
title_full | Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction |
title_fullStr | Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction |
title_full_unstemmed | Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction |
title_short | Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction |
title_sort | sequential search leads to faster, more efficient fragment-based de novo protein structure prediction |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030820/ https://www.ncbi.nlm.nih.gov/pubmed/29136098 http://dx.doi.org/10.1093/bioinformatics/btx722 |
work_keys_str_mv | AT deoliveirasaulohp sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction AT laweleanorc sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction AT shijiye sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction AT deanecharlottem sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction |