Cargando…

Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction

MOTIVATION: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotransl...

Descripción completa

Detalles Bibliográficos
Autores principales: de Oliveira, Saulo H P, Law, Eleanor C, Shi, Jiye, Deane, Charlotte M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030820/
https://www.ncbi.nlm.nih.gov/pubmed/29136098
http://dx.doi.org/10.1093/bioinformatics/btx722
_version_ 1783337201322426368
author de Oliveira, Saulo H P
Law, Eleanor C
Shi, Jiye
Deane, Charlotte M
author_facet de Oliveira, Saulo H P
Law, Eleanor C
Shi, Jiye
Deane, Charlotte M
author_sort de Oliveira, Saulo H P
collection PubMed
description MOTIVATION: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally. RESULTS: We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5–2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy. AVAILABILITY AND IMPLEMENTATION: Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6030820
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60308202018-07-10 Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction de Oliveira, Saulo H P Law, Eleanor C Shi, Jiye Deane, Charlotte M Bioinformatics Original Papers MOTIVATION: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally. RESULTS: We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5–2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy. AVAILABILITY AND IMPLEMENTATION: Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-04-01 2017-11-09 /pmc/articles/PMC6030820/ /pubmed/29136098 http://dx.doi.org/10.1093/bioinformatics/btx722 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
de Oliveira, Saulo H P
Law, Eleanor C
Shi, Jiye
Deane, Charlotte M
Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_full Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_fullStr Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_full_unstemmed Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_short Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_sort sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030820/
https://www.ncbi.nlm.nih.gov/pubmed/29136098
http://dx.doi.org/10.1093/bioinformatics/btx722
work_keys_str_mv AT deoliveirasaulohp sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction
AT laweleanorc sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction
AT shijiye sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction
AT deanecharlottem sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction