Cargando…

Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction

MOTIVATION: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotransl...

Descripción completa

Detalles Bibliográficos
Autores principales:	de Oliveira, Saulo H P, Law, Eleanor C, Shi, Jiye, Deane, Charlotte M
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2018
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030820/ https://www.ncbi.nlm.nih.gov/pubmed/29136098 http://dx.doi.org/10.1093/bioinformatics/btx722

_version_	1783337201322426368
author	de Oliveira, Saulo H P Law, Eleanor C Shi, Jiye Deane, Charlotte M
author_facet	de Oliveira, Saulo H P Law, Eleanor C Shi, Jiye Deane, Charlotte M
author_sort	de Oliveira, Saulo H P
collection	PubMed
description	MOTIVATION: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally. RESULTS: We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5–2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy. AVAILABILITY AND IMPLEMENTATION: Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format	Online Article Text
id	pubmed-6030820
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-60308202018-07-10 Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction de Oliveira, Saulo H P Law, Eleanor C Shi, Jiye Deane, Charlotte M Bioinformatics Original Papers MOTIVATION: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally. RESULTS: We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5–2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy. AVAILABILITY AND IMPLEMENTATION: Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-04-01 2017-11-09 /pmc/articles/PMC6030820/ /pubmed/29136098 http://dx.doi.org/10.1093/bioinformatics/btx722 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Original Papers de Oliveira, Saulo H P Law, Eleanor C Shi, Jiye Deane, Charlotte M Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title	Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_full	Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_fullStr	Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_full_unstemmed	Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_short	Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
title_sort	sequential search leads to faster, more efficient fragment-based de novo protein structure prediction
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030820/ https://www.ncbi.nlm.nih.gov/pubmed/29136098 http://dx.doi.org/10.1093/bioinformatics/btx722
work_keys_str_mv	AT deoliveirasaulohp sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction AT laweleanorc sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction AT shijiye sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction AT deanecharlottem sequentialsearchleadstofastermoreefficientfragmentbaseddenovoproteinstructureprediction

Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction

Ejemplares similares