Cargando…

fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization

BACKGROUND: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination pipeline...

Descripción completa

Detalles Bibliográficos
Autores principales: Meng, Fanchi, Wang, Chen, Kurgan, Lukasz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6389161/
https://www.ncbi.nlm.nih.gov/pubmed/29295714
http://dx.doi.org/10.1186/s12859-017-1995-z
_version_ 1783397901979877376
author Meng, Fanchi
Wang, Chen
Kurgan, Lukasz
author_facet Meng, Fanchi
Wang, Chen
Kurgan, Lukasz
author_sort Meng, Fanchi
collection PubMed
description BACKGROUND: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination pipelines were released in recent years. The predictive performance of the current methods is modest. This is because the only input that they use is the protein sequence and since the experimental annotations of these data might be inconsistent given that they were collected across many laboratories and centers. However, even these modest levels of predictive quality are still practical compared to the reported low success rates of crystallization, which are below 10%. We focus on another important aspect related to a high computational cost of running the predictors that offer the expanded scope. RESULTS: We introduce a novel fDETECT webserver that provides very fast and modestly accurate predictions of the success of protein production, purification, crystallization, and structure determination. Empirical tests on two datasets demonstrate that fDETECT is more accurate than the only other similarly fast method, and similarly accurate and three orders of magnitude faster than the currently most accurate predictors. Our method predicts a single protein in about 120 milliseconds and needs less than an hour to generate the four predictions for an entire human proteome. Moreover, we empirically show that fDETECT secures similar levels of predictive performance when compared with four representative methods that only predict success of crystallization, while it also provides the other three predictions. A webserver that implements fDETECT is available at http://biomine.cs.vcu.edu/servers/fDETECT/. CONCLUSIONS: fDETECT is a computational tool that supports target selection for protein production and X-ray crystallography-based structure determination. It offers predictive quality that matches or exceeds other state-of-the-art tools and is especially suitable for the analysis of large protein sets.
format Online
Article
Text
id pubmed-6389161
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63891612019-03-19 fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization Meng, Fanchi Wang, Chen Kurgan, Lukasz BMC Bioinformatics Methodology Article BACKGROUND: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination pipelines were released in recent years. The predictive performance of the current methods is modest. This is because the only input that they use is the protein sequence and since the experimental annotations of these data might be inconsistent given that they were collected across many laboratories and centers. However, even these modest levels of predictive quality are still practical compared to the reported low success rates of crystallization, which are below 10%. We focus on another important aspect related to a high computational cost of running the predictors that offer the expanded scope. RESULTS: We introduce a novel fDETECT webserver that provides very fast and modestly accurate predictions of the success of protein production, purification, crystallization, and structure determination. Empirical tests on two datasets demonstrate that fDETECT is more accurate than the only other similarly fast method, and similarly accurate and three orders of magnitude faster than the currently most accurate predictors. Our method predicts a single protein in about 120 milliseconds and needs less than an hour to generate the four predictions for an entire human proteome. Moreover, we empirically show that fDETECT secures similar levels of predictive performance when compared with four representative methods that only predict success of crystallization, while it also provides the other three predictions. A webserver that implements fDETECT is available at http://biomine.cs.vcu.edu/servers/fDETECT/. CONCLUSIONS: fDETECT is a computational tool that supports target selection for protein production and X-ray crystallography-based structure determination. It offers predictive quality that matches or exceeds other state-of-the-art tools and is especially suitable for the analysis of large protein sets. BioMed Central 2018-01-03 /pmc/articles/PMC6389161/ /pubmed/29295714 http://dx.doi.org/10.1186/s12859-017-1995-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Meng, Fanchi
Wang, Chen
Kurgan, Lukasz
fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization
title fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization
title_full fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization
title_fullStr fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization
title_full_unstemmed fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization
title_short fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization
title_sort fdetect webserver: fast predictor of propensity for protein production, purification, and crystallization
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6389161/
https://www.ncbi.nlm.nih.gov/pubmed/29295714
http://dx.doi.org/10.1186/s12859-017-1995-z
work_keys_str_mv AT mengfanchi fdetectwebserverfastpredictorofpropensityforproteinproductionpurificationandcrystallization
AT wangchen fdetectwebserverfastpredictorofpropensityforproteinproductionpurificationandcrystallization
AT kurganlukasz fdetectwebserverfastpredictorofpropensityforproteinproductionpurificationandcrystallization