Cargando…
fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization
BACKGROUND: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination pipeline...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6389161/ https://www.ncbi.nlm.nih.gov/pubmed/29295714 http://dx.doi.org/10.1186/s12859-017-1995-z |
_version_ | 1783397901979877376 |
---|---|
author | Meng, Fanchi Wang, Chen Kurgan, Lukasz |
author_facet | Meng, Fanchi Wang, Chen Kurgan, Lukasz |
author_sort | Meng, Fanchi |
collection | PubMed |
description | BACKGROUND: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination pipelines were released in recent years. The predictive performance of the current methods is modest. This is because the only input that they use is the protein sequence and since the experimental annotations of these data might be inconsistent given that they were collected across many laboratories and centers. However, even these modest levels of predictive quality are still practical compared to the reported low success rates of crystallization, which are below 10%. We focus on another important aspect related to a high computational cost of running the predictors that offer the expanded scope. RESULTS: We introduce a novel fDETECT webserver that provides very fast and modestly accurate predictions of the success of protein production, purification, crystallization, and structure determination. Empirical tests on two datasets demonstrate that fDETECT is more accurate than the only other similarly fast method, and similarly accurate and three orders of magnitude faster than the currently most accurate predictors. Our method predicts a single protein in about 120 milliseconds and needs less than an hour to generate the four predictions for an entire human proteome. Moreover, we empirically show that fDETECT secures similar levels of predictive performance when compared with four representative methods that only predict success of crystallization, while it also provides the other three predictions. A webserver that implements fDETECT is available at http://biomine.cs.vcu.edu/servers/fDETECT/. CONCLUSIONS: fDETECT is a computational tool that supports target selection for protein production and X-ray crystallography-based structure determination. It offers predictive quality that matches or exceeds other state-of-the-art tools and is especially suitable for the analysis of large protein sets. |
format | Online Article Text |
id | pubmed-6389161 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63891612019-03-19 fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization Meng, Fanchi Wang, Chen Kurgan, Lukasz BMC Bioinformatics Methodology Article BACKGROUND: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination pipelines were released in recent years. The predictive performance of the current methods is modest. This is because the only input that they use is the protein sequence and since the experimental annotations of these data might be inconsistent given that they were collected across many laboratories and centers. However, even these modest levels of predictive quality are still practical compared to the reported low success rates of crystallization, which are below 10%. We focus on another important aspect related to a high computational cost of running the predictors that offer the expanded scope. RESULTS: We introduce a novel fDETECT webserver that provides very fast and modestly accurate predictions of the success of protein production, purification, crystallization, and structure determination. Empirical tests on two datasets demonstrate that fDETECT is more accurate than the only other similarly fast method, and similarly accurate and three orders of magnitude faster than the currently most accurate predictors. Our method predicts a single protein in about 120 milliseconds and needs less than an hour to generate the four predictions for an entire human proteome. Moreover, we empirically show that fDETECT secures similar levels of predictive performance when compared with four representative methods that only predict success of crystallization, while it also provides the other three predictions. A webserver that implements fDETECT is available at http://biomine.cs.vcu.edu/servers/fDETECT/. CONCLUSIONS: fDETECT is a computational tool that supports target selection for protein production and X-ray crystallography-based structure determination. It offers predictive quality that matches or exceeds other state-of-the-art tools and is especially suitable for the analysis of large protein sets. BioMed Central 2018-01-03 /pmc/articles/PMC6389161/ /pubmed/29295714 http://dx.doi.org/10.1186/s12859-017-1995-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Meng, Fanchi Wang, Chen Kurgan, Lukasz fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization |
title | fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization |
title_full | fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization |
title_fullStr | fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization |
title_full_unstemmed | fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization |
title_short | fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization |
title_sort | fdetect webserver: fast predictor of propensity for protein production, purification, and crystallization |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6389161/ https://www.ncbi.nlm.nih.gov/pubmed/29295714 http://dx.doi.org/10.1186/s12859-017-1995-z |
work_keys_str_mv | AT mengfanchi fdetectwebserverfastpredictorofpropensityforproteinproductionpurificationandcrystallization AT wangchen fdetectwebserverfastpredictorofpropensityforproteinproductionpurificationandcrystallization AT kurganlukasz fdetectwebserverfastpredictorofpropensityforproteinproductionpurificationandcrystallization |