Cargando…

Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana

BACKGROUND: Many studies have correlated characteristics of amino acids with crystallization propensity, as part of the effort to determine the factors that affect the propensity of protein crystallization. However, these characteristics are constant; that is, the encoded amino acid sequences have t...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Shaomin, Wu, Guang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4657326/
https://www.ncbi.nlm.nih.gov/pubmed/26604856
http://dx.doi.org/10.1186/s12575-015-0029-3
_version_ 1782402376932524032
author Yan, Shaomin
Wu, Guang
author_facet Yan, Shaomin
Wu, Guang
author_sort Yan, Shaomin
collection PubMed
description BACKGROUND: Many studies have correlated characteristics of amino acids with crystallization propensity, as part of the effort to determine the factors that affect the propensity of protein crystallization. However, these characteristics are constant; that is, the encoded amino acid sequences have the same value for each type of amino acid. To overcome this inflexibility, three dynamic characteristics of amino acids and protein were introduced to analyze the crystallization propensity of proteins. Both logistic regression and neural network models were used to correlate each of two dynamic characteristics with the crystallization propensity of 301 proteins from Arabidopsis thaliana, and their results were compared with those obtained from each of 531 constant amino acid characteristics, which served as the benchmark. RESULTS: The neural network model was more powerful for predicting the crystallization propensity of proteins than the logistic regression model. Compared with the benchmark, the dynamic characteristics of amino acids provided good prediction results for the crystallization propensity, and the distribution probability gave the highest sensitivity. Using 90 % accuracy as a cutoff point, the predictable portion of A. thaliana portions was ranked, and the statistical analysis showed that the larger the predictable portion, the better the prediction. CONCLUSIONS: These results demonstrate that dynamic characteristics have a certain relationship with the crystallization propensity, and they could be helpful for the prediction of protein crystallization, which may provide a theoretical concept for certain proteins before conducting experimental crystallization. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12575-015-0029-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4657326
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46573262015-11-25 Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana Yan, Shaomin Wu, Guang Biol Proced Online Research BACKGROUND: Many studies have correlated characteristics of amino acids with crystallization propensity, as part of the effort to determine the factors that affect the propensity of protein crystallization. However, these characteristics are constant; that is, the encoded amino acid sequences have the same value for each type of amino acid. To overcome this inflexibility, three dynamic characteristics of amino acids and protein were introduced to analyze the crystallization propensity of proteins. Both logistic regression and neural network models were used to correlate each of two dynamic characteristics with the crystallization propensity of 301 proteins from Arabidopsis thaliana, and their results were compared with those obtained from each of 531 constant amino acid characteristics, which served as the benchmark. RESULTS: The neural network model was more powerful for predicting the crystallization propensity of proteins than the logistic regression model. Compared with the benchmark, the dynamic characteristics of amino acids provided good prediction results for the crystallization propensity, and the distribution probability gave the highest sensitivity. Using 90 % accuracy as a cutoff point, the predictable portion of A. thaliana portions was ranked, and the statistical analysis showed that the larger the predictable portion, the better the prediction. CONCLUSIONS: These results demonstrate that dynamic characteristics have a certain relationship with the crystallization propensity, and they could be helpful for the prediction of protein crystallization, which may provide a theoretical concept for certain proteins before conducting experimental crystallization. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12575-015-0029-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-11-23 /pmc/articles/PMC4657326/ /pubmed/26604856 http://dx.doi.org/10.1186/s12575-015-0029-3 Text en © Yan and Wu. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Yan, Shaomin
Wu, Guang
Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana
title Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana
title_full Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana
title_fullStr Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana
title_full_unstemmed Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana
title_short Predicting Crystallization Propensity of Proteins from Arabidopsis Thaliana
title_sort predicting crystallization propensity of proteins from arabidopsis thaliana
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4657326/
https://www.ncbi.nlm.nih.gov/pubmed/26604856
http://dx.doi.org/10.1186/s12575-015-0029-3
work_keys_str_mv AT yanshaomin predictingcrystallizationpropensityofproteinsfromarabidopsisthaliana
AT wuguang predictingcrystallizationpropensityofproteinsfromarabidopsisthaliana