Cargando…

Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes

The engineering of thermostable enzymes is receiving increased attention. The paper, detergent, and biofuel industries, in particular, seek to use environmentally friendly enzymes instead of toxic chlorine chemicals. Enzymes typically function at temperatures below 60°C and denature if exposed to hi...

Descripción completa

Detalles Bibliográficos
Autores principales: Ebrahimi, Mansour, Lakizadeh, Amir, Agha-Golzadeh, Parisa, Ebrahimie, Esmaeil, Ebrahimi, Mahdi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3154288/
https://www.ncbi.nlm.nih.gov/pubmed/21853079
http://dx.doi.org/10.1371/journal.pone.0023146
_version_ 1782210002970214400
author Ebrahimi, Mansour
Lakizadeh, Amir
Agha-Golzadeh, Parisa
Ebrahimie, Esmaeil
Ebrahimi, Mahdi
author_facet Ebrahimi, Mansour
Lakizadeh, Amir
Agha-Golzadeh, Parisa
Ebrahimie, Esmaeil
Ebrahimi, Mahdi
author_sort Ebrahimi, Mansour
collection PubMed
description The engineering of thermostable enzymes is receiving increased attention. The paper, detergent, and biofuel industries, in particular, seek to use environmentally friendly enzymes instead of toxic chlorine chemicals. Enzymes typically function at temperatures below 60°C and denature if exposed to higher temperatures. In contrast, a small portion of enzymes can withstand higher temperatures as a result of various structural adaptations. Understanding the protein attributes that are involved in this adaptation is the first step toward engineering thermostable enzymes. We employed various supervised and unsupervised machine learning algorithms as well as attribute weighting approaches to find amino acid composition attributes that contribute to enzyme thermostability. Specifically, we compared two groups of enzymes: mesostable and thermostable enzymes. Furthermore, a combination of attribute weighting with supervised and unsupervised clustering algorithms was used for prediction and modelling of protein thermostability from amino acid composition properties. Mining a large number of protein sequences (2090) through a variety of machine learning algorithms, which were based on the analysis of more than 800 amino acid attributes, increased the accuracy of this study. Moreover, these models were successful in predicting thermostability from the primary structure of proteins. The results showed that expectation maximization clustering in combination with uncertainly and correlation attribute weighting algorithms can effectively (100%) classify thermostable and mesostable proteins. Seventy per cent of the weighting methods selected Gln content and frequency of hydrophilic residues as the most important protein attributes. On the dipeptide level, the frequency of Asn-Glu was the key factor in distinguishing mesostable from thermostable enzymes. This study demonstrates the feasibility of predicting thermostability irrespective of sequence similarity and will serve as a basis for engineering thermostable enzymes in the laboratory.
format Online
Article
Text
id pubmed-3154288
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31542882011-08-18 Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes Ebrahimi, Mansour Lakizadeh, Amir Agha-Golzadeh, Parisa Ebrahimie, Esmaeil Ebrahimi, Mahdi PLoS One Research Article The engineering of thermostable enzymes is receiving increased attention. The paper, detergent, and biofuel industries, in particular, seek to use environmentally friendly enzymes instead of toxic chlorine chemicals. Enzymes typically function at temperatures below 60°C and denature if exposed to higher temperatures. In contrast, a small portion of enzymes can withstand higher temperatures as a result of various structural adaptations. Understanding the protein attributes that are involved in this adaptation is the first step toward engineering thermostable enzymes. We employed various supervised and unsupervised machine learning algorithms as well as attribute weighting approaches to find amino acid composition attributes that contribute to enzyme thermostability. Specifically, we compared two groups of enzymes: mesostable and thermostable enzymes. Furthermore, a combination of attribute weighting with supervised and unsupervised clustering algorithms was used for prediction and modelling of protein thermostability from amino acid composition properties. Mining a large number of protein sequences (2090) through a variety of machine learning algorithms, which were based on the analysis of more than 800 amino acid attributes, increased the accuracy of this study. Moreover, these models were successful in predicting thermostability from the primary structure of proteins. The results showed that expectation maximization clustering in combination with uncertainly and correlation attribute weighting algorithms can effectively (100%) classify thermostable and mesostable proteins. Seventy per cent of the weighting methods selected Gln content and frequency of hydrophilic residues as the most important protein attributes. On the dipeptide level, the frequency of Asn-Glu was the key factor in distinguishing mesostable from thermostable enzymes. This study demonstrates the feasibility of predicting thermostability irrespective of sequence similarity and will serve as a basis for engineering thermostable enzymes in the laboratory. Public Library of Science 2011-08-10 /pmc/articles/PMC3154288/ /pubmed/21853079 http://dx.doi.org/10.1371/journal.pone.0023146 Text en Ebrahimi et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ebrahimi, Mansour
Lakizadeh, Amir
Agha-Golzadeh, Parisa
Ebrahimie, Esmaeil
Ebrahimi, Mahdi
Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes
title Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes
title_full Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes
title_fullStr Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes
title_full_unstemmed Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes
title_short Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes
title_sort prediction of thermostability from amino acid attributes by combination of clustering with attribute weighting: a new vista in engineering enzymes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3154288/
https://www.ncbi.nlm.nih.gov/pubmed/21853079
http://dx.doi.org/10.1371/journal.pone.0023146
work_keys_str_mv AT ebrahimimansour predictionofthermostabilityfromaminoacidattributesbycombinationofclusteringwithattributeweightinganewvistainengineeringenzymes
AT lakizadehamir predictionofthermostabilityfromaminoacidattributesbycombinationofclusteringwithattributeweightinganewvistainengineeringenzymes
AT aghagolzadehparisa predictionofthermostabilityfromaminoacidattributesbycombinationofclusteringwithattributeweightinganewvistainengineeringenzymes
AT ebrahimieesmaeil predictionofthermostabilityfromaminoacidattributesbycombinationofclusteringwithattributeweightinganewvistainengineeringenzymes
AT ebrahimimahdi predictionofthermostabilityfromaminoacidattributesbycombinationofclusteringwithattributeweightinganewvistainengineeringenzymes