Cargando…

LacSubPred: predicting subtypes of Laccases, an important lignin metabolism-related enzyme class, using in silico approaches

BACKGROUND: Laccases (E.C. 1.10.3.2) are multi-copper oxidases that have gained importance in many industries such as biofuels, pulp production, textile dye bleaching, bioremediation, and food production. Their usefulness stems from the ability to act on a diverse range of phenolic compounds such as...

Descripción completa

Detalles Bibliográficos
Autores principales: Weirick, Tyler, Sahu, Sitanshu S, Mahalingam, Ramamurthy, Kaundal, Rakesh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4251044/
https://www.ncbi.nlm.nih.gov/pubmed/25350584
http://dx.doi.org/10.1186/1471-2105-15-S11-S15
_version_ 1782346993564123136
author Weirick, Tyler
Sahu, Sitanshu S
Mahalingam, Ramamurthy
Kaundal, Rakesh
author_facet Weirick, Tyler
Sahu, Sitanshu S
Mahalingam, Ramamurthy
Kaundal, Rakesh
author_sort Weirick, Tyler
collection PubMed
description BACKGROUND: Laccases (E.C. 1.10.3.2) are multi-copper oxidases that have gained importance in many industries such as biofuels, pulp production, textile dye bleaching, bioremediation, and food production. Their usefulness stems from the ability to act on a diverse range of phenolic compounds such as o-/p-quinols, aminophenols, polyphenols, polyamines, aryl diamines, and aromatic thiols. Despite acting on a wide range of compounds as a family, individual Laccases often exhibit distinctive and varied substrate ranges. This is likely due to Laccases involvement in many metabolic roles across diverse taxa. Classification systems for multi-copper oxidases have been developed using multiple sequence alignments, however, these systems seem to largely follow species taxonomy rather than substrate ranges, enzyme properties, or specific function. It has been suggested that the roles and substrates of various Laccases are related to their optimal pH. This is consistent with the observation that fungal Laccases usually prefer acidic conditions, whereas plant and bacterial Laccases prefer basic conditions. Based on these observations, we hypothesize that a descriptor-based unsupervised learning system could generate homology independent classification system for better describing the functional properties of Laccases. RESULTS: In this study, we first utilized unsupervised learning approach to develop a novel homology independent Laccase classification system. From the descriptors considered, physicochemical properties showed the best performance. Physicochemical properties divided the Laccases into twelve subtypes. Analysis of the clusters using a t-test revealed that the majority of the physicochemical descriptors had statistically significant differences between the classes. Feature selection identified the most important features as negatively charges residues, the peptide isoelectric point, and acidic or amidic residues. Secondly, to allow for classification of new Laccases, a supervised learning system was developed from the clusters. The models showed high performance with an overall accuracy of 99.03%, error of 0.49%, MCC of 0.9367, precision of 94.20%, sensitivity of 94.20%, and specificity of 99.47% in a 5-fold cross-validation test. In an independent test, our models still provide a high accuracy of 97.98%, error rate of 1.02%, MCC of 0.8678, precision of 87.88%, sensitivity of 87.88% and specificity of 98.90%. CONCLUSION: This study provides a useful classification system for better understanding of Laccases from their physicochemical properties perspective. We also developed a publically available web tool for the characterization of Laccase protein sequences (http://lacsubpred.bioinfo.ucr.edu/). Finally, the programs used in the study are made available for researchers interested in applying the system to other enzyme classes (https://github.com/tweirick/SubClPred).
format Online
Article
Text
id pubmed-4251044
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42510442014-12-02 LacSubPred: predicting subtypes of Laccases, an important lignin metabolism-related enzyme class, using in silico approaches Weirick, Tyler Sahu, Sitanshu S Mahalingam, Ramamurthy Kaundal, Rakesh BMC Bioinformatics Proceedings BACKGROUND: Laccases (E.C. 1.10.3.2) are multi-copper oxidases that have gained importance in many industries such as biofuels, pulp production, textile dye bleaching, bioremediation, and food production. Their usefulness stems from the ability to act on a diverse range of phenolic compounds such as o-/p-quinols, aminophenols, polyphenols, polyamines, aryl diamines, and aromatic thiols. Despite acting on a wide range of compounds as a family, individual Laccases often exhibit distinctive and varied substrate ranges. This is likely due to Laccases involvement in many metabolic roles across diverse taxa. Classification systems for multi-copper oxidases have been developed using multiple sequence alignments, however, these systems seem to largely follow species taxonomy rather than substrate ranges, enzyme properties, or specific function. It has been suggested that the roles and substrates of various Laccases are related to their optimal pH. This is consistent with the observation that fungal Laccases usually prefer acidic conditions, whereas plant and bacterial Laccases prefer basic conditions. Based on these observations, we hypothesize that a descriptor-based unsupervised learning system could generate homology independent classification system for better describing the functional properties of Laccases. RESULTS: In this study, we first utilized unsupervised learning approach to develop a novel homology independent Laccase classification system. From the descriptors considered, physicochemical properties showed the best performance. Physicochemical properties divided the Laccases into twelve subtypes. Analysis of the clusters using a t-test revealed that the majority of the physicochemical descriptors had statistically significant differences between the classes. Feature selection identified the most important features as negatively charges residues, the peptide isoelectric point, and acidic or amidic residues. Secondly, to allow for classification of new Laccases, a supervised learning system was developed from the clusters. The models showed high performance with an overall accuracy of 99.03%, error of 0.49%, MCC of 0.9367, precision of 94.20%, sensitivity of 94.20%, and specificity of 99.47% in a 5-fold cross-validation test. In an independent test, our models still provide a high accuracy of 97.98%, error rate of 1.02%, MCC of 0.8678, precision of 87.88%, sensitivity of 87.88% and specificity of 98.90%. CONCLUSION: This study provides a useful classification system for better understanding of Laccases from their physicochemical properties perspective. We also developed a publically available web tool for the characterization of Laccase protein sequences (http://lacsubpred.bioinfo.ucr.edu/). Finally, the programs used in the study are made available for researchers interested in applying the system to other enzyme classes (https://github.com/tweirick/SubClPred). BioMed Central 2014-10-21 /pmc/articles/PMC4251044/ /pubmed/25350584 http://dx.doi.org/10.1186/1471-2105-15-S11-S15 Text en Copyright © 2014 Weirick et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Weirick, Tyler
Sahu, Sitanshu S
Mahalingam, Ramamurthy
Kaundal, Rakesh
LacSubPred: predicting subtypes of Laccases, an important lignin metabolism-related enzyme class, using in silico approaches
title LacSubPred: predicting subtypes of Laccases, an important lignin metabolism-related enzyme class, using in silico approaches
title_full LacSubPred: predicting subtypes of Laccases, an important lignin metabolism-related enzyme class, using in silico approaches
title_fullStr LacSubPred: predicting subtypes of Laccases, an important lignin metabolism-related enzyme class, using in silico approaches
title_full_unstemmed LacSubPred: predicting subtypes of Laccases, an important lignin metabolism-related enzyme class, using in silico approaches
title_short LacSubPred: predicting subtypes of Laccases, an important lignin metabolism-related enzyme class, using in silico approaches
title_sort lacsubpred: predicting subtypes of laccases, an important lignin metabolism-related enzyme class, using in silico approaches
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4251044/
https://www.ncbi.nlm.nih.gov/pubmed/25350584
http://dx.doi.org/10.1186/1471-2105-15-S11-S15
work_keys_str_mv AT weiricktyler lacsubpredpredictingsubtypesoflaccasesanimportantligninmetabolismrelatedenzymeclassusinginsilicoapproaches
AT sahusitanshus lacsubpredpredictingsubtypesoflaccasesanimportantligninmetabolismrelatedenzymeclassusinginsilicoapproaches
AT mahalingamramamurthy lacsubpredpredictingsubtypesoflaccasesanimportantligninmetabolismrelatedenzymeclassusinginsilicoapproaches
AT kaundalrakesh lacsubpredpredictingsubtypesoflaccasesanimportantligninmetabolismrelatedenzymeclassusinginsilicoapproaches