Cargando…

In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases

The glycoside hydrolase 9 superfamily, mainly comprising the endoglucanases, is represented in all three domains of life. The current division of GH9 enzymes, into three subclasses, namely A, B, and C, is centered on parameters derived from sequence information alone. However, this classification is...

Descripción completa

Detalles Bibliográficos
Autores principales: Kundu, Siddhartha, Sharma, Rita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4981690/
https://www.ncbi.nlm.nih.gov/pubmed/27570528
http://dx.doi.org/10.3389/fpls.2016.01185
_version_ 1782447644408283136
author Kundu, Siddhartha
Sharma, Rita
author_facet Kundu, Siddhartha
Sharma, Rita
author_sort Kundu, Siddhartha
collection PubMed
description The glycoside hydrolase 9 superfamily, mainly comprising the endoglucanases, is represented in all three domains of life. The current division of GH9 enzymes, into three subclasses, namely A, B, and C, is centered on parameters derived from sequence information alone. However, this classification is ambiguous, and is limited by the paralogous ancestry of classes B and C endoglucanases, and paucity of biochemical and structural data. Here, we extend this classification schema to putative GH9 endoglucanases present in green plants, with an emphasis on identifying novel members of the class C subset. These enzymes cleave the β(1 → 4) linkage between non-terminal adjacent D-glucopyranose residues, in both, amorphous and crystalline regions of cellulose. We utilized non redundant plant GH9 enzymes with characterized molecular data, as the training set to construct Hidden Markov Models (HMMs) and train an Artificial Neural Network (ANN). The parameters that were used for predicting dominant enzyme function, were derived from this training set, and subsequently refined on 147 sequences with available expression data. Our knowledge-based approach, can ascribe differential endoglucanase activity (A, B, or C) to a query sequence with high confidence, and was used to construct a local repository of class C GH9 endoglucanases (GH9C = 241) from 32 sequenced green plants.
format Online
Article
Text
id pubmed-4981690
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-49816902016-08-26 In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases Kundu, Siddhartha Sharma, Rita Front Plant Sci Plant Science The glycoside hydrolase 9 superfamily, mainly comprising the endoglucanases, is represented in all three domains of life. The current division of GH9 enzymes, into three subclasses, namely A, B, and C, is centered on parameters derived from sequence information alone. However, this classification is ambiguous, and is limited by the paralogous ancestry of classes B and C endoglucanases, and paucity of biochemical and structural data. Here, we extend this classification schema to putative GH9 endoglucanases present in green plants, with an emphasis on identifying novel members of the class C subset. These enzymes cleave the β(1 → 4) linkage between non-terminal adjacent D-glucopyranose residues, in both, amorphous and crystalline regions of cellulose. We utilized non redundant plant GH9 enzymes with characterized molecular data, as the training set to construct Hidden Markov Models (HMMs) and train an Artificial Neural Network (ANN). The parameters that were used for predicting dominant enzyme function, were derived from this training set, and subsequently refined on 147 sequences with available expression data. Our knowledge-based approach, can ascribe differential endoglucanase activity (A, B, or C) to a query sequence with high confidence, and was used to construct a local repository of class C GH9 endoglucanases (GH9C = 241) from 32 sequenced green plants. Frontiers Media S.A. 2016-08-12 /pmc/articles/PMC4981690/ /pubmed/27570528 http://dx.doi.org/10.3389/fpls.2016.01185 Text en Copyright © 2016 Kundu and Sharma. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Kundu, Siddhartha
Sharma, Rita
In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases
title In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases
title_full In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases
title_fullStr In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases
title_full_unstemmed In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases
title_short In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases
title_sort in silico identification and taxonomic distribution of plant class c gh9 endoglucanases
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4981690/
https://www.ncbi.nlm.nih.gov/pubmed/27570528
http://dx.doi.org/10.3389/fpls.2016.01185
work_keys_str_mv AT kundusiddhartha insilicoidentificationandtaxonomicdistributionofplantclasscgh9endoglucanases
AT sharmarita insilicoidentificationandtaxonomicdistributionofplantclasscgh9endoglucanases