Cargando…
In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases
The glycoside hydrolase 9 superfamily, mainly comprising the endoglucanases, is represented in all three domains of life. The current division of GH9 enzymes, into three subclasses, namely A, B, and C, is centered on parameters derived from sequence information alone. However, this classification is...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4981690/ https://www.ncbi.nlm.nih.gov/pubmed/27570528 http://dx.doi.org/10.3389/fpls.2016.01185 |
_version_ | 1782447644408283136 |
---|---|
author | Kundu, Siddhartha Sharma, Rita |
author_facet | Kundu, Siddhartha Sharma, Rita |
author_sort | Kundu, Siddhartha |
collection | PubMed |
description | The glycoside hydrolase 9 superfamily, mainly comprising the endoglucanases, is represented in all three domains of life. The current division of GH9 enzymes, into three subclasses, namely A, B, and C, is centered on parameters derived from sequence information alone. However, this classification is ambiguous, and is limited by the paralogous ancestry of classes B and C endoglucanases, and paucity of biochemical and structural data. Here, we extend this classification schema to putative GH9 endoglucanases present in green plants, with an emphasis on identifying novel members of the class C subset. These enzymes cleave the β(1 → 4) linkage between non-terminal adjacent D-glucopyranose residues, in both, amorphous and crystalline regions of cellulose. We utilized non redundant plant GH9 enzymes with characterized molecular data, as the training set to construct Hidden Markov Models (HMMs) and train an Artificial Neural Network (ANN). The parameters that were used for predicting dominant enzyme function, were derived from this training set, and subsequently refined on 147 sequences with available expression data. Our knowledge-based approach, can ascribe differential endoglucanase activity (A, B, or C) to a query sequence with high confidence, and was used to construct a local repository of class C GH9 endoglucanases (GH9C = 241) from 32 sequenced green plants. |
format | Online Article Text |
id | pubmed-4981690 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-49816902016-08-26 In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases Kundu, Siddhartha Sharma, Rita Front Plant Sci Plant Science The glycoside hydrolase 9 superfamily, mainly comprising the endoglucanases, is represented in all three domains of life. The current division of GH9 enzymes, into three subclasses, namely A, B, and C, is centered on parameters derived from sequence information alone. However, this classification is ambiguous, and is limited by the paralogous ancestry of classes B and C endoglucanases, and paucity of biochemical and structural data. Here, we extend this classification schema to putative GH9 endoglucanases present in green plants, with an emphasis on identifying novel members of the class C subset. These enzymes cleave the β(1 → 4) linkage between non-terminal adjacent D-glucopyranose residues, in both, amorphous and crystalline regions of cellulose. We utilized non redundant plant GH9 enzymes with characterized molecular data, as the training set to construct Hidden Markov Models (HMMs) and train an Artificial Neural Network (ANN). The parameters that were used for predicting dominant enzyme function, were derived from this training set, and subsequently refined on 147 sequences with available expression data. Our knowledge-based approach, can ascribe differential endoglucanase activity (A, B, or C) to a query sequence with high confidence, and was used to construct a local repository of class C GH9 endoglucanases (GH9C = 241) from 32 sequenced green plants. Frontiers Media S.A. 2016-08-12 /pmc/articles/PMC4981690/ /pubmed/27570528 http://dx.doi.org/10.3389/fpls.2016.01185 Text en Copyright © 2016 Kundu and Sharma. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Kundu, Siddhartha Sharma, Rita In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases |
title | In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases |
title_full | In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases |
title_fullStr | In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases |
title_full_unstemmed | In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases |
title_short | In silico Identification and Taxonomic Distribution of Plant Class C GH9 Endoglucanases |
title_sort | in silico identification and taxonomic distribution of plant class c gh9 endoglucanases |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4981690/ https://www.ncbi.nlm.nih.gov/pubmed/27570528 http://dx.doi.org/10.3389/fpls.2016.01185 |
work_keys_str_mv | AT kundusiddhartha insilicoidentificationandtaxonomicdistributionofplantclasscgh9endoglucanases AT sharmarita insilicoidentificationandtaxonomicdistributionofplantclasscgh9endoglucanases |