Cargando…
dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes
Carbohydrate Active EnZymes (CAZymes) are significantly important for microbial communities to thrive in carbohydrate rich environments such as animal guts, agricultural soils, forest floors, and ocean sediments. Since 2017, microbiome sequencing and assembly have produced numerous metagenome assemb...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825555/ https://www.ncbi.nlm.nih.gov/pubmed/36399503 http://dx.doi.org/10.1093/nar/gkac1068 |
_version_ | 1784866658104377344 |
---|---|
author | Zheng, Jinfang Hu, Boyang Zhang, Xinpeng Ge, Qiwei Yan, Yuchen Akresi, Jerry Piyush, Ved Huang, Le Yin, Yanbin |
author_facet | Zheng, Jinfang Hu, Boyang Zhang, Xinpeng Ge, Qiwei Yan, Yuchen Akresi, Jerry Piyush, Ved Huang, Le Yin, Yanbin |
author_sort | Zheng, Jinfang |
collection | PubMed |
description | Carbohydrate Active EnZymes (CAZymes) are significantly important for microbial communities to thrive in carbohydrate rich environments such as animal guts, agricultural soils, forest floors, and ocean sediments. Since 2017, microbiome sequencing and assembly have produced numerous metagenome assembled genomes (MAGs). We have updated our dbCAN-seq database (https://bcb.unl.edu/dbCAN_seq) to include the following new data and features: (i) ∼498 000 CAZymes and ∼169 000 CAZyme gene clusters (CGCs) from 9421 MAGs of four ecological (human gut, human oral, cow rumen, and marine) environments; (ii) Glycan substrates for 41 447 (24.54%) CGCs inferred by two novel approaches (dbCAN-PUL homology search and eCAMI subfamily majority voting) (the two approaches agreed on 4183 CGCs for substrate assignments); (iii) A redesigned CGC page to include the graphical display of CGC gene compositions, the alignment of query CGC and subject PUL (polysaccharide utilization loci) of dbCAN-PUL, and the eCAMI subfamily table to support the predicted substrates; (iv) A statistics page to organize all the data for easy CGC access according to substrates and taxonomic phyla; and (v) A batch download page. In summary, this updated dbCAN-seq database highlights glycan substrates predicted for CGCs from microbiomes. Future work will implement the substrate prediction function in our dbCAN2 web server. |
format | Online Article Text |
id | pubmed-9825555 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98255552023-01-10 dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes Zheng, Jinfang Hu, Boyang Zhang, Xinpeng Ge, Qiwei Yan, Yuchen Akresi, Jerry Piyush, Ved Huang, Le Yin, Yanbin Nucleic Acids Res Database Issue Carbohydrate Active EnZymes (CAZymes) are significantly important for microbial communities to thrive in carbohydrate rich environments such as animal guts, agricultural soils, forest floors, and ocean sediments. Since 2017, microbiome sequencing and assembly have produced numerous metagenome assembled genomes (MAGs). We have updated our dbCAN-seq database (https://bcb.unl.edu/dbCAN_seq) to include the following new data and features: (i) ∼498 000 CAZymes and ∼169 000 CAZyme gene clusters (CGCs) from 9421 MAGs of four ecological (human gut, human oral, cow rumen, and marine) environments; (ii) Glycan substrates for 41 447 (24.54%) CGCs inferred by two novel approaches (dbCAN-PUL homology search and eCAMI subfamily majority voting) (the two approaches agreed on 4183 CGCs for substrate assignments); (iii) A redesigned CGC page to include the graphical display of CGC gene compositions, the alignment of query CGC and subject PUL (polysaccharide utilization loci) of dbCAN-PUL, and the eCAMI subfamily table to support the predicted substrates; (iv) A statistics page to organize all the data for easy CGC access according to substrates and taxonomic phyla; and (v) A batch download page. In summary, this updated dbCAN-seq database highlights glycan substrates predicted for CGCs from microbiomes. Future work will implement the substrate prediction function in our dbCAN2 web server. Oxford University Press 2022-11-18 /pmc/articles/PMC9825555/ /pubmed/36399503 http://dx.doi.org/10.1093/nar/gkac1068 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Issue Zheng, Jinfang Hu, Boyang Zhang, Xinpeng Ge, Qiwei Yan, Yuchen Akresi, Jerry Piyush, Ved Huang, Le Yin, Yanbin dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes |
title | dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes |
title_full | dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes |
title_fullStr | dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes |
title_full_unstemmed | dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes |
title_short | dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes |
title_sort | dbcan-seq update: cazyme gene clusters and substrates in microbiomes |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825555/ https://www.ncbi.nlm.nih.gov/pubmed/36399503 http://dx.doi.org/10.1093/nar/gkac1068 |
work_keys_str_mv | AT zhengjinfang dbcansequpdatecazymegeneclustersandsubstratesinmicrobiomes AT huboyang dbcansequpdatecazymegeneclustersandsubstratesinmicrobiomes AT zhangxinpeng dbcansequpdatecazymegeneclustersandsubstratesinmicrobiomes AT geqiwei dbcansequpdatecazymegeneclustersandsubstratesinmicrobiomes AT yanyuchen dbcansequpdatecazymegeneclustersandsubstratesinmicrobiomes AT akresijerry dbcansequpdatecazymegeneclustersandsubstratesinmicrobiomes AT piyushved dbcansequpdatecazymegeneclustersandsubstratesinmicrobiomes AT huangle dbcansequpdatecazymegeneclustersandsubstratesinmicrobiomes AT yinyanbin dbcansequpdatecazymegeneclustersandsubstratesinmicrobiomes |