AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation
Arsenic (As) is the most ubiquitous toxic metalloid in nature. Microbe-mediated As metabolism plays an important role in global As biogeochemical processes, greatly changing its toxicity and bioavailability. While metagenomic sequencing may advance our understanding of the As metabolism capacity of...
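Main Authors: | Song, Xinwei; Li, Yiqun; Stirling, Erinne; Zhao, Kankan; Wang, Binhao; Zhu, Yongguan; Luo, Yongming; Xu, Jianming; Ma, Bin |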
---|---|
Format: | Online Article Text |
Language: | English |
Published: |
Oxford University Press
2022
|
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623898/ https://www.ncbi.nlm.nih.gov/pubmed/36330044 http://dx.doi.org/10.1093/nargab/lqac080 |
_version_ | 1784822108940926976 |
---|---|
author | Song, Xinwei Li, Yiqun Stirling, Erinne Zhao, Kankan Wang, Binhao Zhu, Yongguan Luo, Yongming Xu, Jianming Ma, Bin |
author_facet | Song, Xinwei Li, Yiqun Stirling, Erinne Zhao, Kankan Wang, Binhao Zhu, Yongguan Luo, Yongming Xu, Jianming Ma, Bin |
author_sort | Song, Xinwei |
collection | PubMed |
description | Arsenic (As) is the most ubiquitous toxic metalloid in nature. Microbe-mediated As metabolism plays an important role in global As biogeochemical processes, greatly changing its toxicity and bioavailability. While metagenomic sequencing may advance our understanding of the As metabolism capacity of microbial communities in different environments, accurate metagenomic profiling of As metabolism remains challenging due to low coverage and inaccurate definitions of As metabolism gene families in public orthology databases. Here we developed a manually curated As metabolism gene database (AsgeneDB) comprising 400 242 representative sequences from 59 As metabolism gene families, which are affiliated with 1653 microbial genera from 46 phyla. AsgeneDB achieved 100% annotation sensitivity and 99.96% annotation accuracy for an artificial gene dataset. We then applied AsgeneDB for functional and taxonomic profiling of As metabolism in metagenomes from various habitats (freshwater, hot spring, marine sediment and soil). The results showed that AsgeneDB substantially improved the mapping ratio of short reads in metagenomes from various environments. Compared with other databases, AsgeneDB provides more accurate, more comprehensive and faster analysis of As metabolic genes. In addition, we developed an R package, Asgene, to facilitate the analysis of metagenome sequencing data. Therefore, AsgeneDB and the associated Asgene package will greatly promote the study of As metabolism in microbial communities in various environments. |
format | Online Article Text |
id | pubmed-9623898 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-9623898 2022-11-02 AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation Song, Xinwei Li, Yiqun Stirling, Erinne Zhao, Kankan Wang, Binhao Zhu, Yongguan Luo, Yongming Xu, Jianming Ma, Bin NAR Genom Bioinform Standard Article Arsenic (As) is the most ubiquitous toxic metalloid in nature. Microbe-mediated As metabolism plays an important role in global As biogeochemical processes, greatly changing its toxicity and bioavailability. While metagenomic sequencing may advance our understanding of the As metabolism capacity of microbial communities in different environments, accurate metagenomic profiling of As metabolism remains challenging due to low coverage and inaccurate definitions of As metabolism gene families in public orthology databases. Here we developed a manually curated As metabolism gene database (AsgeneDB) comprising 400 242 representative sequences from 59 As metabolism gene families, which are affiliated with 1653 microbial genera from 46 phyla. AsgeneDB achieved 100% annotation sensitivity and 99.96% annotation accuracy for an artificial gene dataset. We then applied AsgeneDB for functional and taxonomic profiling of As metabolism in metagenomes from various habitats (freshwater, hot spring, marine sediment and soil). The results showed that AsgeneDB substantially improved the mapping ratio of short reads in metagenomes from various environments. Compared with other databases, AsgeneDB provides more accurate, more comprehensive and faster analysis of As metabolic genes. In addition, we developed an R package, Asgene, to facilitate the analysis of metagenome sequencing data. Therefore, AsgeneDB and the associated Asgene package will greatly promote the study of As metabolism in microbial communities in various environments. Oxford University Press 2022-11-01 /pmc/articles/PMC9623898/ /pubmed/36330044 http://dx.doi.org/10.1093/nargab/lqac080 Text en © The Author(s) 2022.
Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Standard Article Song, Xinwei Li, Yiqun Stirling, Erinne Zhao, Kankan Wang, Binhao Zhu, Yongguan Luo, Yongming Xu, Jianming Ma, Bin AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation |
title | AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation |
title_full | AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation |
title_fullStr | AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation |
title_full_unstemmed | AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation |
title_short | AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation |
title_sort | asgenedb: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623898/ https://www.ncbi.nlm.nih.gov/pubmed/36330044 http://dx.doi.org/10.1093/nargab/lqac080 |
work_keys_str_mv | AT songxinwei asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation AT liyiqun asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation AT stirlingerinne asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation AT zhaokankan asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation AT wangbinhao asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation AT zhuyongguan asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation AT luoyongming asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation AT xujianming asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation AT mabin asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation |