Cargando…

AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation

Arsenic (As) is the most ubiquitous toxic metalloid in nature. Microbe-mediated As metabolism plays an important role in global As biogeochemical processes, greatly changing its toxicity and bioavailability. While metagenomic sequencing may advance our understanding of the As metabolism capacity of...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Xinwei, Li, Yiqun, Stirling, Erinne, Zhao, Kankan, Wang, Binhao, Zhu, Yongguan, Luo, Yongming, Xu, Jianming, Ma, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623898/
https://www.ncbi.nlm.nih.gov/pubmed/36330044
http://dx.doi.org/10.1093/nargab/lqac080
_version_ 1784822108940926976
author Song, Xinwei
Li, Yiqun
Stirling, Erinne
Zhao, Kankan
Wang, Binhao
Zhu, Yongguan
Luo, Yongming
Xu, Jianming
Ma, Bin
author_facet Song, Xinwei
Li, Yiqun
Stirling, Erinne
Zhao, Kankan
Wang, Binhao
Zhu, Yongguan
Luo, Yongming
Xu, Jianming
Ma, Bin
author_sort Song, Xinwei
collection PubMed
description Arsenic (As) is the most ubiquitous toxic metalloid in nature. Microbe-mediated As metabolism plays an important role in global As biogeochemical processes, greatly changing its toxicity and bioavailability. While metagenomic sequencing may advance our understanding of the As metabolism capacity of microbial communities in different environments, accurate metagenomic profiling of As metabolism remains challenging due to low coverage and inaccurate definitions of As metabolism gene families in public orthology databases. Here we developed a manually curated As metabolism gene database (AsgeneDB) comprising 400 242 representative sequences from 59 As metabolism gene families, which are affiliated with 1653 microbial genera from 46 phyla. AsgeneDB achieved 100% annotation sensitivity and 99.96% annotation accuracy for an artificial gene dataset. We then applied AsgeneDB for functional and taxonomic profiling of As metabolism in metagenomes from various habitats (freshwater, hot spring, marine sediment and soil). The results showed that AsgeneDB substantially improved the mapping ratio of short reads in metagenomes from various environments. Compared with other databases, AsgeneDB provides more accurate, more comprehensive and faster analysis of As metabolic genes. In addition, we developed an R package, Asgene, to facilitate the analysis of metagenome sequencing data. Therefore, AsgeneDB and the associated Asgene package will greatly promote the study of As metabolism in microbial communities in various environments.
format Online
Article
Text
id pubmed-9623898
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96238982022-11-02 AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation Song, Xinwei Li, Yiqun Stirling, Erinne Zhao, Kankan Wang, Binhao Zhu, Yongguan Luo, Yongming Xu, Jianming Ma, Bin NAR Genom Bioinform Standard Article Arsenic (As) is the most ubiquitous toxic metalloid in nature. Microbe-mediated As metabolism plays an important role in global As biogeochemical processes, greatly changing its toxicity and bioavailability. While metagenomic sequencing may advance our understanding of the As metabolism capacity of microbial communities in different environments, accurate metagenomic profiling of As metabolism remains challenging due to low coverage and inaccurate definitions of As metabolism gene families in public orthology databases. Here we developed a manually curated As metabolism gene database (AsgeneDB) comprising 400 242 representative sequences from 59 As metabolism gene families, which are affiliated with 1653 microbial genera from 46 phyla. AsgeneDB achieved 100% annotation sensitivity and 99.96% annotation accuracy for an artificial gene dataset. We then applied AsgeneDB for functional and taxonomic profiling of As metabolism in metagenomes from various habitats (freshwater, hot spring, marine sediment and soil). The results showed that AsgeneDB substantially improved the mapping ratio of short reads in metagenomes from various environments. Compared with other databases, AsgeneDB provides more accurate, more comprehensive and faster analysis of As metabolic genes. In addition, we developed an R package, Asgene, to facilitate the analysis of metagenome sequencing data. Therefore, AsgeneDB and the associated Asgene package will greatly promote the study of As metabolism in microbial communities in various environments. Oxford University Press 2022-11-01 /pmc/articles/PMC9623898/ /pubmed/36330044 http://dx.doi.org/10.1093/nargab/lqac080 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Song, Xinwei
Li, Yiqun
Stirling, Erinne
Zhao, Kankan
Wang, Binhao
Zhu, Yongguan
Luo, Yongming
Xu, Jianming
Ma, Bin
AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation
title AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation
title_full AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation
title_fullStr AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation
title_full_unstemmed AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation
title_short AsgeneDB: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation
title_sort asgenedb: a curated orthology arsenic metabolism gene database and computational tool for metagenome annotation
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623898/
https://www.ncbi.nlm.nih.gov/pubmed/36330044
http://dx.doi.org/10.1093/nargab/lqac080
work_keys_str_mv AT songxinwei asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation
AT liyiqun asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation
AT stirlingerinne asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation
AT zhaokankan asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation
AT wangbinhao asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation
AT zhuyongguan asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation
AT luoyongming asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation
AT xujianming asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation
AT mabin asgenedbacuratedorthologyarsenicmetabolismgenedatabaseandcomputationaltoolformetagenomeannotation