Cargando…

PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes

BACKGROUND: Phosphorus (P) is one of the most essential macronutrients on the planet, and microorganisms (including bacteria and archaea) play a key role in P cycling in all living things and ecosystems. However, our comprehensive understanding of key P cycling genes (PCGs) and microorganisms (PCMs)...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeng, Jiaxiong, Tu, Qichao, Yu, Xiaoli, Qian, Lu, Wang, Cheng, Shu, Longfei, Liu, Fei, Liu, Shengwei, Huang, Zhijian, He, Jianguo, Yan, Qingyun, He, Zhili
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252087/
https://www.ncbi.nlm.nih.gov/pubmed/35787295
http://dx.doi.org/10.1186/s40168-022-01292-1
_version_ 1784740185074827264
author Zeng, Jiaxiong
Tu, Qichao
Yu, Xiaoli
Qian, Lu
Wang, Cheng
Shu, Longfei
Liu, Fei
Liu, Shengwei
Huang, Zhijian
He, Jianguo
Yan, Qingyun
He, Zhili
author_facet Zeng, Jiaxiong
Tu, Qichao
Yu, Xiaoli
Qian, Lu
Wang, Cheng
Shu, Longfei
Liu, Fei
Liu, Shengwei
Huang, Zhijian
He, Jianguo
Yan, Qingyun
He, Zhili
author_sort Zeng, Jiaxiong
collection PubMed
description BACKGROUND: Phosphorus (P) is one of the most essential macronutrients on the planet, and microorganisms (including bacteria and archaea) play a key role in P cycling in all living things and ecosystems. However, our comprehensive understanding of key P cycling genes (PCGs) and microorganisms (PCMs) as well as their ecological functions remains elusive even with the rapid advancement of metagenome sequencing technologies. One of major challenges is a lack of a comprehensive and accurately annotated P cycling functional gene database. RESULTS: In this study, we constructed a well-curated P cycling database (PCycDB) covering 139 gene families and 10 P metabolic processes, including several previously ignored PCGs such as pafA encoding phosphate-insensitive phosphatase, ptxABCD (phosphite-related genes), and novel aepXVWPS genes for 2-aminoethylphosphonate transporters. We achieved an annotation accuracy, positive predictive value (PPV), sensitivity, specificity, and negative predictive value (NPV) of 99.8%, 96.1%, 99.9%, 99.8%, and 99.9%, respectively, for simulated gene datasets. Compared to other orthology databases, PCycDB is more accurate, more comprehensive, and faster to profile the PCGs. We used PCycDB to analyze P cycling microbial communities from representative natural and engineered environments and showed that PCycDB could apply to different environments. CONCLUSIONS: We demonstrate that PCycDB is a powerful tool for advancing our understanding of microbially driven P cycling in the environment with high coverage, high accuracy, and rapid analysis of metagenome sequencing data. The PCycDB is available at https://github.com/ZengJiaxiong/Phosphorus-cycling-database. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-022-01292-1.
format Online
Article
Text
id pubmed-9252087
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-92520872022-07-05 PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes Zeng, Jiaxiong Tu, Qichao Yu, Xiaoli Qian, Lu Wang, Cheng Shu, Longfei Liu, Fei Liu, Shengwei Huang, Zhijian He, Jianguo Yan, Qingyun He, Zhili Microbiome Methodology BACKGROUND: Phosphorus (P) is one of the most essential macronutrients on the planet, and microorganisms (including bacteria and archaea) play a key role in P cycling in all living things and ecosystems. However, our comprehensive understanding of key P cycling genes (PCGs) and microorganisms (PCMs) as well as their ecological functions remains elusive even with the rapid advancement of metagenome sequencing technologies. One of major challenges is a lack of a comprehensive and accurately annotated P cycling functional gene database. RESULTS: In this study, we constructed a well-curated P cycling database (PCycDB) covering 139 gene families and 10 P metabolic processes, including several previously ignored PCGs such as pafA encoding phosphate-insensitive phosphatase, ptxABCD (phosphite-related genes), and novel aepXVWPS genes for 2-aminoethylphosphonate transporters. We achieved an annotation accuracy, positive predictive value (PPV), sensitivity, specificity, and negative predictive value (NPV) of 99.8%, 96.1%, 99.9%, 99.8%, and 99.9%, respectively, for simulated gene datasets. Compared to other orthology databases, PCycDB is more accurate, more comprehensive, and faster to profile the PCGs. We used PCycDB to analyze P cycling microbial communities from representative natural and engineered environments and showed that PCycDB could apply to different environments. CONCLUSIONS: We demonstrate that PCycDB is a powerful tool for advancing our understanding of microbially driven P cycling in the environment with high coverage, high accuracy, and rapid analysis of metagenome sequencing data. The PCycDB is available at https://github.com/ZengJiaxiong/Phosphorus-cycling-database. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-022-01292-1. BioMed Central 2022-07-04 /pmc/articles/PMC9252087/ /pubmed/35787295 http://dx.doi.org/10.1186/s40168-022-01292-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Zeng, Jiaxiong
Tu, Qichao
Yu, Xiaoli
Qian, Lu
Wang, Cheng
Shu, Longfei
Liu, Fei
Liu, Shengwei
Huang, Zhijian
He, Jianguo
Yan, Qingyun
He, Zhili
PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes
title PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes
title_full PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes
title_fullStr PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes
title_full_unstemmed PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes
title_short PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes
title_sort pcycdb: a comprehensive and accurate database for fast analysis of phosphorus cycling genes
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252087/
https://www.ncbi.nlm.nih.gov/pubmed/35787295
http://dx.doi.org/10.1186/s40168-022-01292-1
work_keys_str_mv AT zengjiaxiong pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT tuqichao pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT yuxiaoli pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT qianlu pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT wangcheng pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT shulongfei pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT liufei pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT liushengwei pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT huangzhijian pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT hejianguo pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT yanqingyun pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes
AT hezhili pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes