Cargando…
PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes
BACKGROUND: Phosphorus (P) is one of the most essential macronutrients on the planet, and microorganisms (including bacteria and archaea) play a key role in P cycling in all living things and ecosystems. However, our comprehensive understanding of key P cycling genes (PCGs) and microorganisms (PCMs)...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252087/ https://www.ncbi.nlm.nih.gov/pubmed/35787295 http://dx.doi.org/10.1186/s40168-022-01292-1 |
_version_ | 1784740185074827264 |
---|---|
author | Zeng, Jiaxiong Tu, Qichao Yu, Xiaoli Qian, Lu Wang, Cheng Shu, Longfei Liu, Fei Liu, Shengwei Huang, Zhijian He, Jianguo Yan, Qingyun He, Zhili |
author_facet | Zeng, Jiaxiong Tu, Qichao Yu, Xiaoli Qian, Lu Wang, Cheng Shu, Longfei Liu, Fei Liu, Shengwei Huang, Zhijian He, Jianguo Yan, Qingyun He, Zhili |
author_sort | Zeng, Jiaxiong |
collection | PubMed |
description | BACKGROUND: Phosphorus (P) is one of the most essential macronutrients on the planet, and microorganisms (including bacteria and archaea) play a key role in P cycling in all living things and ecosystems. However, our comprehensive understanding of key P cycling genes (PCGs) and microorganisms (PCMs) as well as their ecological functions remains elusive even with the rapid advancement of metagenome sequencing technologies. One of major challenges is a lack of a comprehensive and accurately annotated P cycling functional gene database. RESULTS: In this study, we constructed a well-curated P cycling database (PCycDB) covering 139 gene families and 10 P metabolic processes, including several previously ignored PCGs such as pafA encoding phosphate-insensitive phosphatase, ptxABCD (phosphite-related genes), and novel aepXVWPS genes for 2-aminoethylphosphonate transporters. We achieved an annotation accuracy, positive predictive value (PPV), sensitivity, specificity, and negative predictive value (NPV) of 99.8%, 96.1%, 99.9%, 99.8%, and 99.9%, respectively, for simulated gene datasets. Compared to other orthology databases, PCycDB is more accurate, more comprehensive, and faster to profile the PCGs. We used PCycDB to analyze P cycling microbial communities from representative natural and engineered environments and showed that PCycDB could apply to different environments. CONCLUSIONS: We demonstrate that PCycDB is a powerful tool for advancing our understanding of microbially driven P cycling in the environment with high coverage, high accuracy, and rapid analysis of metagenome sequencing data. The PCycDB is available at https://github.com/ZengJiaxiong/Phosphorus-cycling-database. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-022-01292-1. |
format | Online Article Text |
id | pubmed-9252087 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-92520872022-07-05 PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes Zeng, Jiaxiong Tu, Qichao Yu, Xiaoli Qian, Lu Wang, Cheng Shu, Longfei Liu, Fei Liu, Shengwei Huang, Zhijian He, Jianguo Yan, Qingyun He, Zhili Microbiome Methodology BACKGROUND: Phosphorus (P) is one of the most essential macronutrients on the planet, and microorganisms (including bacteria and archaea) play a key role in P cycling in all living things and ecosystems. However, our comprehensive understanding of key P cycling genes (PCGs) and microorganisms (PCMs) as well as their ecological functions remains elusive even with the rapid advancement of metagenome sequencing technologies. One of major challenges is a lack of a comprehensive and accurately annotated P cycling functional gene database. RESULTS: In this study, we constructed a well-curated P cycling database (PCycDB) covering 139 gene families and 10 P metabolic processes, including several previously ignored PCGs such as pafA encoding phosphate-insensitive phosphatase, ptxABCD (phosphite-related genes), and novel aepXVWPS genes for 2-aminoethylphosphonate transporters. We achieved an annotation accuracy, positive predictive value (PPV), sensitivity, specificity, and negative predictive value (NPV) of 99.8%, 96.1%, 99.9%, 99.8%, and 99.9%, respectively, for simulated gene datasets. Compared to other orthology databases, PCycDB is more accurate, more comprehensive, and faster to profile the PCGs. We used PCycDB to analyze P cycling microbial communities from representative natural and engineered environments and showed that PCycDB could apply to different environments. CONCLUSIONS: We demonstrate that PCycDB is a powerful tool for advancing our understanding of microbially driven P cycling in the environment with high coverage, high accuracy, and rapid analysis of metagenome sequencing data. The PCycDB is available at https://github.com/ZengJiaxiong/Phosphorus-cycling-database. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-022-01292-1. BioMed Central 2022-07-04 /pmc/articles/PMC9252087/ /pubmed/35787295 http://dx.doi.org/10.1186/s40168-022-01292-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Zeng, Jiaxiong Tu, Qichao Yu, Xiaoli Qian, Lu Wang, Cheng Shu, Longfei Liu, Fei Liu, Shengwei Huang, Zhijian He, Jianguo Yan, Qingyun He, Zhili PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes |
title | PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes |
title_full | PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes |
title_fullStr | PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes |
title_full_unstemmed | PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes |
title_short | PCycDB: a comprehensive and accurate database for fast analysis of phosphorus cycling genes |
title_sort | pcycdb: a comprehensive and accurate database for fast analysis of phosphorus cycling genes |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252087/ https://www.ncbi.nlm.nih.gov/pubmed/35787295 http://dx.doi.org/10.1186/s40168-022-01292-1 |
work_keys_str_mv | AT zengjiaxiong pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT tuqichao pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT yuxiaoli pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT qianlu pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT wangcheng pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT shulongfei pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT liufei pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT liushengwei pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT huangzhijian pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT hejianguo pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT yanqingyun pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes AT hezhili pcycdbacomprehensiveandaccuratedatabaseforfastanalysisofphosphoruscyclinggenes |