Cargando…
Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies
BACKGROUND: Increasing studies have demonstrated potential disproportionate functional and ecological contributions of rare taxa in a microbial community. However, the study of the microbial rare biosphere is hampered by their inherent scarcity and the deficiency of currently available techniques. S...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9387074/ https://www.ncbi.nlm.nih.gov/pubmed/35978448 http://dx.doi.org/10.1186/s40793-022-00436-y |
_version_ | 1784769947592818688 |
---|---|
author | Jia, Yangyang Zhao, Shengguo Guo, Wenjie Peng, Ling Zhao, Fang Wang, Lushan Fan, Guangyi Zhu, Yuanfang Xu, Dayou Liu, Guilin Wang, Ruoqing Fang, Xiaodong Zhang, He Kristiansen, Karsten Zhang, Wenwei Chen, Jianwei |
author_facet | Jia, Yangyang Zhao, Shengguo Guo, Wenjie Peng, Ling Zhao, Fang Wang, Lushan Fan, Guangyi Zhu, Yuanfang Xu, Dayou Liu, Guilin Wang, Ruoqing Fang, Xiaodong Zhang, He Kristiansen, Karsten Zhang, Wenwei Chen, Jianwei |
author_sort | Jia, Yangyang |
collection | PubMed |
description | BACKGROUND: Increasing studies have demonstrated potential disproportionate functional and ecological contributions of rare taxa in a microbial community. However, the study of the microbial rare biosphere is hampered by their inherent scarcity and the deficiency of currently available techniques. Sample-wise cross contaminations might be introduced by sample index misassignment in the most widely used metabarcoding amplicon sequencing approach. Although downstream bioinformatic quality control and clustering or denoising algorithms could remove sequencing errors and non-biological artifact reads, no algorithm could eliminate high quality reads from sample-wise cross contaminations introduced by index misassignment, making it difficult to distinguish between bona fide rare taxa and potential false positives in metabarcoding studies. RESULTS: We thoroughly evaluated the rate of index misassignment of the widely used NovaSeq 6000 and DNBSEQ-G400 sequencing platforms using both commercial and customized mock communities, and observed significant lower (0.08% vs. 5.68%) fraction of potential false positive reads for DNBSEQ-G400 as compared to NovaSeq 6000. Significant batch effects could be caused by stochastically introduced false positive or false negative rare taxa. These false detections could also lead to inflated alpha diversity of relatively simple microbial communities and underestimated that of complex ones. Further test using a set of cow rumen samples reported differential rare taxa by different sequencing platforms. Correlation analysis of the rare taxa detected by each sequencing platform demonstrated that the rare taxa identified by DNBSEQ-G400 platform had a much higher possibility to be correlated with the physiochemical properties of rumen fluid as compared to NovaSeq 6000 platform. Community assembly mechanism and microbial network correlation analysis indicated that false positive or negative rare taxa detection could lead to biased community assembly mechanism and identification of fake keystone species of the community. CONCLUSIONS: We highly suggest proper positive/negative/blank controls, technical replicate settings, and proper sequencing platform selection in future amplicon studies, especially when the microbial rare biosphere would be focused. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40793-022-00436-y. |
format | Online Article Text |
id | pubmed-9387074 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-93870742022-08-19 Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies Jia, Yangyang Zhao, Shengguo Guo, Wenjie Peng, Ling Zhao, Fang Wang, Lushan Fan, Guangyi Zhu, Yuanfang Xu, Dayou Liu, Guilin Wang, Ruoqing Fang, Xiaodong Zhang, He Kristiansen, Karsten Zhang, Wenwei Chen, Jianwei Environ Microbiome Research Article BACKGROUND: Increasing studies have demonstrated potential disproportionate functional and ecological contributions of rare taxa in a microbial community. However, the study of the microbial rare biosphere is hampered by their inherent scarcity and the deficiency of currently available techniques. Sample-wise cross contaminations might be introduced by sample index misassignment in the most widely used metabarcoding amplicon sequencing approach. Although downstream bioinformatic quality control and clustering or denoising algorithms could remove sequencing errors and non-biological artifact reads, no algorithm could eliminate high quality reads from sample-wise cross contaminations introduced by index misassignment, making it difficult to distinguish between bona fide rare taxa and potential false positives in metabarcoding studies. RESULTS: We thoroughly evaluated the rate of index misassignment of the widely used NovaSeq 6000 and DNBSEQ-G400 sequencing platforms using both commercial and customized mock communities, and observed significant lower (0.08% vs. 5.68%) fraction of potential false positive reads for DNBSEQ-G400 as compared to NovaSeq 6000. Significant batch effects could be caused by stochastically introduced false positive or false negative rare taxa. These false detections could also lead to inflated alpha diversity of relatively simple microbial communities and underestimated that of complex ones. Further test using a set of cow rumen samples reported differential rare taxa by different sequencing platforms. Correlation analysis of the rare taxa detected by each sequencing platform demonstrated that the rare taxa identified by DNBSEQ-G400 platform had a much higher possibility to be correlated with the physiochemical properties of rumen fluid as compared to NovaSeq 6000 platform. Community assembly mechanism and microbial network correlation analysis indicated that false positive or negative rare taxa detection could lead to biased community assembly mechanism and identification of fake keystone species of the community. CONCLUSIONS: We highly suggest proper positive/negative/blank controls, technical replicate settings, and proper sequencing platform selection in future amplicon studies, especially when the microbial rare biosphere would be focused. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40793-022-00436-y. BioMed Central 2022-08-17 /pmc/articles/PMC9387074/ /pubmed/35978448 http://dx.doi.org/10.1186/s40793-022-00436-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Jia, Yangyang Zhao, Shengguo Guo, Wenjie Peng, Ling Zhao, Fang Wang, Lushan Fan, Guangyi Zhu, Yuanfang Xu, Dayou Liu, Guilin Wang, Ruoqing Fang, Xiaodong Zhang, He Kristiansen, Karsten Zhang, Wenwei Chen, Jianwei Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies |
title | Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies |
title_full | Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies |
title_fullStr | Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies |
title_full_unstemmed | Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies |
title_short | Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies |
title_sort | sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9387074/ https://www.ncbi.nlm.nih.gov/pubmed/35978448 http://dx.doi.org/10.1186/s40793-022-00436-y |
work_keys_str_mv | AT jiayangyang sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT zhaoshengguo sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT guowenjie sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT pengling sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT zhaofang sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT wanglushan sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT fanguangyi sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT zhuyuanfang sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT xudayou sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT liuguilin sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT wangruoqing sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT fangxiaodong sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT zhanghe sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT kristiansenkarsten sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT zhangwenwei sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies AT chenjianwei sequencingintroducedfalsepositiveraretaxaleadtobiasedmicrobialcommunitydiversityassemblyandinteractioninterpretationinampliconstudies |