
Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies


Bibliographic Details
Main authors: Jia, Yangyang, Zhao, Shengguo, Guo, Wenjie, Peng, Ling, Zhao, Fang, Wang, Lushan, Fan, Guangyi, Zhu, Yuanfang, Xu, Dayou, Liu, Guilin, Wang, Ruoqing, Fang, Xiaodong, Zhang, He, Kristiansen, Karsten, Zhang, Wenwei, Chen, Jianwei
Format: Online Article Text
Language: English
Published: BioMed Central 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9387074/
https://www.ncbi.nlm.nih.gov/pubmed/35978448
http://dx.doi.org/10.1186/s40793-022-00436-y
author Jia, Yangyang
Zhao, Shengguo
Guo, Wenjie
Peng, Ling
Zhao, Fang
Wang, Lushan
Fan, Guangyi
Zhu, Yuanfang
Xu, Dayou
Liu, Guilin
Wang, Ruoqing
Fang, Xiaodong
Zhang, He
Kristiansen, Karsten
Zhang, Wenwei
Chen, Jianwei
author_sort Jia, Yangyang
collection PubMed
description BACKGROUND: A growing number of studies have demonstrated that rare taxa may make disproportionate functional and ecological contributions to a microbial community. However, the study of the microbial rare biosphere is hampered by the inherent scarcity of these taxa and by the limitations of currently available techniques. In metabarcoding amplicon sequencing, the most widely used approach, sample index misassignment can introduce cross-contamination between samples. Although downstream bioinformatic quality control and clustering or denoising algorithms can remove sequencing errors and non-biological artifact reads, no algorithm can eliminate high-quality reads that originate from cross-contamination introduced by index misassignment, which makes it difficult to distinguish bona fide rare taxa from potential false positives in metabarcoding studies.
RESULTS: We thoroughly evaluated the rate of index misassignment on the widely used NovaSeq 6000 and DNBSEQ-G400 sequencing platforms using both commercial and customized mock communities, and observed a significantly lower fraction of potential false positive reads for DNBSEQ-G400 than for NovaSeq 6000 (0.08% vs. 5.68%). Stochastically introduced false positive or false negative rare taxa could cause significant batch effects. These false detections could also inflate the alpha diversity of relatively simple microbial communities and underestimate that of complex ones. A further test using a set of cow rumen samples showed that the two sequencing platforms reported different rare taxa. Correlation analysis of the rare taxa detected by each platform demonstrated that the rare taxa identified by the DNBSEQ-G400 platform were much more likely to correlate with the physicochemical properties of rumen fluid than those identified by the NovaSeq 6000 platform. Community assembly mechanism analysis and microbial network correlation analysis indicated that false positive or false negative rare taxa detection could bias the inferred community assembly mechanism and lead to the identification of false keystone species in the community.
CONCLUSIONS: We strongly recommend proper positive, negative, and blank controls, technical replicates, and careful sequencing platform selection in future amplicon studies, especially when the microbial rare biosphere is the focus.
SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40793-022-00436-y.
format Online
Article
Text
id pubmed-9387074
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-9387074 2022-08-19 Environ Microbiome Research Article BioMed Central 2022-08-17 /pmc/articles/PMC9387074/ /pubmed/35978448 http://dx.doi.org/10.1186/s40793-022-00436-y Text en © The Author(s) 2022
Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.
The Creative Commons Public Domain Dedication waiver (https://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
title Sequencing introduced false positive rare taxa lead to biased microbial community diversity, assembly, and interaction interpretation in amplicon studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9387074/
https://www.ncbi.nlm.nih.gov/pubmed/35978448
http://dx.doi.org/10.1186/s40793-022-00436-y