Cargando…

Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets

Identification and analysis of fungal communities commonly rely on internal transcribed spacer–based (ITS-based) amplicon sequencing. There is no gold standard used to infer and classify fungal constituents since methodologies have been adapted from analyses of bacterial communities. To achieve high...

Descripción completa

Detalles Bibliográficos
Autores principales: Rolling, Thierry, Zhai, Bing, Frame, John, Hohl, Tobias M., Taur, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Clinical Investigation 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8765055/
https://www.ncbi.nlm.nih.gov/pubmed/34813499
http://dx.doi.org/10.1172/jci.insight.151663
_version_ 1784634283683479552
author Rolling, Thierry
Zhai, Bing
Frame, John
Hohl, Tobias M.
Taur, Ying
author_facet Rolling, Thierry
Zhai, Bing
Frame, John
Hohl, Tobias M.
Taur, Ying
author_sort Rolling, Thierry
collection PubMed
description Identification and analysis of fungal communities commonly rely on internal transcribed spacer–based (ITS-based) amplicon sequencing. There is no gold standard used to infer and classify fungal constituents since methodologies have been adapted from analyses of bacterial communities. To achieve high-resolution inference of fungal constituents, we customized a DADA2-based pipeline using a mix of 11 medically relevant fungi. While DADA2 allowed the discrimination of ITS1 sequences differing by single nucleotides, quality filtering, sequencing bias, and database selection were identified as key variables determining the accuracy of sample inference. Due to species-specific differences in sequencing quality, default filtering settings removed most reads that originated from Aspergillus species, Saccharomyces cerevisiae, and Candida glabrata. By fine-tuning the quality filtering process, we achieved an improved representation of the fungal communities. By adapting a wobble nucleotide in the ITS1 forward primer region, we further increased the yield of S. cerevisiae and C. glabrata sequences. Finally, we showed that a BLAST-based algorithm based on the UNITE+INSD or the NCBI NT database achieved a higher reliability in species-level taxonomic annotation compared with the naive Bayesian classifier implemented in DADA2. These steps optimized a robust fungal ITS1 sequencing pipeline that, in most instances, enabled species-level assignment of community members.
format Online
Article
Text
id pubmed-8765055
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Society for Clinical Investigation
record_format MEDLINE/PubMed
spelling pubmed-87650552022-01-24 Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets Rolling, Thierry Zhai, Bing Frame, John Hohl, Tobias M. Taur, Ying JCI Insight Resource and Technical Advance Identification and analysis of fungal communities commonly rely on internal transcribed spacer–based (ITS-based) amplicon sequencing. There is no gold standard used to infer and classify fungal constituents since methodologies have been adapted from analyses of bacterial communities. To achieve high-resolution inference of fungal constituents, we customized a DADA2-based pipeline using a mix of 11 medically relevant fungi. While DADA2 allowed the discrimination of ITS1 sequences differing by single nucleotides, quality filtering, sequencing bias, and database selection were identified as key variables determining the accuracy of sample inference. Due to species-specific differences in sequencing quality, default filtering settings removed most reads that originated from Aspergillus species, Saccharomyces cerevisiae, and Candida glabrata. By fine-tuning the quality filtering process, we achieved an improved representation of the fungal communities. By adapting a wobble nucleotide in the ITS1 forward primer region, we further increased the yield of S. cerevisiae and C. glabrata sequences. Finally, we showed that a BLAST-based algorithm based on the UNITE+INSD or the NCBI NT database achieved a higher reliability in species-level taxonomic annotation compared with the naive Bayesian classifier implemented in DADA2. These steps optimized a robust fungal ITS1 sequencing pipeline that, in most instances, enabled species-level assignment of community members. American Society for Clinical Investigation 2022-01-11 /pmc/articles/PMC8765055/ /pubmed/34813499 http://dx.doi.org/10.1172/jci.insight.151663 Text en © 2022 Rolling et al. https://creativecommons.org/licenses/by/4.0/This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Resource and Technical Advance
Rolling, Thierry
Zhai, Bing
Frame, John
Hohl, Tobias M.
Taur, Ying
Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets
title Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets
title_full Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets
title_fullStr Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets
title_full_unstemmed Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets
title_short Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets
title_sort customization of a dada2-based pipeline for fungal internal transcribed spacer 1 (its1) amplicon data sets
topic Resource and Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8765055/
https://www.ncbi.nlm.nih.gov/pubmed/34813499
http://dx.doi.org/10.1172/jci.insight.151663
work_keys_str_mv AT rollingthierry customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets
AT zhaibing customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets
AT framejohn customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets
AT hohltobiasm customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets
AT taurying customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets