Cargando…
Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets
Identification and analysis of fungal communities commonly rely on internal transcribed spacer–based (ITS-based) amplicon sequencing. There is no gold standard used to infer and classify fungal constituents since methodologies have been adapted from analyses of bacterial communities. To achieve high...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Clinical Investigation
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8765055/ https://www.ncbi.nlm.nih.gov/pubmed/34813499 http://dx.doi.org/10.1172/jci.insight.151663 |
_version_ | 1784634283683479552 |
---|---|
author | Rolling, Thierry Zhai, Bing Frame, John Hohl, Tobias M. Taur, Ying |
author_facet | Rolling, Thierry Zhai, Bing Frame, John Hohl, Tobias M. Taur, Ying |
author_sort | Rolling, Thierry |
collection | PubMed |
description | Identification and analysis of fungal communities commonly rely on internal transcribed spacer–based (ITS-based) amplicon sequencing. There is no gold standard used to infer and classify fungal constituents since methodologies have been adapted from analyses of bacterial communities. To achieve high-resolution inference of fungal constituents, we customized a DADA2-based pipeline using a mix of 11 medically relevant fungi. While DADA2 allowed the discrimination of ITS1 sequences differing by single nucleotides, quality filtering, sequencing bias, and database selection were identified as key variables determining the accuracy of sample inference. Due to species-specific differences in sequencing quality, default filtering settings removed most reads that originated from Aspergillus species, Saccharomyces cerevisiae, and Candida glabrata. By fine-tuning the quality filtering process, we achieved an improved representation of the fungal communities. By adapting a wobble nucleotide in the ITS1 forward primer region, we further increased the yield of S. cerevisiae and C. glabrata sequences. Finally, we showed that a BLAST-based algorithm based on the UNITE+INSD or the NCBI NT database achieved a higher reliability in species-level taxonomic annotation compared with the naive Bayesian classifier implemented in DADA2. These steps optimized a robust fungal ITS1 sequencing pipeline that, in most instances, enabled species-level assignment of community members. |
format | Online Article Text |
id | pubmed-8765055 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Society for Clinical Investigation |
record_format | MEDLINE/PubMed |
spelling | pubmed-87650552022-01-24 Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets Rolling, Thierry Zhai, Bing Frame, John Hohl, Tobias M. Taur, Ying JCI Insight Resource and Technical Advance Identification and analysis of fungal communities commonly rely on internal transcribed spacer–based (ITS-based) amplicon sequencing. There is no gold standard used to infer and classify fungal constituents since methodologies have been adapted from analyses of bacterial communities. To achieve high-resolution inference of fungal constituents, we customized a DADA2-based pipeline using a mix of 11 medically relevant fungi. While DADA2 allowed the discrimination of ITS1 sequences differing by single nucleotides, quality filtering, sequencing bias, and database selection were identified as key variables determining the accuracy of sample inference. Due to species-specific differences in sequencing quality, default filtering settings removed most reads that originated from Aspergillus species, Saccharomyces cerevisiae, and Candida glabrata. By fine-tuning the quality filtering process, we achieved an improved representation of the fungal communities. By adapting a wobble nucleotide in the ITS1 forward primer region, we further increased the yield of S. cerevisiae and C. glabrata sequences. Finally, we showed that a BLAST-based algorithm based on the UNITE+INSD or the NCBI NT database achieved a higher reliability in species-level taxonomic annotation compared with the naive Bayesian classifier implemented in DADA2. These steps optimized a robust fungal ITS1 sequencing pipeline that, in most instances, enabled species-level assignment of community members. American Society for Clinical Investigation 2022-01-11 /pmc/articles/PMC8765055/ /pubmed/34813499 http://dx.doi.org/10.1172/jci.insight.151663 Text en © 2022 Rolling et al. https://creativecommons.org/licenses/by/4.0/This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Resource and Technical Advance Rolling, Thierry Zhai, Bing Frame, John Hohl, Tobias M. Taur, Ying Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets |
title | Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets |
title_full | Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets |
title_fullStr | Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets |
title_full_unstemmed | Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets |
title_short | Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets |
title_sort | customization of a dada2-based pipeline for fungal internal transcribed spacer 1 (its1) amplicon data sets |
topic | Resource and Technical Advance |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8765055/ https://www.ncbi.nlm.nih.gov/pubmed/34813499 http://dx.doi.org/10.1172/jci.insight.151663 |
work_keys_str_mv | AT rollingthierry customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets AT zhaibing customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets AT framejohn customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets AT hohltobiasm customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets AT taurying customizationofadada2basedpipelineforfungalinternaltranscribedspacer1its1amplicondatasets |