Cargando…

Dnabarcoder: An open‐source software package for analysing and predicting DNA sequence similarity cutoffs for fungal sequence identification

The accuracy and precision of fungal molecular identification and classification are challenging, particularly in environmental metabarcoding approaches as these often trade accuracy for efficiency given the large data volumes at hand. In most ecological studies, only a single similarity cutoff valu...

Descripción completa

Detalles Bibliográficos
Autores principales: Vu, Duong, Nilsson, R. Henrik, Verkley, Gerard J. M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9542245/
https://www.ncbi.nlm.nih.gov/pubmed/35621380
http://dx.doi.org/10.1111/1755-0998.13651
_version_ 1784804107463163904
author Vu, Duong
Nilsson, R. Henrik
Verkley, Gerard J. M.
author_facet Vu, Duong
Nilsson, R. Henrik
Verkley, Gerard J. M.
author_sort Vu, Duong
collection PubMed
description The accuracy and precision of fungal molecular identification and classification are challenging, particularly in environmental metabarcoding approaches as these often trade accuracy for efficiency given the large data volumes at hand. In most ecological studies, only a single similarity cutoff value is used for sequence identification. This is not sufficient since the most commonly used DNA markers are known to vary widely in terms of inter‐ and intraspecific variability. We address this problem by presenting a new tool, dnabarcoder, to predict local similarity cutoffs and measure the resolving powers of a biomarker for sequence identification for different clades of fungi. It was shown that the predicted similarity cutoffs varied significantly between the clades of a recently released ITS DNA barcode data set from the CBS culture collection of the Westerdijk Fungal Biodiversity Institute. When classifying a large public fungal ITS data set—the UNITE database—against the barcode data set, the local similarity cutoffs assigned fewer sequences than the traditional cutoffs used in metabarcoding studies. However, the obtained accuracy and precision were significantly improved. Our study showed that it might be better to extract the ITS region from the ITS barcodes to optimize taxonomic assignment accuracy. Furthermore, 15.3, 25.6, and 26.3% of the fungal species of the barcode data set were indistinguishable by full‐length ITS, ITS1, and ITS2, respectively. Except for these indistinguishable species, the resolving powers of full‐length ITS, ITS1, and ITS2 sequences were similar at the species level. Nevertheless, the complete ITS region had a better resolving power at higher taxonomic levels.
format Online
Article
Text
id pubmed-9542245
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-95422452022-10-14 Dnabarcoder: An open‐source software package for analysing and predicting DNA sequence similarity cutoffs for fungal sequence identification Vu, Duong Nilsson, R. Henrik Verkley, Gerard J. M. Mol Ecol Resour RESOURCE ARTICLES The accuracy and precision of fungal molecular identification and classification are challenging, particularly in environmental metabarcoding approaches as these often trade accuracy for efficiency given the large data volumes at hand. In most ecological studies, only a single similarity cutoff value is used for sequence identification. This is not sufficient since the most commonly used DNA markers are known to vary widely in terms of inter‐ and intraspecific variability. We address this problem by presenting a new tool, dnabarcoder, to predict local similarity cutoffs and measure the resolving powers of a biomarker for sequence identification for different clades of fungi. It was shown that the predicted similarity cutoffs varied significantly between the clades of a recently released ITS DNA barcode data set from the CBS culture collection of the Westerdijk Fungal Biodiversity Institute. When classifying a large public fungal ITS data set—the UNITE database—against the barcode data set, the local similarity cutoffs assigned fewer sequences than the traditional cutoffs used in metabarcoding studies. However, the obtained accuracy and precision were significantly improved. Our study showed that it might be better to extract the ITS region from the ITS barcodes to optimize taxonomic assignment accuracy. Furthermore, 15.3, 25.6, and 26.3% of the fungal species of the barcode data set were indistinguishable by full‐length ITS, ITS1, and ITS2, respectively. Except for these indistinguishable species, the resolving powers of full‐length ITS, ITS1, and ITS2 sequences were similar at the species level. Nevertheless, the complete ITS region had a better resolving power at higher taxonomic levels. John Wiley and Sons Inc. 2022-06-20 2022-10 /pmc/articles/PMC9542245/ /pubmed/35621380 http://dx.doi.org/10.1111/1755-0998.13651 Text en © 2022 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle RESOURCE ARTICLES
Vu, Duong
Nilsson, R. Henrik
Verkley, Gerard J. M.
Dnabarcoder: An open‐source software package for analysing and predicting DNA sequence similarity cutoffs for fungal sequence identification
title Dnabarcoder: An open‐source software package for analysing and predicting DNA sequence similarity cutoffs for fungal sequence identification
title_full Dnabarcoder: An open‐source software package for analysing and predicting DNA sequence similarity cutoffs for fungal sequence identification
title_fullStr Dnabarcoder: An open‐source software package for analysing and predicting DNA sequence similarity cutoffs for fungal sequence identification
title_full_unstemmed Dnabarcoder: An open‐source software package for analysing and predicting DNA sequence similarity cutoffs for fungal sequence identification
title_short Dnabarcoder: An open‐source software package for analysing and predicting DNA sequence similarity cutoffs for fungal sequence identification
title_sort dnabarcoder: an open‐source software package for analysing and predicting dna sequence similarity cutoffs for fungal sequence identification
topic RESOURCE ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9542245/
https://www.ncbi.nlm.nih.gov/pubmed/35621380
http://dx.doi.org/10.1111/1755-0998.13651
work_keys_str_mv AT vuduong dnabarcoderanopensourcesoftwarepackageforanalysingandpredictingdnasequencesimilaritycutoffsforfungalsequenceidentification
AT nilssonrhenrik dnabarcoderanopensourcesoftwarepackageforanalysingandpredictingdnasequencesimilaritycutoffsforfungalsequenceidentification
AT verkleygerardjm dnabarcoderanopensourcesoftwarepackageforanalysingandpredictingdnasequencesimilaritycutoffsforfungalsequenceidentification