Cargando…

Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments

High‐throughput sequencing has the potential to describe biological communities with high efficiency yet comprehensive assessment of diversity with species‐level resolution remains one of the most challenging aspects of metabarcoding studies. We investigated the utility of curated ribosomal and mito...

Descripción completa

Detalles Bibliográficos
Autores principales: Macheriotou, Lara, Guilini, Katja, Bezerra, Tania Nara, Tytgat, Bjorn, Nguyen, Dinh Tu, Phuong Nguyen, Thi Xuan, Noppe, Febe, Armenteros, Maickel, Boufahja, Fehmi, Rigaux, Annelien, Vanreusel, Ann, Derycke, Sofie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6374678/
https://www.ncbi.nlm.nih.gov/pubmed/30805154
http://dx.doi.org/10.1002/ece3.4814
_version_ 1783395212520849408
author Macheriotou, Lara
Guilini, Katja
Bezerra, Tania Nara
Tytgat, Bjorn
Nguyen, Dinh Tu
Phuong Nguyen, Thi Xuan
Noppe, Febe
Armenteros, Maickel
Boufahja, Fehmi
Rigaux, Annelien
Vanreusel, Ann
Derycke, Sofie
author_facet Macheriotou, Lara
Guilini, Katja
Bezerra, Tania Nara
Tytgat, Bjorn
Nguyen, Dinh Tu
Phuong Nguyen, Thi Xuan
Noppe, Febe
Armenteros, Maickel
Boufahja, Fehmi
Rigaux, Annelien
Vanreusel, Ann
Derycke, Sofie
author_sort Macheriotou, Lara
collection PubMed
description High‐throughput sequencing has the potential to describe biological communities with high efficiency yet comprehensive assessment of diversity with species‐level resolution remains one of the most challenging aspects of metabarcoding studies. We investigated the utility of curated ribosomal and mitochondrial nematode reference sequence databases for determining phylum‐specific species‐level clustering thresholds. We compiled 438 ribosomal and 290 mitochondrial sequences which identified 99% and 94% as the species delineation clustering threshold, respectively. These thresholds were evaluated in HTS data from mock communities containing 39 nematode species as well as environmental samples from Vietnam. We compared the taxonomic description of the mocks generated by two read‐merging and two clustering algorithms and the cluster‐free Dada2 pipeline. Taxonomic assignment with the RDP classifier was assessed under different training sets. Our results showed that 36/39 mock nematode species were identified across the molecular markers (18S: 32, JB2: 19, JB3: 21) in UClust_ref OTUs at their respective clustering thresholds, outperforming UParse_denovo and the commonly used 97% similarity. Dada2 generated the most realistic number of ASVs (18S: 83, JB2: 75, JB3: 82), collectively identifying 30/39 mock species. The ribosomal marker outperformed the mitochondrial markers in terms of species and genus‐level detections for both OTUs and ASVs. The number of taxonomic assignments of OTUs/ASVs was highest when the smallest reference database containing only nematode sequences was used and when sequences were truncated to the respective amplicon length. Overall, OTUs generated more species‐level detections, which were, however, associated with higher error rates compared to ASVs. Genus‐level assignments using ASVs exhibited higher accuracy and lower error rates compared to species‐level assignments, suggesting that this is the most reliable pipeline for rapid assessment of alpha diversity from environmental samples.
format Online
Article
Text
id pubmed-6374678
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-63746782019-02-25 Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments Macheriotou, Lara Guilini, Katja Bezerra, Tania Nara Tytgat, Bjorn Nguyen, Dinh Tu Phuong Nguyen, Thi Xuan Noppe, Febe Armenteros, Maickel Boufahja, Fehmi Rigaux, Annelien Vanreusel, Ann Derycke, Sofie Ecol Evol Original Research High‐throughput sequencing has the potential to describe biological communities with high efficiency yet comprehensive assessment of diversity with species‐level resolution remains one of the most challenging aspects of metabarcoding studies. We investigated the utility of curated ribosomal and mitochondrial nematode reference sequence databases for determining phylum‐specific species‐level clustering thresholds. We compiled 438 ribosomal and 290 mitochondrial sequences which identified 99% and 94% as the species delineation clustering threshold, respectively. These thresholds were evaluated in HTS data from mock communities containing 39 nematode species as well as environmental samples from Vietnam. We compared the taxonomic description of the mocks generated by two read‐merging and two clustering algorithms and the cluster‐free Dada2 pipeline. Taxonomic assignment with the RDP classifier was assessed under different training sets. Our results showed that 36/39 mock nematode species were identified across the molecular markers (18S: 32, JB2: 19, JB3: 21) in UClust_ref OTUs at their respective clustering thresholds, outperforming UParse_denovo and the commonly used 97% similarity. Dada2 generated the most realistic number of ASVs (18S: 83, JB2: 75, JB3: 82), collectively identifying 30/39 mock species. The ribosomal marker outperformed the mitochondrial markers in terms of species and genus‐level detections for both OTUs and ASVs. The number of taxonomic assignments of OTUs/ASVs was highest when the smallest reference database containing only nematode sequences was used and when sequences were truncated to the respective amplicon length. Overall, OTUs generated more species‐level detections, which were, however, associated with higher error rates compared to ASVs. Genus‐level assignments using ASVs exhibited higher accuracy and lower error rates compared to species‐level assignments, suggesting that this is the most reliable pipeline for rapid assessment of alpha diversity from environmental samples. John Wiley and Sons Inc. 2019-01-22 /pmc/articles/PMC6374678/ /pubmed/30805154 http://dx.doi.org/10.1002/ece3.4814 Text en © 2018 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Macheriotou, Lara
Guilini, Katja
Bezerra, Tania Nara
Tytgat, Bjorn
Nguyen, Dinh Tu
Phuong Nguyen, Thi Xuan
Noppe, Febe
Armenteros, Maickel
Boufahja, Fehmi
Rigaux, Annelien
Vanreusel, Ann
Derycke, Sofie
Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments
title Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments
title_full Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments
title_fullStr Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments
title_full_unstemmed Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments
title_short Metabarcoding free‐living marine nematodes using curated 18S and CO1 reference sequence databases for species‐level taxonomic assignments
title_sort metabarcoding free‐living marine nematodes using curated 18s and co1 reference sequence databases for species‐level taxonomic assignments
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6374678/
https://www.ncbi.nlm.nih.gov/pubmed/30805154
http://dx.doi.org/10.1002/ece3.4814
work_keys_str_mv AT macheriotoulara metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT guilinikatja metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT bezerratanianara metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT tytgatbjorn metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT nguyendinhtu metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT phuongnguyenthixuan metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT noppefebe metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT armenterosmaickel metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT boufahjafehmi metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT rigauxannelien metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT vanreuselann metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments
AT deryckesofie metabarcodingfreelivingmarinenematodesusingcurated18sandco1referencesequencedatabasesforspeciesleveltaxonomicassignments