Cargando…

HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis

INTRODUCTION: The fungi ITS sequence length dissimilarity, non-specific amplicons, including chimaera formed during Polymerase Chain Reaction (PCR), added to sequencing errors, create bias during similarity clustering and abundance estimation in the downstream analysis. To overcome these challenges,...

Descripción completa

Detalles Bibliográficos
Autores principales: Mlaga, Kodjovi D., Mathieu, Alban, Beauparlant, Charles Joly, Ott, Alban, Khodr, Ahmad, Perin, Olivier, Droit, Arnaud
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8134036/
https://www.ncbi.nlm.nih.gov/pubmed/34025601
http://dx.doi.org/10.3389/fmicb.2021.640693
_version_ 1783695152653533184
author Mlaga, Kodjovi D.
Mathieu, Alban
Beauparlant, Charles Joly
Ott, Alban
Khodr, Ahmad
Perin, Olivier
Droit, Arnaud
author_facet Mlaga, Kodjovi D.
Mathieu, Alban
Beauparlant, Charles Joly
Ott, Alban
Khodr, Ahmad
Perin, Olivier
Droit, Arnaud
author_sort Mlaga, Kodjovi D.
collection PubMed
description INTRODUCTION: The fungi ITS sequence length dissimilarity, non-specific amplicons, including chimaera formed during Polymerase Chain Reaction (PCR), added to sequencing errors, create bias during similarity clustering and abundance estimation in the downstream analysis. To overcome these challenges, we present a novel approach, Hierarchical Clustering with Kraken (HCK), to classify ITS1 amplicons and Abundance-Base Alternative Approach (ABAA) pipeline to detect and filter non-specific amplicons in fungi metabarcoding sequencing datasets. MATERIALS AND METHODS: We compared the performances of both pipelines against QIIME, KRAKEN, and DADA2 using publicly available fungi ITS mock community datasets and using BLASTn as a reference. We calculated the Precision, Recall, F-score using the True-Positive, False-positive, and False-negative estimation. Alpha diversity (Chao1 and Shannon metrics) was also used to evaluate the diversity estimation of our method. RESULTS: The analysis shows that ABAA reduced the number of false-positive with all metabarcoding methods tested, and HCK increases precision and recall. HCK, coupled with ABAA, improves the F-score and bring alpha diversity metric value close to that of the BLASTn alpha diversity values when compared to QIIME, KRAKEN, and DADA2. CONCLUSION: The developed HCK-ABAA approach allows better identification of the fungi community structures while avoiding use of a reference database for non-specific amplicons filtration. It results in a more robust and stable methodology over time. The software can be downloaded on the following link: https://bitbucket.org/GottySG36/hck/src/master/.
format Online
Article
Text
id pubmed-8134036
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81340362021-05-21 HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis Mlaga, Kodjovi D. Mathieu, Alban Beauparlant, Charles Joly Ott, Alban Khodr, Ahmad Perin, Olivier Droit, Arnaud Front Microbiol Microbiology INTRODUCTION: The fungi ITS sequence length dissimilarity, non-specific amplicons, including chimaera formed during Polymerase Chain Reaction (PCR), added to sequencing errors, create bias during similarity clustering and abundance estimation in the downstream analysis. To overcome these challenges, we present a novel approach, Hierarchical Clustering with Kraken (HCK), to classify ITS1 amplicons and Abundance-Base Alternative Approach (ABAA) pipeline to detect and filter non-specific amplicons in fungi metabarcoding sequencing datasets. MATERIALS AND METHODS: We compared the performances of both pipelines against QIIME, KRAKEN, and DADA2 using publicly available fungi ITS mock community datasets and using BLASTn as a reference. We calculated the Precision, Recall, F-score using the True-Positive, False-positive, and False-negative estimation. Alpha diversity (Chao1 and Shannon metrics) was also used to evaluate the diversity estimation of our method. RESULTS: The analysis shows that ABAA reduced the number of false-positive with all metabarcoding methods tested, and HCK increases precision and recall. HCK, coupled with ABAA, improves the F-score and bring alpha diversity metric value close to that of the BLASTn alpha diversity values when compared to QIIME, KRAKEN, and DADA2. CONCLUSION: The developed HCK-ABAA approach allows better identification of the fungi community structures while avoiding use of a reference database for non-specific amplicons filtration. It results in a more robust and stable methodology over time. The software can be downloaded on the following link: https://bitbucket.org/GottySG36/hck/src/master/. Frontiers Media S.A. 2021-05-05 /pmc/articles/PMC8134036/ /pubmed/34025601 http://dx.doi.org/10.3389/fmicb.2021.640693 Text en Copyright © 2021 Mlaga, Mathieu, Beauparlant, Ott, Khodr, Perin and Droit. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Mlaga, Kodjovi D.
Mathieu, Alban
Beauparlant, Charles Joly
Ott, Alban
Khodr, Ahmad
Perin, Olivier
Droit, Arnaud
HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis
title HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis
title_full HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis
title_fullStr HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis
title_full_unstemmed HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis
title_short HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis
title_sort hck and abaa: a newly designed pipeline to improve fungi metabarcoding analysis
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8134036/
https://www.ncbi.nlm.nih.gov/pubmed/34025601
http://dx.doi.org/10.3389/fmicb.2021.640693
work_keys_str_mv AT mlagakodjovid hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis
AT mathieualban hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis
AT beauparlantcharlesjoly hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis
AT ottalban hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis
AT khodrahmad hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis
AT perinolivier hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis
AT droitarnaud hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis