Cargando…
HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis
INTRODUCTION: The fungi ITS sequence length dissimilarity, non-specific amplicons, including chimaera formed during Polymerase Chain Reaction (PCR), added to sequencing errors, create bias during similarity clustering and abundance estimation in the downstream analysis. To overcome these challenges,...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8134036/ https://www.ncbi.nlm.nih.gov/pubmed/34025601 http://dx.doi.org/10.3389/fmicb.2021.640693 |
_version_ | 1783695152653533184 |
---|---|
author | Mlaga, Kodjovi D. Mathieu, Alban Beauparlant, Charles Joly Ott, Alban Khodr, Ahmad Perin, Olivier Droit, Arnaud |
author_facet | Mlaga, Kodjovi D. Mathieu, Alban Beauparlant, Charles Joly Ott, Alban Khodr, Ahmad Perin, Olivier Droit, Arnaud |
author_sort | Mlaga, Kodjovi D. |
collection | PubMed |
description | INTRODUCTION: The fungi ITS sequence length dissimilarity, non-specific amplicons, including chimaera formed during Polymerase Chain Reaction (PCR), added to sequencing errors, create bias during similarity clustering and abundance estimation in the downstream analysis. To overcome these challenges, we present a novel approach, Hierarchical Clustering with Kraken (HCK), to classify ITS1 amplicons and Abundance-Base Alternative Approach (ABAA) pipeline to detect and filter non-specific amplicons in fungi metabarcoding sequencing datasets. MATERIALS AND METHODS: We compared the performances of both pipelines against QIIME, KRAKEN, and DADA2 using publicly available fungi ITS mock community datasets and using BLASTn as a reference. We calculated the Precision, Recall, F-score using the True-Positive, False-positive, and False-negative estimation. Alpha diversity (Chao1 and Shannon metrics) was also used to evaluate the diversity estimation of our method. RESULTS: The analysis shows that ABAA reduced the number of false-positive with all metabarcoding methods tested, and HCK increases precision and recall. HCK, coupled with ABAA, improves the F-score and bring alpha diversity metric value close to that of the BLASTn alpha diversity values when compared to QIIME, KRAKEN, and DADA2. CONCLUSION: The developed HCK-ABAA approach allows better identification of the fungi community structures while avoiding use of a reference database for non-specific amplicons filtration. It results in a more robust and stable methodology over time. The software can be downloaded on the following link: https://bitbucket.org/GottySG36/hck/src/master/. |
format | Online Article Text |
id | pubmed-8134036 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81340362021-05-21 HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis Mlaga, Kodjovi D. Mathieu, Alban Beauparlant, Charles Joly Ott, Alban Khodr, Ahmad Perin, Olivier Droit, Arnaud Front Microbiol Microbiology INTRODUCTION: The fungi ITS sequence length dissimilarity, non-specific amplicons, including chimaera formed during Polymerase Chain Reaction (PCR), added to sequencing errors, create bias during similarity clustering and abundance estimation in the downstream analysis. To overcome these challenges, we present a novel approach, Hierarchical Clustering with Kraken (HCK), to classify ITS1 amplicons and Abundance-Base Alternative Approach (ABAA) pipeline to detect and filter non-specific amplicons in fungi metabarcoding sequencing datasets. MATERIALS AND METHODS: We compared the performances of both pipelines against QIIME, KRAKEN, and DADA2 using publicly available fungi ITS mock community datasets and using BLASTn as a reference. We calculated the Precision, Recall, F-score using the True-Positive, False-positive, and False-negative estimation. Alpha diversity (Chao1 and Shannon metrics) was also used to evaluate the diversity estimation of our method. RESULTS: The analysis shows that ABAA reduced the number of false-positive with all metabarcoding methods tested, and HCK increases precision and recall. HCK, coupled with ABAA, improves the F-score and bring alpha diversity metric value close to that of the BLASTn alpha diversity values when compared to QIIME, KRAKEN, and DADA2. CONCLUSION: The developed HCK-ABAA approach allows better identification of the fungi community structures while avoiding use of a reference database for non-specific amplicons filtration. It results in a more robust and stable methodology over time. The software can be downloaded on the following link: https://bitbucket.org/GottySG36/hck/src/master/. Frontiers Media S.A. 2021-05-05 /pmc/articles/PMC8134036/ /pubmed/34025601 http://dx.doi.org/10.3389/fmicb.2021.640693 Text en Copyright © 2021 Mlaga, Mathieu, Beauparlant, Ott, Khodr, Perin and Droit. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology Mlaga, Kodjovi D. Mathieu, Alban Beauparlant, Charles Joly Ott, Alban Khodr, Ahmad Perin, Olivier Droit, Arnaud HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis |
title | HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis |
title_full | HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis |
title_fullStr | HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis |
title_full_unstemmed | HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis |
title_short | HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis |
title_sort | hck and abaa: a newly designed pipeline to improve fungi metabarcoding analysis |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8134036/ https://www.ncbi.nlm.nih.gov/pubmed/34025601 http://dx.doi.org/10.3389/fmicb.2021.640693 |
work_keys_str_mv | AT mlagakodjovid hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis AT mathieualban hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis AT beauparlantcharlesjoly hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis AT ottalban hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis AT khodrahmad hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis AT perinolivier hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis AT droitarnaud hckandabaaanewlydesignedpipelinetoimprovefungimetabarcodinganalysis |