Cargando…

APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data

SUMMARY: DNA metabarcoding is an emerging approach to assess and monitor biodiversity worldwide and consequently the number and size of data sets increases exponentially. To date, no published DNA metabarcoding data processing pipeline exists that is (i) platform independent, (ii) easy to use [incl....

Descripción completa

Detalles Bibliográficos
Autores principales: Buchner, Dominik, Macher, Till-Hendrik, Leese, Florian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9563694/
https://www.ncbi.nlm.nih.gov/pubmed/36029248
http://dx.doi.org/10.1093/bioinformatics/btac588
_version_ 1784808464989552640
author Buchner, Dominik
Macher, Till-Hendrik
Leese, Florian
author_facet Buchner, Dominik
Macher, Till-Hendrik
Leese, Florian
author_sort Buchner, Dominik
collection PubMed
description SUMMARY: DNA metabarcoding is an emerging approach to assess and monitor biodiversity worldwide and consequently the number and size of data sets increases exponentially. To date, no published DNA metabarcoding data processing pipeline exists that is (i) platform independent, (ii) easy to use [incl. graphical user interface (GUI)], (iii) fast (does scale well with dataset size) and (iv) complies with data protection regulations of e.g. environmental agencies. The presented pipeline APSCALE meets these requirements and handles the most common tasks of sequence data processing, such as paired-end merging, primer trimming, quality filtering, clustering and denoising of any popular metabarcoding marker, such as internal transcribed spacer, 16S or cytochrome c oxidase subunit I. APSCALE comes in a command line and a GUI version. The latter provides the user with additional summary statistics options and links to GUI-based downstream applications. AVAILABILITY AND IMPLEMENTATION: APSCALE is written in Python, a platform-independent language, and integrates functions of the open-source tools, VSEARCH (Rognes et al., 2016), cutadapt (Martin, 2011) and LULU (Frøslev et al., 2017). All modules support multithreading to allow fast processing of larger DNA metabarcoding datasets. Further information and troubleshooting are provided on the respective GitHub pages for the command-line version (https://github.com/DominikBuchner/apscale) and the GUI-based version (https://github.com/TillMacher/apscale_gui), including a detailed tutorial. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9563694
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-95636942022-10-18 APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data Buchner, Dominik Macher, Till-Hendrik Leese, Florian Bioinformatics Applications Notes SUMMARY: DNA metabarcoding is an emerging approach to assess and monitor biodiversity worldwide and consequently the number and size of data sets increases exponentially. To date, no published DNA metabarcoding data processing pipeline exists that is (i) platform independent, (ii) easy to use [incl. graphical user interface (GUI)], (iii) fast (does scale well with dataset size) and (iv) complies with data protection regulations of e.g. environmental agencies. The presented pipeline APSCALE meets these requirements and handles the most common tasks of sequence data processing, such as paired-end merging, primer trimming, quality filtering, clustering and denoising of any popular metabarcoding marker, such as internal transcribed spacer, 16S or cytochrome c oxidase subunit I. APSCALE comes in a command line and a GUI version. The latter provides the user with additional summary statistics options and links to GUI-based downstream applications. AVAILABILITY AND IMPLEMENTATION: APSCALE is written in Python, a platform-independent language, and integrates functions of the open-source tools, VSEARCH (Rognes et al., 2016), cutadapt (Martin, 2011) and LULU (Frøslev et al., 2017). All modules support multithreading to allow fast processing of larger DNA metabarcoding datasets. Further information and troubleshooting are provided on the respective GitHub pages for the command-line version (https://github.com/DominikBuchner/apscale) and the GUI-based version (https://github.com/TillMacher/apscale_gui), including a detailed tutorial. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-08-27 /pmc/articles/PMC9563694/ /pubmed/36029248 http://dx.doi.org/10.1093/bioinformatics/btac588 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Notes
Buchner, Dominik
Macher, Till-Hendrik
Leese, Florian
APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data
title APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data
title_full APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data
title_fullStr APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data
title_full_unstemmed APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data
title_short APSCALE: advanced pipeline for simple yet comprehensive analyses of DNA metabarcoding data
title_sort apscale: advanced pipeline for simple yet comprehensive analyses of dna metabarcoding data
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9563694/
https://www.ncbi.nlm.nih.gov/pubmed/36029248
http://dx.doi.org/10.1093/bioinformatics/btac588
work_keys_str_mv AT buchnerdominik apscaleadvancedpipelineforsimpleyetcomprehensiveanalysesofdnametabarcodingdata
AT machertillhendrik apscaleadvancedpipelineforsimpleyetcomprehensiveanalysesofdnametabarcodingdata
AT leeseflorian apscaleadvancedpipelineforsimpleyetcomprehensiveanalysesofdnametabarcodingdata