Cargando…

App-SpaM: phylogenetic placement of short reads without sequence alignment

MOTIVATION: Phylogenetic placement is the task of placing a query sequence of unknown taxonomic origin into a given phylogenetic tree of a set of reference sequences. A major field of application of such methods is, for example, the taxonomic identification of reads in metabarcoding or metagenomic s...

Descripción completa

Detalles Bibliográficos
Autores principales: Blanke, Matthias, Morgenstern, Burkhard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710606/
https://www.ncbi.nlm.nih.gov/pubmed/36700102
http://dx.doi.org/10.1093/bioadv/vbab027
_version_ 1784841402013712384
author Blanke, Matthias
Morgenstern, Burkhard
author_facet Blanke, Matthias
Morgenstern, Burkhard
author_sort Blanke, Matthias
collection PubMed
description MOTIVATION: Phylogenetic placement is the task of placing a query sequence of unknown taxonomic origin into a given phylogenetic tree of a set of reference sequences. A major field of application of such methods is, for example, the taxonomic identification of reads in metabarcoding or metagenomic studies. Several approaches to phylogenetic placement have been proposed in recent years. The most accurate of them requires a multiple sequence alignment of the references as input. However, calculating multiple alignments is not only time-consuming but also limits the applicability of these approaches. RESULTS: Herein, we propose Alignment-free phylogenetic placement algorithm based on Spaced-word Matches (App-SpaM), an efficient algorithm for the phylogenetic placement of short sequencing reads on a tree of a set of reference sequences. App-SpaM produces results of high quality that are on a par with the best available approaches to phylogenetic placement, while our software is two orders of magnitude faster than these existing methods. Our approach neither requires a multiple alignment of the reference sequences nor alignments of the queries to the references. This enables App-SpaM to perform phylogenetic placement on a broad variety of datasets. AVAILABILITY AND IMPLEMENTATION: The source code of App-SpaM is freely available on Github at https://github.com/matthiasblanke/App-SpaM together with detailed instructions for installation and settings. App-SpaM is furthermore available as a Conda-package on the Bioconda channel. CONTACT: matthias.blanke@biologie.uni-goettingen.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-9710606
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97106062023-01-24 App-SpaM: phylogenetic placement of short reads without sequence alignment Blanke, Matthias Morgenstern, Burkhard Bioinform Adv Original Article MOTIVATION: Phylogenetic placement is the task of placing a query sequence of unknown taxonomic origin into a given phylogenetic tree of a set of reference sequences. A major field of application of such methods is, for example, the taxonomic identification of reads in metabarcoding or metagenomic studies. Several approaches to phylogenetic placement have been proposed in recent years. The most accurate of them requires a multiple sequence alignment of the references as input. However, calculating multiple alignments is not only time-consuming but also limits the applicability of these approaches. RESULTS: Herein, we propose Alignment-free phylogenetic placement algorithm based on Spaced-word Matches (App-SpaM), an efficient algorithm for the phylogenetic placement of short sequencing reads on a tree of a set of reference sequences. App-SpaM produces results of high quality that are on a par with the best available approaches to phylogenetic placement, while our software is two orders of magnitude faster than these existing methods. Our approach neither requires a multiple alignment of the reference sequences nor alignments of the queries to the references. This enables App-SpaM to perform phylogenetic placement on a broad variety of datasets. AVAILABILITY AND IMPLEMENTATION: The source code of App-SpaM is freely available on Github at https://github.com/matthiasblanke/App-SpaM together with detailed instructions for installation and settings. App-SpaM is furthermore available as a Conda-package on the Bioconda channel. CONTACT: matthias.blanke@biologie.uni-goettingen.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2021-10-13 /pmc/articles/PMC9710606/ /pubmed/36700102 http://dx.doi.org/10.1093/bioadv/vbab027 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Blanke, Matthias
Morgenstern, Burkhard
App-SpaM: phylogenetic placement of short reads without sequence alignment
title App-SpaM: phylogenetic placement of short reads without sequence alignment
title_full App-SpaM: phylogenetic placement of short reads without sequence alignment
title_fullStr App-SpaM: phylogenetic placement of short reads without sequence alignment
title_full_unstemmed App-SpaM: phylogenetic placement of short reads without sequence alignment
title_short App-SpaM: phylogenetic placement of short reads without sequence alignment
title_sort app-spam: phylogenetic placement of short reads without sequence alignment
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710606/
https://www.ncbi.nlm.nih.gov/pubmed/36700102
http://dx.doi.org/10.1093/bioadv/vbab027
work_keys_str_mv AT blankematthias appspamphylogeneticplacementofshortreadswithoutsequencealignment
AT morgensternburkhard appspamphylogeneticplacementofshortreadswithoutsequencealignment