Cargando…

PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics

MOTIVATION: Metagenomic and metatranscriptomic sequencing have become increasingly popular tools for producing massive amounts of short-read data, often used for the reconstruction of draft genomes or the detection of (active) genes in microbial communities. Unfortunately, sequence assemblies of suc...

Descripción completa

Detalles Bibliográficos
Autores principales:	Schön, Max E, Eme, Laura, Ettema, Thijs J G
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2020
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703773/ https://www.ncbi.nlm.nih.gov/pubmed/31647547 http://dx.doi.org/10.1093/bioinformatics/btz799

_version_	1783616693089599488
author	Schön, Max E Eme, Laura Ettema, Thijs J G
author_facet	Schön, Max E Eme, Laura Ettema, Thijs J G
author_sort	Schön, Max E
collection	PubMed
description	MOTIVATION: Metagenomic and metatranscriptomic sequencing have become increasingly popular tools for producing massive amounts of short-read data, often used for the reconstruction of draft genomes or the detection of (active) genes in microbial communities. Unfortunately, sequence assemblies of such datasets generally remain a computationally challenging task. Frequently, researchers are only interested in a specific group of organisms or genes; yet, the assembly of multiple datasets only to identify candidate sequences for a specific question is sometimes prohibitively slow, forcing researchers to select a subset of available datasets to address their question. Here, we present PhyloMagnet, a workflow to screen meta-omics datasets for taxa and genes of interest using gene-centric assembly and phylogenetic placement of sequences. RESULTS: Using PhyloMagnet, we could identify up to 87% of the genera in an in vitro mock community with variable abundances, while the false positive predictions per single gene tree ranged from 0 to 23%. When applied to a group of metagenomes for which a set of metagenome assembled genomes (MAGs) have been published, we could detect the majority of the taxonomic labels that the MAGs had been annotated with. In a metatranscriptomic setting, the phylogenetic placement of assembled contigs corresponds to that of transcripts obtained from transcriptome assembly. AVAILABILITY AND IMPLEMENTATION: PhyloMagnet is built using Nextflow, available at github.com/maxemil/PhyloMagnet and is developed and tested on Linux. It is released under the open source GNU GPL licence and documentation is available at phylomagnet.readthedocs.io. Version 0.5 of PhyloMagnet was used for all benchmarking experiments. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format	Online Article Text
id	pubmed-7703773
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-77037732020-12-07 PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics Schön, Max E Eme, Laura Ettema, Thijs J G Bioinformatics Original Papers MOTIVATION: Metagenomic and metatranscriptomic sequencing have become increasingly popular tools for producing massive amounts of short-read data, often used for the reconstruction of draft genomes or the detection of (active) genes in microbial communities. Unfortunately, sequence assemblies of such datasets generally remain a computationally challenging task. Frequently, researchers are only interested in a specific group of organisms or genes; yet, the assembly of multiple datasets only to identify candidate sequences for a specific question is sometimes prohibitively slow, forcing researchers to select a subset of available datasets to address their question. Here, we present PhyloMagnet, a workflow to screen meta-omics datasets for taxa and genes of interest using gene-centric assembly and phylogenetic placement of sequences. RESULTS: Using PhyloMagnet, we could identify up to 87% of the genera in an in vitro mock community with variable abundances, while the false positive predictions per single gene tree ranged from 0 to 23%. When applied to a group of metagenomes for which a set of metagenome assembled genomes (MAGs) have been published, we could detect the majority of the taxonomic labels that the MAGs had been annotated with. In a metatranscriptomic setting, the phylogenetic placement of assembled contigs corresponds to that of transcripts obtained from transcriptome assembly. AVAILABILITY AND IMPLEMENTATION: PhyloMagnet is built using Nextflow, available at github.com/maxemil/PhyloMagnet and is developed and tested on Linux. It is released under the open source GNU GPL licence and documentation is available at phylomagnet.readthedocs.io. Version 0.5 of PhyloMagnet was used for all benchmarking experiments. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-03-15 2019-10-24 /pmc/articles/PMC7703773/ /pubmed/31647547 http://dx.doi.org/10.1093/bioinformatics/btz799 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Papers Schön, Max E Eme, Laura Ettema, Thijs J G PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics
title	PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics
title_full	PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics
title_fullStr	PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics
title_full_unstemmed	PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics
title_short	PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics
title_sort	phylomagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703773/ https://www.ncbi.nlm.nih.gov/pubmed/31647547 http://dx.doi.org/10.1093/bioinformatics/btz799
work_keys_str_mv	AT schonmaxe phylomagnetfastandaccuratescreeningofshortreadmetaomicsdatausinggenecentricphylogenetics AT emelaura phylomagnetfastandaccuratescreeningofshortreadmetaomicsdatausinggenecentricphylogenetics AT ettemathijsjg phylomagnetfastandaccuratescreeningofshortreadmetaomicsdatausinggenecentricphylogenetics

PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics

Ejemplares similares