Cargando…

Computational Framework for High-Quality Production and Large-Scale Evolutionary Analysis of Metagenome Assembled Genomes

Microbial species play important roles in different environments and the production of high-quality genomes from metagenome data sets represents a major obstacle to understanding their ecological and evolutionary dynamics. Metagenome-Assembled Genomes Orchestra (MAGO) is a computational framework th...

Descripción completa

Detalles Bibliográficos
Autores principales: Murovec, Boštjan, Deutsch, Leon, Stres, Blaz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6993843/
https://www.ncbi.nlm.nih.gov/pubmed/31633780
http://dx.doi.org/10.1093/molbev/msz237
_version_ 1783493113338134528
author Murovec, Boštjan
Deutsch, Leon
Stres, Blaz
author_facet Murovec, Boštjan
Deutsch, Leon
Stres, Blaz
author_sort Murovec, Boštjan
collection PubMed
description Microbial species play important roles in different environments and the production of high-quality genomes from metagenome data sets represents a major obstacle to understanding their ecological and evolutionary dynamics. Metagenome-Assembled Genomes Orchestra (MAGO) is a computational framework that integrates and simplifies metagenome assembly, binning, bin improvement, bin quality (completeness and contamination), bin annotation, and evolutionary placement of bins via detailed maximum-likelihood phylogeny based on multiple marker genes using different amino acid substitution models, next to average nucleotide identity analysis of genomes for delineation of species boundaries and operational taxonomic units. MAGO offers streamlined execution of the entire metagenomics pipeline, error checking, computational resource distribution and compatibility of data formats, governed by user-tailored pipeline processing. MAGO is an open-source-software package released in three different ways, as a singularity image and a Docker container for HPC purposes as well as for running MAGO on a commodity hardware, and a virtual machine for gaining a full access to MAGO underlying structure and source code. MAGO is open to suggestions for extensions and is amenable for use in both research and teaching of genomics and molecular evolution of genomes assembled from small single-cell projects or large-scale and complex environmental metagenomes.
format Online
Article
Text
id pubmed-6993843
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-69938432020-02-05 Computational Framework for High-Quality Production and Large-Scale Evolutionary Analysis of Metagenome Assembled Genomes Murovec, Boštjan Deutsch, Leon Stres, Blaz Mol Biol Evol Resources Microbial species play important roles in different environments and the production of high-quality genomes from metagenome data sets represents a major obstacle to understanding their ecological and evolutionary dynamics. Metagenome-Assembled Genomes Orchestra (MAGO) is a computational framework that integrates and simplifies metagenome assembly, binning, bin improvement, bin quality (completeness and contamination), bin annotation, and evolutionary placement of bins via detailed maximum-likelihood phylogeny based on multiple marker genes using different amino acid substitution models, next to average nucleotide identity analysis of genomes for delineation of species boundaries and operational taxonomic units. MAGO offers streamlined execution of the entire metagenomics pipeline, error checking, computational resource distribution and compatibility of data formats, governed by user-tailored pipeline processing. MAGO is an open-source-software package released in three different ways, as a singularity image and a Docker container for HPC purposes as well as for running MAGO on a commodity hardware, and a virtual machine for gaining a full access to MAGO underlying structure and source code. MAGO is open to suggestions for extensions and is amenable for use in both research and teaching of genomics and molecular evolution of genomes assembled from small single-cell projects or large-scale and complex environmental metagenomes. Oxford University Press 2020-02 2019-10-21 /pmc/articles/PMC6993843/ /pubmed/31633780 http://dx.doi.org/10.1093/molbev/msz237 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Resources
Murovec, Boštjan
Deutsch, Leon
Stres, Blaz
Computational Framework for High-Quality Production and Large-Scale Evolutionary Analysis of Metagenome Assembled Genomes
title Computational Framework for High-Quality Production and Large-Scale Evolutionary Analysis of Metagenome Assembled Genomes
title_full Computational Framework for High-Quality Production and Large-Scale Evolutionary Analysis of Metagenome Assembled Genomes
title_fullStr Computational Framework for High-Quality Production and Large-Scale Evolutionary Analysis of Metagenome Assembled Genomes
title_full_unstemmed Computational Framework for High-Quality Production and Large-Scale Evolutionary Analysis of Metagenome Assembled Genomes
title_short Computational Framework for High-Quality Production and Large-Scale Evolutionary Analysis of Metagenome Assembled Genomes
title_sort computational framework for high-quality production and large-scale evolutionary analysis of metagenome assembled genomes
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6993843/
https://www.ncbi.nlm.nih.gov/pubmed/31633780
http://dx.doi.org/10.1093/molbev/msz237
work_keys_str_mv AT murovecbostjan computationalframeworkforhighqualityproductionandlargescaleevolutionaryanalysisofmetagenomeassembledgenomes
AT deutschleon computationalframeworkforhighqualityproductionandlargescaleevolutionaryanalysisofmetagenomeassembledgenomes
AT stresblaz computationalframeworkforhighqualityproductionandlargescaleevolutionaryanalysisofmetagenomeassembledgenomes