Cargando…

Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations

MOTIVATION: A long-standing limitation in comparative genomic studies is the dependency on a reference genome, which hinders the spectrum of genetic diversity that can be identified across a population of organisms. This is especially true in the microbial world where genome architectures can signif...

Descripción completa

Detalles Bibliográficos
Autores principales: Salazar, Alex N, Abeel, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129293/
https://www.ncbi.nlm.nih.gov/pubmed/30423098
http://dx.doi.org/10.1093/bioinformatics/bty614
_version_ 1783353775679864832
author Salazar, Alex N
Abeel, Thomas
author_facet Salazar, Alex N
Abeel, Thomas
author_sort Salazar, Alex N
collection PubMed
description MOTIVATION: A long-standing limitation in comparative genomic studies is the dependency on a reference genome, which hinders the spectrum of genetic diversity that can be identified across a population of organisms. This is especially true in the microbial world where genome architectures can significantly vary. There is therefore a need for computational methods that can simultaneously analyze the architectures of multiple genomes without introducing bias from a reference. RESULTS: In this article, we present Ptolemy: a novel method for studying the diversity of genome architectures—such as structural variation and pan-genomes—across a collection of microbial assemblies without the need of a reference. Ptolemy is a ‘top-down’ approach to compare whole genome assemblies. Genomes are represented as labeled multi-directed graphs—known as quivers—which are then merged into a single, canonical quiver by identifying ‘gene anchors’ via synteny analysis. The canonical quiver represents an approximate, structural alignment of all genomes in a given collection encoding structural variation across (sub-) populations within the collection. We highlight various applications of Ptolemy by analyzing structural variation and the pan-genomes of different datasets composing of Mycobacterium, Saccharomyces, Escherichia and Shigella species. Our results show that Ptolemy is flexible and can handle both conserved and highly dynamic genome architectures. Ptolemy is user-friendly—requires only FASTA-formatted assembly along with a corresponding GFF-formatted file—and resource-friendly—can align 24 genomes in ∼10 mins with four CPUs and <2 GB of RAM. AVAILABILITY AND IMPLEMENTATION: Github: https://github.com/AbeelLab/ptolemy SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6129293
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61292932018-09-12 Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations Salazar, Alex N Abeel, Thomas Bioinformatics Eccb 2018: European Conference on Computational Biology Proceedings MOTIVATION: A long-standing limitation in comparative genomic studies is the dependency on a reference genome, which hinders the spectrum of genetic diversity that can be identified across a population of organisms. This is especially true in the microbial world where genome architectures can significantly vary. There is therefore a need for computational methods that can simultaneously analyze the architectures of multiple genomes without introducing bias from a reference. RESULTS: In this article, we present Ptolemy: a novel method for studying the diversity of genome architectures—such as structural variation and pan-genomes—across a collection of microbial assemblies without the need of a reference. Ptolemy is a ‘top-down’ approach to compare whole genome assemblies. Genomes are represented as labeled multi-directed graphs—known as quivers—which are then merged into a single, canonical quiver by identifying ‘gene anchors’ via synteny analysis. The canonical quiver represents an approximate, structural alignment of all genomes in a given collection encoding structural variation across (sub-) populations within the collection. We highlight various applications of Ptolemy by analyzing structural variation and the pan-genomes of different datasets composing of Mycobacterium, Saccharomyces, Escherichia and Shigella species. Our results show that Ptolemy is flexible and can handle both conserved and highly dynamic genome architectures. Ptolemy is user-friendly—requires only FASTA-formatted assembly along with a corresponding GFF-formatted file—and resource-friendly—can align 24 genomes in ∼10 mins with four CPUs and <2 GB of RAM. AVAILABILITY AND IMPLEMENTATION: Github: https://github.com/AbeelLab/ptolemy SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-09-01 2018-09-08 /pmc/articles/PMC6129293/ /pubmed/30423098 http://dx.doi.org/10.1093/bioinformatics/bty614 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2018: European Conference on Computational Biology Proceedings
Salazar, Alex N
Abeel, Thomas
Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations
title Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations
title_full Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations
title_fullStr Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations
title_full_unstemmed Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations
title_short Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations
title_sort approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations
topic Eccb 2018: European Conference on Computational Biology Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129293/
https://www.ncbi.nlm.nih.gov/pubmed/30423098
http://dx.doi.org/10.1093/bioinformatics/bty614
work_keys_str_mv AT salazaralexn approximatesimultaneouscomparisonofmicrobialgenomearchitecturesviasyntenicanchoringofquiverrepresentations
AT abeelthomas approximatesimultaneouscomparisonofmicrobialgenomearchitecturesviasyntenicanchoringofquiverrepresentations