Cargando…

Evidente—a visual analytics tool for data enrichment in SNP-based phylogenetic trees

MOTIVATION: A common practice in the analysis of pathogens and their strains is using single-nucleotide polymorphisms (SNPs) to reconstruct their evolutionary history. However, genome-wide SNP-based phylogenetic trees are rarely analyzed without any further information. Including the underlying SNP...

Descripción completa

Detalles Bibliográficos
Autores principales: Witte Paz, Mathias, Harbig, Theresa A, Nieselt, Kay
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710622/
https://www.ncbi.nlm.nih.gov/pubmed/36699377
http://dx.doi.org/10.1093/bioadv/vbac075
Descripción
Sumario:MOTIVATION: A common practice in the analysis of pathogens and their strains is using single-nucleotide polymorphisms (SNPs) to reconstruct their evolutionary history. However, genome-wide SNP-based phylogenetic trees are rarely analyzed without any further information. Including the underlying SNP data together with further metadata on the respective samples in the exploration process can facilitate linking the genomic and phenotypic properties of the samples. RESULTS: We introduce Efficient VIsual analytics tool for Data ENrichment in phylogenetic TreEs (Evidente), a web-application that provides an interactive visual analysis interface for the simultaneous interrogation of phylogenetic relationships, genome-wide SNP data and metadata for samples of an organism. Besides visualizing the phylogenetic tree, Evidente classifies SNPs as supporting or non-supporting of the tree structures and shows the distribution of both types of SNPs among samples and clades of interest. Furthermore, additional metadata can be included in the visualization. Lastly, Evidente includes an enrichment analysis to identify over-represented genomic features encoded by GO-terms within the clades of the tree. We demonstrate the usability of Evidente with the data of the pathogens Treponema pallidum and Mycobacterium leprae. AVAILABILITY AND IMPLEMENTATION: Evidente is available at the TueVis visualization web server at https://evidente-tuevis.cs.uni-tuebingen.de/, it can also be run locally. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.