Cargando…

Searching for virus phylotypes

Motivation: Large phylogenies are being built today to study virus evolution, trace the origin of epidemics, establish the mode of transmission and survey the appearance of drug resistance. However, no tool is available to quickly inspect these phylogenies and combine them with extrinsic traits (e.g...

Descripción completa

Detalles Bibliográficos
Autores principales: Chevenet, François, Jung, Matthieu, Peeters, Martine, de Oliveira, Tulio, Gascuel, Olivier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3582263/
https://www.ncbi.nlm.nih.gov/pubmed/23329414
http://dx.doi.org/10.1093/bioinformatics/btt010
_version_ 1782260550520012800
author Chevenet, François
Jung, Matthieu
Peeters, Martine
de Oliveira, Tulio
Gascuel, Olivier
author_facet Chevenet, François
Jung, Matthieu
Peeters, Martine
de Oliveira, Tulio
Gascuel, Olivier
author_sort Chevenet, François
collection PubMed
description Motivation: Large phylogenies are being built today to study virus evolution, trace the origin of epidemics, establish the mode of transmission and survey the appearance of drug resistance. However, no tool is available to quickly inspect these phylogenies and combine them with extrinsic traits (e.g. geographic location, risk group, presence of a given resistance mutation), seeking to extract strain groups of specific interest or requiring surveillance. Results: We propose a new method for obtaining such groups, which we call phylotypes, from a phylogeny having taxa (strains) annotated with extrinsic traits. Phylotypes are subsets of taxa with close phylogenetic relationships and common trait values. The method combines ancestral trait reconstruction using parsimony, with combinatorial and numerical criteria measuring tree shape characteristics and the diversity and separation of the potential phylotypes. A shuffling procedure is used to assess the statistical significance of phylotypes. All algorithms have linear time complexity. This results in low computing times, typically a few minutes for the larger data sets with a number of shuffling steps. Two HIV-1 data sets are analyzed, one of which is large, containing >3000 strains of HIV-1 subtype C collected worldwide, where the method shows its ability to recover known clusters and transmission routes, and to detect new ones. Availability: This method and companion tools are implemented in an interactive Web interface (www.phylotype.org), which provides a wide choice of graphical views and output formats, and allows for exploratory analyses of large data sets. Contact: francois.chevenet@ird.fr, gascuel@lirmm.fr Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3582263
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35822632013-02-26 Searching for virus phylotypes Chevenet, François Jung, Matthieu Peeters, Martine de Oliveira, Tulio Gascuel, Olivier Bioinformatics Original Papers Motivation: Large phylogenies are being built today to study virus evolution, trace the origin of epidemics, establish the mode of transmission and survey the appearance of drug resistance. However, no tool is available to quickly inspect these phylogenies and combine them with extrinsic traits (e.g. geographic location, risk group, presence of a given resistance mutation), seeking to extract strain groups of specific interest or requiring surveillance. Results: We propose a new method for obtaining such groups, which we call phylotypes, from a phylogeny having taxa (strains) annotated with extrinsic traits. Phylotypes are subsets of taxa with close phylogenetic relationships and common trait values. The method combines ancestral trait reconstruction using parsimony, with combinatorial and numerical criteria measuring tree shape characteristics and the diversity and separation of the potential phylotypes. A shuffling procedure is used to assess the statistical significance of phylotypes. All algorithms have linear time complexity. This results in low computing times, typically a few minutes for the larger data sets with a number of shuffling steps. Two HIV-1 data sets are analyzed, one of which is large, containing >3000 strains of HIV-1 subtype C collected worldwide, where the method shows its ability to recover known clusters and transmission routes, and to detect new ones. Availability: This method and companion tools are implemented in an interactive Web interface (www.phylotype.org), which provides a wide choice of graphical views and output formats, and allows for exploratory analyses of large data sets. Contact: francois.chevenet@ird.fr, gascuel@lirmm.fr Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-03-01 2013-01-17 /pmc/articles/PMC3582263/ /pubmed/23329414 http://dx.doi.org/10.1093/bioinformatics/btt010 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Chevenet, François
Jung, Matthieu
Peeters, Martine
de Oliveira, Tulio
Gascuel, Olivier
Searching for virus phylotypes
title Searching for virus phylotypes
title_full Searching for virus phylotypes
title_fullStr Searching for virus phylotypes
title_full_unstemmed Searching for virus phylotypes
title_short Searching for virus phylotypes
title_sort searching for virus phylotypes
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3582263/
https://www.ncbi.nlm.nih.gov/pubmed/23329414
http://dx.doi.org/10.1093/bioinformatics/btt010
work_keys_str_mv AT chevenetfrancois searchingforvirusphylotypes
AT jungmatthieu searchingforvirusphylotypes
AT peetersmartine searchingforvirusphylotypes
AT deoliveiratulio searchingforvirusphylotypes
AT gascuelolivier searchingforvirusphylotypes