Cargando…

Identification of Hidden Population Structure in Time-Scaled Phylogenies

Population structure influences genealogical patterns, however, data pertaining to how populations are structured are often unavailable or not directly observable. Inference of population structure is highly important in molecular epidemiology where pathogen phylogenetics is increasingly used to inf...

Descripción completa

Detalles Bibliográficos
Autores principales: Volz, Erik M, Carsten, Wiuf, Grad, Yonatan H, Frost, Simon D W, Dennis, Ann M, Didelot, Xavier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8559910/
https://www.ncbi.nlm.nih.gov/pubmed/32049340
http://dx.doi.org/10.1093/sysbio/syaa009
_version_ 1784592837124292608
author Volz, Erik M
Carsten, Wiuf
Grad, Yonatan H
Frost, Simon D W
Dennis, Ann M
Didelot, Xavier
author_facet Volz, Erik M
Carsten, Wiuf
Grad, Yonatan H
Frost, Simon D W
Dennis, Ann M
Didelot, Xavier
author_sort Volz, Erik M
collection PubMed
description Population structure influences genealogical patterns, however, data pertaining to how populations are structured are often unavailable or not directly observable. Inference of population structure is highly important in molecular epidemiology where pathogen phylogenetics is increasingly used to infer transmission patterns and detect outbreaks. Discrepancies between observed and idealized genealogies, such as those generated by the coalescent process, can be quantified, and where significant differences occur, may reveal the action of natural selection, host population structure, or other demographic and epidemiological heterogeneities. We have developed a fast non-parametric statistical test for detection of cryptic population structure in time-scaled phylogenetic trees. The test is based on contrasting estimated phylogenies with the theoretically expected phylodynamic ordering of common ancestors in two clades within a coalescent framework. These statistical tests have also motivated the development of algorithms which can be used to quickly screen a phylogenetic tree for clades which are likely to share a distinct demographic or epidemiological history. Epidemiological applications include identification of outbreaks in vulnerable host populations or rapid expansion of genotypes with a fitness advantage. To demonstrate the utility of these methods for outbreak detection, we applied the new methods to large phylogenies reconstructed from thousands of HIV-1 partial pol sequences. This revealed the presence of clades which had grown rapidly in the recent past and was significantly concentrated in young men, suggesting recent and rapid transmission in that group. Furthermore, to demonstrate the utility of these methods for the study of antimicrobial resistance, we applied the new methods to a large phylogeny reconstructed from whole genome Neisseria gonorrhoeae sequences. We find that population structure detected using these methods closely overlaps with the appearance and expansion of mutations conferring antimicrobial resistance. [Antimicrobial resistance; coalescent; HIV; population structure.]
format Online
Article
Text
id pubmed-8559910
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85599102021-11-02 Identification of Hidden Population Structure in Time-Scaled Phylogenies Volz, Erik M Carsten, Wiuf Grad, Yonatan H Frost, Simon D W Dennis, Ann M Didelot, Xavier Syst Biol Regular Articles Population structure influences genealogical patterns, however, data pertaining to how populations are structured are often unavailable or not directly observable. Inference of population structure is highly important in molecular epidemiology where pathogen phylogenetics is increasingly used to infer transmission patterns and detect outbreaks. Discrepancies between observed and idealized genealogies, such as those generated by the coalescent process, can be quantified, and where significant differences occur, may reveal the action of natural selection, host population structure, or other demographic and epidemiological heterogeneities. We have developed a fast non-parametric statistical test for detection of cryptic population structure in time-scaled phylogenetic trees. The test is based on contrasting estimated phylogenies with the theoretically expected phylodynamic ordering of common ancestors in two clades within a coalescent framework. These statistical tests have also motivated the development of algorithms which can be used to quickly screen a phylogenetic tree for clades which are likely to share a distinct demographic or epidemiological history. Epidemiological applications include identification of outbreaks in vulnerable host populations or rapid expansion of genotypes with a fitness advantage. To demonstrate the utility of these methods for outbreak detection, we applied the new methods to large phylogenies reconstructed from thousands of HIV-1 partial pol sequences. This revealed the presence of clades which had grown rapidly in the recent past and was significantly concentrated in young men, suggesting recent and rapid transmission in that group. Furthermore, to demonstrate the utility of these methods for the study of antimicrobial resistance, we applied the new methods to a large phylogeny reconstructed from whole genome Neisseria gonorrhoeae sequences. We find that population structure detected using these methods closely overlaps with the appearance and expansion of mutations conferring antimicrobial resistance. [Antimicrobial resistance; coalescent; HIV; population structure.] Oxford University Press 2020-02-12 /pmc/articles/PMC8559910/ /pubmed/32049340 http://dx.doi.org/10.1093/sysbio/syaa009 Text en © The Author(s) 2020. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regular Articles
Volz, Erik M
Carsten, Wiuf
Grad, Yonatan H
Frost, Simon D W
Dennis, Ann M
Didelot, Xavier
Identification of Hidden Population Structure in Time-Scaled Phylogenies
title Identification of Hidden Population Structure in Time-Scaled Phylogenies
title_full Identification of Hidden Population Structure in Time-Scaled Phylogenies
title_fullStr Identification of Hidden Population Structure in Time-Scaled Phylogenies
title_full_unstemmed Identification of Hidden Population Structure in Time-Scaled Phylogenies
title_short Identification of Hidden Population Structure in Time-Scaled Phylogenies
title_sort identification of hidden population structure in time-scaled phylogenies
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8559910/
https://www.ncbi.nlm.nih.gov/pubmed/32049340
http://dx.doi.org/10.1093/sysbio/syaa009
work_keys_str_mv AT volzerikm identificationofhiddenpopulationstructureintimescaledphylogenies
AT carstenwiuf identificationofhiddenpopulationstructureintimescaledphylogenies
AT gradyonatanh identificationofhiddenpopulationstructureintimescaledphylogenies
AT frostsimondw identificationofhiddenpopulationstructureintimescaledphylogenies
AT dennisannm identificationofhiddenpopulationstructureintimescaledphylogenies
AT didelotxavier identificationofhiddenpopulationstructureintimescaledphylogenies