Cargando…

Biased phylodynamic inferences from analysing clusters of viral sequences

Phylogenetic methods are being increasingly used to help understand the transmission dynamics of measurably evolving viruses, including HIV. Clusters of highly similar sequences are often observed, which appear to follow a ‘power law’ behaviour, with a small number of very large clusters. These clus...

Descripción completa

Detalles Bibliográficos
Autores principales: Dearlove, Bethany L., Xiang, Fei, Frost, Simon D. W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5570026/
https://www.ncbi.nlm.nih.gov/pubmed/28852573
http://dx.doi.org/10.1093/ve/vex020
_version_ 1783259101570007040
author Dearlove, Bethany L.
Xiang, Fei
Frost, Simon D. W.
author_facet Dearlove, Bethany L.
Xiang, Fei
Frost, Simon D. W.
author_sort Dearlove, Bethany L.
collection PubMed
description Phylogenetic methods are being increasingly used to help understand the transmission dynamics of measurably evolving viruses, including HIV. Clusters of highly similar sequences are often observed, which appear to follow a ‘power law’ behaviour, with a small number of very large clusters. These clusters may help to identify subpopulations in an epidemic, and inform where intervention strategies should be implemented. However, clustering of samples does not necessarily imply the presence of a subpopulation with high transmission rates, as groups of closely related viruses can also occur due to non-epidemiological effects such as over-sampling. It is important to ensure that observed phylogenetic clustering reflects true heterogeneity in the transmitting population, and is not being driven by non-epidemiological effects. We qualify the effect of using a falsely identified ‘transmission cluster’ of sequences to estimate phylodynamic parameters including the effective population size and exponential growth rate under several demographic scenarios. Our simulation studies show that taking the maximum size cluster to re-estimate parameters from trees simulated under a randomly mixing, constant population size coalescent process systematically underestimates the overall effective population size. In addition, the transmission cluster wrongly resembles an exponential or logistic growth model 99% of the time. We also illustrate the consequences of false clusters in exponentially growing coalescent and birth-death trees, where again, the growth rate is skewed upwards. This has clear implications for identifying clusters in large viral databases, where a false cluster could result in wasted intervention resources.
format Online
Article
Text
id pubmed-5570026
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55700262017-08-29 Biased phylodynamic inferences from analysing clusters of viral sequences Dearlove, Bethany L. Xiang, Fei Frost, Simon D. W. Virus Evol Reflections Phylogenetic methods are being increasingly used to help understand the transmission dynamics of measurably evolving viruses, including HIV. Clusters of highly similar sequences are often observed, which appear to follow a ‘power law’ behaviour, with a small number of very large clusters. These clusters may help to identify subpopulations in an epidemic, and inform where intervention strategies should be implemented. However, clustering of samples does not necessarily imply the presence of a subpopulation with high transmission rates, as groups of closely related viruses can also occur due to non-epidemiological effects such as over-sampling. It is important to ensure that observed phylogenetic clustering reflects true heterogeneity in the transmitting population, and is not being driven by non-epidemiological effects. We qualify the effect of using a falsely identified ‘transmission cluster’ of sequences to estimate phylodynamic parameters including the effective population size and exponential growth rate under several demographic scenarios. Our simulation studies show that taking the maximum size cluster to re-estimate parameters from trees simulated under a randomly mixing, constant population size coalescent process systematically underestimates the overall effective population size. In addition, the transmission cluster wrongly resembles an exponential or logistic growth model 99% of the time. We also illustrate the consequences of false clusters in exponentially growing coalescent and birth-death trees, where again, the growth rate is skewed upwards. This has clear implications for identifying clusters in large viral databases, where a false cluster could result in wasted intervention resources. Oxford University Press 2017-08-03 /pmc/articles/PMC5570026/ /pubmed/28852573 http://dx.doi.org/10.1093/ve/vex020 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Reflections
Dearlove, Bethany L.
Xiang, Fei
Frost, Simon D. W.
Biased phylodynamic inferences from analysing clusters of viral sequences
title Biased phylodynamic inferences from analysing clusters of viral sequences
title_full Biased phylodynamic inferences from analysing clusters of viral sequences
title_fullStr Biased phylodynamic inferences from analysing clusters of viral sequences
title_full_unstemmed Biased phylodynamic inferences from analysing clusters of viral sequences
title_short Biased phylodynamic inferences from analysing clusters of viral sequences
title_sort biased phylodynamic inferences from analysing clusters of viral sequences
topic Reflections
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5570026/
https://www.ncbi.nlm.nih.gov/pubmed/28852573
http://dx.doi.org/10.1093/ve/vex020
work_keys_str_mv AT dearlovebethanyl biasedphylodynamicinferencesfromanalysingclustersofviralsequences
AT xiangfei biasedphylodynamicinferencesfromanalysingclustersofviralsequences
AT frostsimondw biasedphylodynamicinferencesfromanalysingclustersofviralsequences