Cargando…
Biased phylodynamic inferences from analysing clusters of viral sequences
Phylogenetic methods are being increasingly used to help understand the transmission dynamics of measurably evolving viruses, including HIV. Clusters of highly similar sequences are often observed, which appear to follow a ‘power law’ behaviour, with a small number of very large clusters. These clus...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5570026/ https://www.ncbi.nlm.nih.gov/pubmed/28852573 http://dx.doi.org/10.1093/ve/vex020 |
_version_ | 1783259101570007040 |
---|---|
author | Dearlove, Bethany L. Xiang, Fei Frost, Simon D. W. |
author_facet | Dearlove, Bethany L. Xiang, Fei Frost, Simon D. W. |
author_sort | Dearlove, Bethany L. |
collection | PubMed |
description | Phylogenetic methods are being increasingly used to help understand the transmission dynamics of measurably evolving viruses, including HIV. Clusters of highly similar sequences are often observed, which appear to follow a ‘power law’ behaviour, with a small number of very large clusters. These clusters may help to identify subpopulations in an epidemic, and inform where intervention strategies should be implemented. However, clustering of samples does not necessarily imply the presence of a subpopulation with high transmission rates, as groups of closely related viruses can also occur due to non-epidemiological effects such as over-sampling. It is important to ensure that observed phylogenetic clustering reflects true heterogeneity in the transmitting population, and is not being driven by non-epidemiological effects. We qualify the effect of using a falsely identified ‘transmission cluster’ of sequences to estimate phylodynamic parameters including the effective population size and exponential growth rate under several demographic scenarios. Our simulation studies show that taking the maximum size cluster to re-estimate parameters from trees simulated under a randomly mixing, constant population size coalescent process systematically underestimates the overall effective population size. In addition, the transmission cluster wrongly resembles an exponential or logistic growth model 99% of the time. We also illustrate the consequences of false clusters in exponentially growing coalescent and birth-death trees, where again, the growth rate is skewed upwards. This has clear implications for identifying clusters in large viral databases, where a false cluster could result in wasted intervention resources. |
format | Online Article Text |
id | pubmed-5570026 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-55700262017-08-29 Biased phylodynamic inferences from analysing clusters of viral sequences Dearlove, Bethany L. Xiang, Fei Frost, Simon D. W. Virus Evol Reflections Phylogenetic methods are being increasingly used to help understand the transmission dynamics of measurably evolving viruses, including HIV. Clusters of highly similar sequences are often observed, which appear to follow a ‘power law’ behaviour, with a small number of very large clusters. These clusters may help to identify subpopulations in an epidemic, and inform where intervention strategies should be implemented. However, clustering of samples does not necessarily imply the presence of a subpopulation with high transmission rates, as groups of closely related viruses can also occur due to non-epidemiological effects such as over-sampling. It is important to ensure that observed phylogenetic clustering reflects true heterogeneity in the transmitting population, and is not being driven by non-epidemiological effects. We qualify the effect of using a falsely identified ‘transmission cluster’ of sequences to estimate phylodynamic parameters including the effective population size and exponential growth rate under several demographic scenarios. Our simulation studies show that taking the maximum size cluster to re-estimate parameters from trees simulated under a randomly mixing, constant population size coalescent process systematically underestimates the overall effective population size. In addition, the transmission cluster wrongly resembles an exponential or logistic growth model 99% of the time. We also illustrate the consequences of false clusters in exponentially growing coalescent and birth-death trees, where again, the growth rate is skewed upwards. This has clear implications for identifying clusters in large viral databases, where a false cluster could result in wasted intervention resources. Oxford University Press 2017-08-03 /pmc/articles/PMC5570026/ /pubmed/28852573 http://dx.doi.org/10.1093/ve/vex020 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Reflections Dearlove, Bethany L. Xiang, Fei Frost, Simon D. W. Biased phylodynamic inferences from analysing clusters of viral sequences |
title | Biased phylodynamic inferences from analysing clusters of viral sequences |
title_full | Biased phylodynamic inferences from analysing clusters of viral sequences |
title_fullStr | Biased phylodynamic inferences from analysing clusters of viral sequences |
title_full_unstemmed | Biased phylodynamic inferences from analysing clusters of viral sequences |
title_short | Biased phylodynamic inferences from analysing clusters of viral sequences |
title_sort | biased phylodynamic inferences from analysing clusters of viral sequences |
topic | Reflections |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5570026/ https://www.ncbi.nlm.nih.gov/pubmed/28852573 http://dx.doi.org/10.1093/ve/vex020 |
work_keys_str_mv | AT dearlovebethanyl biasedphylodynamicinferencesfromanalysingclustersofviralsequences AT xiangfei biasedphylodynamicinferencesfromanalysingclustersofviralsequences AT frostsimondw biasedphylodynamicinferencesfromanalysingclustersofviralsequences |