Cargando…
Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks
For infectious diseases, a genetic cluster is a group of closely related infections that is usually interpreted as representing a recent outbreak of transmission. Genetic clustering methods are becoming increasingly popular for molecular epidemiology, especially in the context of HIV where there is...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210024/ https://www.ncbi.nlm.nih.gov/pubmed/28058111 http://dx.doi.org/10.1093/ve/vew031 |
_version_ | 1782490829812662272 |
---|---|
author | Poon, Art F. Y. |
author_facet | Poon, Art F. Y. |
author_sort | Poon, Art F. Y. |
collection | PubMed |
description | For infectious diseases, a genetic cluster is a group of closely related infections that is usually interpreted as representing a recent outbreak of transmission. Genetic clustering methods are becoming increasingly popular for molecular epidemiology, especially in the context of HIV where there is now considerable interest in applying these methods to prioritize groups for public health resources such as pre-exposure prophylaxis. To date, genetic clustering has generally been performed with ad hoc algorithms, only some of which have since been encoded and distributed as free software. These algorithms have seldom been validated on simulated data where clusters are known, and their interpretation and similarities are not transparent to users outside of the field. Here, I provide a brief overview on the development and inter-relationships of genetic clustering methods, and an evaluation of six methods on data simulated under an epidemic model in a risk-structured population. The simulation analysis demonstrates that the majority of clustering methods are systematically biased to detect variation in sampling rates among subpopulations, not variation in transmission rates. I discuss these results in the context of previous work and the implications for public health applications of genetic clustering. |
format | Online Article Text |
id | pubmed-5210024 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-52100242017-01-05 Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks Poon, Art F. Y. Virus Evol Reflections For infectious diseases, a genetic cluster is a group of closely related infections that is usually interpreted as representing a recent outbreak of transmission. Genetic clustering methods are becoming increasingly popular for molecular epidemiology, especially in the context of HIV where there is now considerable interest in applying these methods to prioritize groups for public health resources such as pre-exposure prophylaxis. To date, genetic clustering has generally been performed with ad hoc algorithms, only some of which have since been encoded and distributed as free software. These algorithms have seldom been validated on simulated data where clusters are known, and their interpretation and similarities are not transparent to users outside of the field. Here, I provide a brief overview on the development and inter-relationships of genetic clustering methods, and an evaluation of six methods on data simulated under an epidemic model in a risk-structured population. The simulation analysis demonstrates that the majority of clustering methods are systematically biased to detect variation in sampling rates among subpopulations, not variation in transmission rates. I discuss these results in the context of previous work and the implications for public health applications of genetic clustering. Oxford University Press 2016-10-20 /pmc/articles/PMC5210024/ /pubmed/28058111 http://dx.doi.org/10.1093/ve/vew031 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Reflections Poon, Art F. Y. Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks |
title | Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks |
title_full | Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks |
title_fullStr | Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks |
title_full_unstemmed | Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks |
title_short | Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks |
title_sort | impacts and shortcomings of genetic clustering methods for infectious disease outbreaks |
topic | Reflections |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210024/ https://www.ncbi.nlm.nih.gov/pubmed/28058111 http://dx.doi.org/10.1093/ve/vew031 |
work_keys_str_mv | AT poonartfy impactsandshortcomingsofgeneticclusteringmethodsforinfectiousdiseaseoutbreaks |