Cargando…

Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks

For infectious diseases, a genetic cluster is a group of closely related infections that is usually interpreted as representing a recent outbreak of transmission. Genetic clustering methods are becoming increasingly popular for molecular epidemiology, especially in the context of HIV where there is...

Descripción completa

Detalles Bibliográficos
Autor principal: Poon, Art F. Y.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210024/
https://www.ncbi.nlm.nih.gov/pubmed/28058111
http://dx.doi.org/10.1093/ve/vew031
_version_ 1782490829812662272
author Poon, Art F. Y.
author_facet Poon, Art F. Y.
author_sort Poon, Art F. Y.
collection PubMed
description For infectious diseases, a genetic cluster is a group of closely related infections that is usually interpreted as representing a recent outbreak of transmission. Genetic clustering methods are becoming increasingly popular for molecular epidemiology, especially in the context of HIV where there is now considerable interest in applying these methods to prioritize groups for public health resources such as pre-exposure prophylaxis. To date, genetic clustering has generally been performed with ad hoc algorithms, only some of which have since been encoded and distributed as free software. These algorithms have seldom been validated on simulated data where clusters are known, and their interpretation and similarities are not transparent to users outside of the field. Here, I provide a brief overview on the development and inter-relationships of genetic clustering methods, and an evaluation of six methods on data simulated under an epidemic model in a risk-structured population. The simulation analysis demonstrates that the majority of clustering methods are systematically biased to detect variation in sampling rates among subpopulations, not variation in transmission rates. I discuss these results in the context of previous work and the implications for public health applications of genetic clustering.
format Online
Article
Text
id pubmed-5210024
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-52100242017-01-05 Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks Poon, Art F. Y. Virus Evol Reflections For infectious diseases, a genetic cluster is a group of closely related infections that is usually interpreted as representing a recent outbreak of transmission. Genetic clustering methods are becoming increasingly popular for molecular epidemiology, especially in the context of HIV where there is now considerable interest in applying these methods to prioritize groups for public health resources such as pre-exposure prophylaxis. To date, genetic clustering has generally been performed with ad hoc algorithms, only some of which have since been encoded and distributed as free software. These algorithms have seldom been validated on simulated data where clusters are known, and their interpretation and similarities are not transparent to users outside of the field. Here, I provide a brief overview on the development and inter-relationships of genetic clustering methods, and an evaluation of six methods on data simulated under an epidemic model in a risk-structured population. The simulation analysis demonstrates that the majority of clustering methods are systematically biased to detect variation in sampling rates among subpopulations, not variation in transmission rates. I discuss these results in the context of previous work and the implications for public health applications of genetic clustering. Oxford University Press 2016-10-20 /pmc/articles/PMC5210024/ /pubmed/28058111 http://dx.doi.org/10.1093/ve/vew031 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Reflections
Poon, Art F. Y.
Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks
title Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks
title_full Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks
title_fullStr Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks
title_full_unstemmed Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks
title_short Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks
title_sort impacts and shortcomings of genetic clustering methods for infectious disease outbreaks
topic Reflections
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210024/
https://www.ncbi.nlm.nih.gov/pubmed/28058111
http://dx.doi.org/10.1093/ve/vew031
work_keys_str_mv AT poonartfy impactsandshortcomingsofgeneticclusteringmethodsforinfectiousdiseaseoutbreaks