Cargando…

Characterization of Molecular Cluster Detection and Evaluation of Cluster Investigation Criteria Using Machine Learning Methods and Statewide Surveillance Data in Washington State

Molecular cluster detection can be used to interrupt HIV transmission but is dependent on identifying clusters where transmission is likely. We characterized molecular cluster detection in Washington State, evaluated the current cluster investigation criteria, and developed a criterion using machine...

Descripción completa

Detalles Bibliográficos
Autores principales: Erly, Steven J., Herbeck, Joshua T., Kerani, Roxanne P., Reuer, Jennifer R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7077225/
https://www.ncbi.nlm.nih.gov/pubmed/31991877
http://dx.doi.org/10.3390/v12020142
_version_ 1783507384018141184
author Erly, Steven J.
Herbeck, Joshua T.
Kerani, Roxanne P.
Reuer, Jennifer R.
author_facet Erly, Steven J.
Herbeck, Joshua T.
Kerani, Roxanne P.
Reuer, Jennifer R.
author_sort Erly, Steven J.
collection PubMed
description Molecular cluster detection can be used to interrupt HIV transmission but is dependent on identifying clusters where transmission is likely. We characterized molecular cluster detection in Washington State, evaluated the current cluster investigation criteria, and developed a criterion using machine learning. The population living with HIV (PLWH) in Washington State, those with an analyzable genotype sequences, and those in clusters were described across demographic characteristics from 2015 to2018. The relationship between 3- and 12-month cluster growth and demographic, clinical, and temporal predictors were described, and a random forest model was fit using data from 2016 to 2017. The ability of this model to identify clusters with future transmission was compared to Centers for Disease Control and Prevention (CDC) and the Washington state criteria in 2018. The population with a genotype was similar to all PLWH, but people in a cluster were disproportionately white, male, and men who have sex with men. The clusters selected for investigation by the random forest model grew on average 2.3 cases (95% CI 1.1–1.4) in 3 months, which was not significantly larger than the CDC criteria (2.0 cases, 95% CI 0.5–3.4). Disparities in the cases analyzed suggest that molecular cluster detection may not benefit all populations. Jurisdictions should use auxiliary data sources for prediction or continue using established investigation criteria.
format Online
Article
Text
id pubmed-7077225
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-70772252020-03-20 Characterization of Molecular Cluster Detection and Evaluation of Cluster Investigation Criteria Using Machine Learning Methods and Statewide Surveillance Data in Washington State Erly, Steven J. Herbeck, Joshua T. Kerani, Roxanne P. Reuer, Jennifer R. Viruses Article Molecular cluster detection can be used to interrupt HIV transmission but is dependent on identifying clusters where transmission is likely. We characterized molecular cluster detection in Washington State, evaluated the current cluster investigation criteria, and developed a criterion using machine learning. The population living with HIV (PLWH) in Washington State, those with an analyzable genotype sequences, and those in clusters were described across demographic characteristics from 2015 to2018. The relationship between 3- and 12-month cluster growth and demographic, clinical, and temporal predictors were described, and a random forest model was fit using data from 2016 to 2017. The ability of this model to identify clusters with future transmission was compared to Centers for Disease Control and Prevention (CDC) and the Washington state criteria in 2018. The population with a genotype was similar to all PLWH, but people in a cluster were disproportionately white, male, and men who have sex with men. The clusters selected for investigation by the random forest model grew on average 2.3 cases (95% CI 1.1–1.4) in 3 months, which was not significantly larger than the CDC criteria (2.0 cases, 95% CI 0.5–3.4). Disparities in the cases analyzed suggest that molecular cluster detection may not benefit all populations. Jurisdictions should use auxiliary data sources for prediction or continue using established investigation criteria. MDPI 2020-01-26 /pmc/articles/PMC7077225/ /pubmed/31991877 http://dx.doi.org/10.3390/v12020142 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Erly, Steven J.
Herbeck, Joshua T.
Kerani, Roxanne P.
Reuer, Jennifer R.
Characterization of Molecular Cluster Detection and Evaluation of Cluster Investigation Criteria Using Machine Learning Methods and Statewide Surveillance Data in Washington State
title Characterization of Molecular Cluster Detection and Evaluation of Cluster Investigation Criteria Using Machine Learning Methods and Statewide Surveillance Data in Washington State
title_full Characterization of Molecular Cluster Detection and Evaluation of Cluster Investigation Criteria Using Machine Learning Methods and Statewide Surveillance Data in Washington State
title_fullStr Characterization of Molecular Cluster Detection and Evaluation of Cluster Investigation Criteria Using Machine Learning Methods and Statewide Surveillance Data in Washington State
title_full_unstemmed Characterization of Molecular Cluster Detection and Evaluation of Cluster Investigation Criteria Using Machine Learning Methods and Statewide Surveillance Data in Washington State
title_short Characterization of Molecular Cluster Detection and Evaluation of Cluster Investigation Criteria Using Machine Learning Methods and Statewide Surveillance Data in Washington State
title_sort characterization of molecular cluster detection and evaluation of cluster investigation criteria using machine learning methods and statewide surveillance data in washington state
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7077225/
https://www.ncbi.nlm.nih.gov/pubmed/31991877
http://dx.doi.org/10.3390/v12020142
work_keys_str_mv AT erlystevenj characterizationofmolecularclusterdetectionandevaluationofclusterinvestigationcriteriausingmachinelearningmethodsandstatewidesurveillancedatainwashingtonstate
AT herbeckjoshuat characterizationofmolecularclusterdetectionandevaluationofclusterinvestigationcriteriausingmachinelearningmethodsandstatewidesurveillancedatainwashingtonstate
AT keraniroxannep characterizationofmolecularclusterdetectionandevaluationofclusterinvestigationcriteriausingmachinelearningmethodsandstatewidesurveillancedatainwashingtonstate
AT reuerjenniferr characterizationofmolecularclusterdetectionandevaluationofclusterinvestigationcriteriausingmachinelearningmethodsandstatewidesurveillancedatainwashingtonstate