Cargando…

Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection

Detection of viral transmission clusters using molecular epidemiology is critical to the response pillar of the Ending the HIV Epidemic initiative. Here, we studied whether inference with an incomplete dataset would influence the accuracy of the reconstructed molecular transmission network. We analy...

Descripción completa

Detalles Bibliográficos
Autores principales: Mazrouee, Sepideh, Hallmark, Camden J., Mora, Ricardo, Del Vecchio, Natascha, Carrasco Hernandez, Rocio, Carr, Michelle, McNeese, Marlene, Fujimoto, Kayo, Wertheim, Joel O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9648870/
https://www.ncbi.nlm.nih.gov/pubmed/36357480
http://dx.doi.org/10.1038/s41598-022-21924-8
_version_ 1784827673110904832
author Mazrouee, Sepideh
Hallmark, Camden J.
Mora, Ricardo
Del Vecchio, Natascha
Carrasco Hernandez, Rocio
Carr, Michelle
McNeese, Marlene
Fujimoto, Kayo
Wertheim, Joel O.
author_facet Mazrouee, Sepideh
Hallmark, Camden J.
Mora, Ricardo
Del Vecchio, Natascha
Carrasco Hernandez, Rocio
Carr, Michelle
McNeese, Marlene
Fujimoto, Kayo
Wertheim, Joel O.
author_sort Mazrouee, Sepideh
collection PubMed
description Detection of viral transmission clusters using molecular epidemiology is critical to the response pillar of the Ending the HIV Epidemic initiative. Here, we studied whether inference with an incomplete dataset would influence the accuracy of the reconstructed molecular transmission network. We analyzed viral sequence data available from ~ 13,000 individuals with diagnosed HIV (2012–2019) from Houston Health Department surveillance data with 53% completeness (n = 6852 individuals with sequences). We extracted random subsamples and compared the resulting reconstructed networks versus the full-size network. Increasing simulated completeness was associated with an increase in the number of detected clusters. We also subsampled based on the network node influence in the transmission of the virus where we measured Expected Force (ExF) for each node in the network. We simulated the removal of nodes with the highest and then lowest ExF from the full dataset and discovered that 4.7% and 60% of priority clusters were detected respectively. These results highlight the non-uniform impact of capturing high influence nodes in identifying transmission clusters. Although increasing sequence reporting completeness is the way to fully detect HIV transmission patterns, reaching high completeness has remained challenging in the real world. Hence, we suggest taking a network science approach to enhance performance of molecular cluster detection, augmented by node influence information.
format Online
Article
Text
id pubmed-9648870
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96488702022-11-14 Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection Mazrouee, Sepideh Hallmark, Camden J. Mora, Ricardo Del Vecchio, Natascha Carrasco Hernandez, Rocio Carr, Michelle McNeese, Marlene Fujimoto, Kayo Wertheim, Joel O. Sci Rep Article Detection of viral transmission clusters using molecular epidemiology is critical to the response pillar of the Ending the HIV Epidemic initiative. Here, we studied whether inference with an incomplete dataset would influence the accuracy of the reconstructed molecular transmission network. We analyzed viral sequence data available from ~ 13,000 individuals with diagnosed HIV (2012–2019) from Houston Health Department surveillance data with 53% completeness (n = 6852 individuals with sequences). We extracted random subsamples and compared the resulting reconstructed networks versus the full-size network. Increasing simulated completeness was associated with an increase in the number of detected clusters. We also subsampled based on the network node influence in the transmission of the virus where we measured Expected Force (ExF) for each node in the network. We simulated the removal of nodes with the highest and then lowest ExF from the full dataset and discovered that 4.7% and 60% of priority clusters were detected respectively. These results highlight the non-uniform impact of capturing high influence nodes in identifying transmission clusters. Although increasing sequence reporting completeness is the way to fully detect HIV transmission patterns, reaching high completeness has remained challenging in the real world. Hence, we suggest taking a network science approach to enhance performance of molecular cluster detection, augmented by node influence information. Nature Publishing Group UK 2022-11-10 /pmc/articles/PMC9648870/ /pubmed/36357480 http://dx.doi.org/10.1038/s41598-022-21924-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Mazrouee, Sepideh
Hallmark, Camden J.
Mora, Ricardo
Del Vecchio, Natascha
Carrasco Hernandez, Rocio
Carr, Michelle
McNeese, Marlene
Fujimoto, Kayo
Wertheim, Joel O.
Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection
title Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection
title_full Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection
title_fullStr Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection
title_full_unstemmed Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection
title_short Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection
title_sort impact of molecular sequence data completeness on hiv cluster detection and a network science approach to enhance detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9648870/
https://www.ncbi.nlm.nih.gov/pubmed/36357480
http://dx.doi.org/10.1038/s41598-022-21924-8
work_keys_str_mv AT mazroueesepideh impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection
AT hallmarkcamdenj impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection
AT moraricardo impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection
AT delvecchionatascha impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection
AT carrascohernandezrocio impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection
AT carrmichelle impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection
AT mcneesemarlene impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection
AT fujimotokayo impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection
AT wertheimjoelo impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection