Cargando…
Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection
Detection of viral transmission clusters using molecular epidemiology is critical to the response pillar of the Ending the HIV Epidemic initiative. Here, we studied whether inference with an incomplete dataset would influence the accuracy of the reconstructed molecular transmission network. We analy...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9648870/ https://www.ncbi.nlm.nih.gov/pubmed/36357480 http://dx.doi.org/10.1038/s41598-022-21924-8 |
_version_ | 1784827673110904832 |
---|---|
author | Mazrouee, Sepideh Hallmark, Camden J. Mora, Ricardo Del Vecchio, Natascha Carrasco Hernandez, Rocio Carr, Michelle McNeese, Marlene Fujimoto, Kayo Wertheim, Joel O. |
author_facet | Mazrouee, Sepideh Hallmark, Camden J. Mora, Ricardo Del Vecchio, Natascha Carrasco Hernandez, Rocio Carr, Michelle McNeese, Marlene Fujimoto, Kayo Wertheim, Joel O. |
author_sort | Mazrouee, Sepideh |
collection | PubMed |
description | Detection of viral transmission clusters using molecular epidemiology is critical to the response pillar of the Ending the HIV Epidemic initiative. Here, we studied whether inference with an incomplete dataset would influence the accuracy of the reconstructed molecular transmission network. We analyzed viral sequence data available from ~ 13,000 individuals with diagnosed HIV (2012–2019) from Houston Health Department surveillance data with 53% completeness (n = 6852 individuals with sequences). We extracted random subsamples and compared the resulting reconstructed networks versus the full-size network. Increasing simulated completeness was associated with an increase in the number of detected clusters. We also subsampled based on the network node influence in the transmission of the virus where we measured Expected Force (ExF) for each node in the network. We simulated the removal of nodes with the highest and then lowest ExF from the full dataset and discovered that 4.7% and 60% of priority clusters were detected respectively. These results highlight the non-uniform impact of capturing high influence nodes in identifying transmission clusters. Although increasing sequence reporting completeness is the way to fully detect HIV transmission patterns, reaching high completeness has remained challenging in the real world. Hence, we suggest taking a network science approach to enhance performance of molecular cluster detection, augmented by node influence information. |
format | Online Article Text |
id | pubmed-9648870 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-96488702022-11-14 Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection Mazrouee, Sepideh Hallmark, Camden J. Mora, Ricardo Del Vecchio, Natascha Carrasco Hernandez, Rocio Carr, Michelle McNeese, Marlene Fujimoto, Kayo Wertheim, Joel O. Sci Rep Article Detection of viral transmission clusters using molecular epidemiology is critical to the response pillar of the Ending the HIV Epidemic initiative. Here, we studied whether inference with an incomplete dataset would influence the accuracy of the reconstructed molecular transmission network. We analyzed viral sequence data available from ~ 13,000 individuals with diagnosed HIV (2012–2019) from Houston Health Department surveillance data with 53% completeness (n = 6852 individuals with sequences). We extracted random subsamples and compared the resulting reconstructed networks versus the full-size network. Increasing simulated completeness was associated with an increase in the number of detected clusters. We also subsampled based on the network node influence in the transmission of the virus where we measured Expected Force (ExF) for each node in the network. We simulated the removal of nodes with the highest and then lowest ExF from the full dataset and discovered that 4.7% and 60% of priority clusters were detected respectively. These results highlight the non-uniform impact of capturing high influence nodes in identifying transmission clusters. Although increasing sequence reporting completeness is the way to fully detect HIV transmission patterns, reaching high completeness has remained challenging in the real world. Hence, we suggest taking a network science approach to enhance performance of molecular cluster detection, augmented by node influence information. Nature Publishing Group UK 2022-11-10 /pmc/articles/PMC9648870/ /pubmed/36357480 http://dx.doi.org/10.1038/s41598-022-21924-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Mazrouee, Sepideh Hallmark, Camden J. Mora, Ricardo Del Vecchio, Natascha Carrasco Hernandez, Rocio Carr, Michelle McNeese, Marlene Fujimoto, Kayo Wertheim, Joel O. Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection |
title | Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection |
title_full | Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection |
title_fullStr | Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection |
title_full_unstemmed | Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection |
title_short | Impact of molecular sequence data completeness on HIV cluster detection and a network science approach to enhance detection |
title_sort | impact of molecular sequence data completeness on hiv cluster detection and a network science approach to enhance detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9648870/ https://www.ncbi.nlm.nih.gov/pubmed/36357480 http://dx.doi.org/10.1038/s41598-022-21924-8 |
work_keys_str_mv | AT mazroueesepideh impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection AT hallmarkcamdenj impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection AT moraricardo impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection AT delvecchionatascha impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection AT carrascohernandezrocio impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection AT carrmichelle impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection AT mcneesemarlene impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection AT fujimotokayo impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection AT wertheimjoelo impactofmolecularsequencedatacompletenessonhivclusterdetectionandanetworkscienceapproachtoenhancedetection |