Cargando…

Inferred networks, machine learning, and health data

This paper presents a network science approach to investigate a health information dataset, the Sexual Acquisition and Transmission of HIV Cooperative Agreement Program (SATHCAP), to uncover hidden relationships that can be used to suggest targeted health interventions. From the data, four key targe...

Descripción completa

Detalles Bibliográficos
Autores principales: Matta, John, Singh, Virender, Auten, Trevor, Sanjel, Prashant
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9870174/
https://www.ncbi.nlm.nih.gov/pubmed/36689443
http://dx.doi.org/10.1371/journal.pone.0280910
_version_ 1784876918422634496
author Matta, John
Singh, Virender
Auten, Trevor
Sanjel, Prashant
author_facet Matta, John
Singh, Virender
Auten, Trevor
Sanjel, Prashant
author_sort Matta, John
collection PubMed
description This paper presents a network science approach to investigate a health information dataset, the Sexual Acquisition and Transmission of HIV Cooperative Agreement Program (SATHCAP), to uncover hidden relationships that can be used to suggest targeted health interventions. From the data, four key target variables are chosen: HIV status, injecting drug use, homelessness, and insurance status. These target variables are converted to a graph format using four separate graph inference techniques: graphical lasso, Meinshausen Bühlmann (MB), k-Nearest Neighbors (kNN), and correlation thresholding (CT). The graphs are then clustered using four clustering methods: Louvain, Leiden, and NBR-Clust with VAT and integrity. Promising clusters are chosen using internal evaluation measures and are visualized and analyzed to identify marker attributes and key relationships. The kNN and CT inference methods are shown to give useful results when combined with NBR-Clust clustering. Examples of cluster analysis indicate that the methodology produces results that will be relevant to the public health community.
format Online
Article
Text
id pubmed-9870174
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-98701742023-01-24 Inferred networks, machine learning, and health data Matta, John Singh, Virender Auten, Trevor Sanjel, Prashant PLoS One Research Article This paper presents a network science approach to investigate a health information dataset, the Sexual Acquisition and Transmission of HIV Cooperative Agreement Program (SATHCAP), to uncover hidden relationships that can be used to suggest targeted health interventions. From the data, four key target variables are chosen: HIV status, injecting drug use, homelessness, and insurance status. These target variables are converted to a graph format using four separate graph inference techniques: graphical lasso, Meinshausen Bühlmann (MB), k-Nearest Neighbors (kNN), and correlation thresholding (CT). The graphs are then clustered using four clustering methods: Louvain, Leiden, and NBR-Clust with VAT and integrity. Promising clusters are chosen using internal evaluation measures and are visualized and analyzed to identify marker attributes and key relationships. The kNN and CT inference methods are shown to give useful results when combined with NBR-Clust clustering. Examples of cluster analysis indicate that the methodology produces results that will be relevant to the public health community. Public Library of Science 2023-01-23 /pmc/articles/PMC9870174/ /pubmed/36689443 http://dx.doi.org/10.1371/journal.pone.0280910 Text en © 2023 Matta et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Matta, John
Singh, Virender
Auten, Trevor
Sanjel, Prashant
Inferred networks, machine learning, and health data
title Inferred networks, machine learning, and health data
title_full Inferred networks, machine learning, and health data
title_fullStr Inferred networks, machine learning, and health data
title_full_unstemmed Inferred networks, machine learning, and health data
title_short Inferred networks, machine learning, and health data
title_sort inferred networks, machine learning, and health data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9870174/
https://www.ncbi.nlm.nih.gov/pubmed/36689443
http://dx.doi.org/10.1371/journal.pone.0280910
work_keys_str_mv AT mattajohn inferrednetworksmachinelearningandhealthdata
AT singhvirender inferrednetworksmachinelearningandhealthdata
AT autentrevor inferrednetworksmachinelearningandhealthdata
AT sanjelprashant inferrednetworksmachinelearningandhealthdata