Cargando…

Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests

OBJECTIVE: Female reproductive disorders (FRDs) are common health conditions that may present with significant symptoms. Diet and environment are potential areas for FRD interventions. We utilized a knowledge graph (KG) method to predict factors associated with common FRDs (e.g., endometriosis, ovar...

Descripción completa

Detalles Bibliográficos
Autores principales: Chan, Lauren E, Casiraghi, Elena, Putman, Tim, Reese, Justin, Harmon, Quaker E., Schaper, Kevin, Hedge, Harshad, Valentini, Giorgio, Schmitt, Charles, Motsinger-Reif, Alison, Hall, Janet E, Mungall, Christopher J, Robinson, Peter N, Haendel, Melissa A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371183/
https://www.ncbi.nlm.nih.gov/pubmed/37502882
http://dx.doi.org/10.1101/2023.07.14.23292679
_version_ 1785078100527153152
author Chan, Lauren E
Casiraghi, Elena
Putman, Tim
Reese, Justin
Harmon, Quaker E.
Schaper, Kevin
Hedge, Harshad
Valentini, Giorgio
Schmitt, Charles
Motsinger-Reif, Alison
Hall, Janet E
Mungall, Christopher J
Robinson, Peter N
Haendel, Melissa A
author_facet Chan, Lauren E
Casiraghi, Elena
Putman, Tim
Reese, Justin
Harmon, Quaker E.
Schaper, Kevin
Hedge, Harshad
Valentini, Giorgio
Schmitt, Charles
Motsinger-Reif, Alison
Hall, Janet E
Mungall, Christopher J
Robinson, Peter N
Haendel, Melissa A
author_sort Chan, Lauren E
collection PubMed
description OBJECTIVE: Female reproductive disorders (FRDs) are common health conditions that may present with significant symptoms. Diet and environment are potential areas for FRD interventions. We utilized a knowledge graph (KG) method to predict factors associated with common FRDs (e.g., endometriosis, ovarian cyst, and uterine fibroids). MATERIALS AND METHODS: We harmonized survey data from the Personalized Environment and Genes Study on internal and external environmental exposures and health conditions with biomedical ontology content. We merged the harmonized data and ontologies with supplemental nutrient and agricultural chemical data to create a KG. We analyzed the KG by embedding edges and applying a random forest for edge prediction to identify variables potentially associated with FRDs. We also conducted logistic regression analysis for comparison. RESULTS: Across 9765 PEGS respondents, the KG analysis resulted in 8535 significant predicted links between FRDs and chemicals, phenotypes, and diseases. Amongst these links, 32 were exact matches when compared with the logistic regression results, including comorbidities, medications, foods, and occupational exposures. DISCUSSION: Mechanistic underpinnings of predicted links documented in the literature may support some of our findings. Our KG methods are useful for predicting possible associations in large, survey-based datasets with added information on directionality and magnitude of effect from logistic regression. These results should not be construed as causal, but can support hypothesis generation. CONCLUSION: This investigation enabled the generation of hypotheses on a variety of potential links between FRDs and exposures. Future investigations should prospectively evaluate the variables hypothesized to impact FRDs.
format Online
Article
Text
id pubmed-10371183
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-103711832023-07-27 Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests Chan, Lauren E Casiraghi, Elena Putman, Tim Reese, Justin Harmon, Quaker E. Schaper, Kevin Hedge, Harshad Valentini, Giorgio Schmitt, Charles Motsinger-Reif, Alison Hall, Janet E Mungall, Christopher J Robinson, Peter N Haendel, Melissa A medRxiv Article OBJECTIVE: Female reproductive disorders (FRDs) are common health conditions that may present with significant symptoms. Diet and environment are potential areas for FRD interventions. We utilized a knowledge graph (KG) method to predict factors associated with common FRDs (e.g., endometriosis, ovarian cyst, and uterine fibroids). MATERIALS AND METHODS: We harmonized survey data from the Personalized Environment and Genes Study on internal and external environmental exposures and health conditions with biomedical ontology content. We merged the harmonized data and ontologies with supplemental nutrient and agricultural chemical data to create a KG. We analyzed the KG by embedding edges and applying a random forest for edge prediction to identify variables potentially associated with FRDs. We also conducted logistic regression analysis for comparison. RESULTS: Across 9765 PEGS respondents, the KG analysis resulted in 8535 significant predicted links between FRDs and chemicals, phenotypes, and diseases. Amongst these links, 32 were exact matches when compared with the logistic regression results, including comorbidities, medications, foods, and occupational exposures. DISCUSSION: Mechanistic underpinnings of predicted links documented in the literature may support some of our findings. Our KG methods are useful for predicting possible associations in large, survey-based datasets with added information on directionality and magnitude of effect from logistic regression. These results should not be construed as causal, but can support hypothesis generation. CONCLUSION: This investigation enabled the generation of hypotheses on a variety of potential links between FRDs and exposures. Future investigations should prospectively evaluate the variables hypothesized to impact FRDs. Cold Spring Harbor Laboratory 2023-07-16 /pmc/articles/PMC10371183/ /pubmed/37502882 http://dx.doi.org/10.1101/2023.07.14.23292679 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Chan, Lauren E
Casiraghi, Elena
Putman, Tim
Reese, Justin
Harmon, Quaker E.
Schaper, Kevin
Hedge, Harshad
Valentini, Giorgio
Schmitt, Charles
Motsinger-Reif, Alison
Hall, Janet E
Mungall, Christopher J
Robinson, Peter N
Haendel, Melissa A
Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests
title Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests
title_full Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests
title_fullStr Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests
title_full_unstemmed Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests
title_short Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests
title_sort predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371183/
https://www.ncbi.nlm.nih.gov/pubmed/37502882
http://dx.doi.org/10.1101/2023.07.14.23292679
work_keys_str_mv AT chanlaurene predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT casiraghielena predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT putmantim predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT reesejustin predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT harmonquakere predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT schaperkevin predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT hedgeharshad predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT valentinigiorgio predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT schmittcharles predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT motsingerreifalison predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT halljanete predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT mungallchristopherj predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT robinsonpetern predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests
AT haendelmelissaa predictingnutritionandenvironmentalfactorsassociatedwithfemalereproductivedisordersusingaknowledgegraphandrandomforests