Cargando…

Building a knowledge graph to enable precision medicine

Developing personalized diagnostic strategies and targeted treatments requires a deep understanding of disease biology and the ability to dissect the relationship between molecular and genetic factors and their phenotypic consequences. However, such knowledge is fragmented across publications, non-s...

Descripción completa

Detalles Bibliográficos
Autores principales: Chandak, Payal, Huang, Kexin, Zitnik, Marinka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9893183/
https://www.ncbi.nlm.nih.gov/pubmed/36732524
http://dx.doi.org/10.1038/s41597-023-01960-3
_version_ 1784881471016665088
author Chandak, Payal
Huang, Kexin
Zitnik, Marinka
author_facet Chandak, Payal
Huang, Kexin
Zitnik, Marinka
author_sort Chandak, Payal
collection PubMed
description Developing personalized diagnostic strategies and targeted treatments requires a deep understanding of disease biology and the ability to dissect the relationship between molecular and genetic factors and their phenotypic consequences. However, such knowledge is fragmented across publications, non-standardized repositories, and evolving ontologies describing various scales of biological organization between genotypes and clinical phenotypes. Here, we present PrimeKG, a multimodal knowledge graph for precision medicine analyses. PrimeKG integrates 20 high-quality resources to describe 17,080 diseases with 4,050,249 relationships representing ten major biological scales, including disease-associated protein perturbations, biological processes and pathways, anatomical and phenotypic scales, and the entire range of approved drugs with their therapeutic action, considerably expanding previous efforts in disease-rooted knowledge graphs. PrimeKG contains an abundance of ‘indications’, ‘contradictions’, and ‘off-label use’ drug-disease edges that lack in other knowledge graphs and can support AI analyses of how drugs affect disease-associated networks. We supplement PrimeKG’s graph structure with language descriptions of clinical guidelines to enable multimodal analyses and provide instructions for continual updates of PrimeKG as new data become available.
format Online
Article
Text
id pubmed-9893183
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-98931832023-02-02 Building a knowledge graph to enable precision medicine Chandak, Payal Huang, Kexin Zitnik, Marinka Sci Data Data Descriptor Developing personalized diagnostic strategies and targeted treatments requires a deep understanding of disease biology and the ability to dissect the relationship between molecular and genetic factors and their phenotypic consequences. However, such knowledge is fragmented across publications, non-standardized repositories, and evolving ontologies describing various scales of biological organization between genotypes and clinical phenotypes. Here, we present PrimeKG, a multimodal knowledge graph for precision medicine analyses. PrimeKG integrates 20 high-quality resources to describe 17,080 diseases with 4,050,249 relationships representing ten major biological scales, including disease-associated protein perturbations, biological processes and pathways, anatomical and phenotypic scales, and the entire range of approved drugs with their therapeutic action, considerably expanding previous efforts in disease-rooted knowledge graphs. PrimeKG contains an abundance of ‘indications’, ‘contradictions’, and ‘off-label use’ drug-disease edges that lack in other knowledge graphs and can support AI analyses of how drugs affect disease-associated networks. We supplement PrimeKG’s graph structure with language descriptions of clinical guidelines to enable multimodal analyses and provide instructions for continual updates of PrimeKG as new data become available. Nature Publishing Group UK 2023-02-02 /pmc/articles/PMC9893183/ /pubmed/36732524 http://dx.doi.org/10.1038/s41597-023-01960-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Chandak, Payal
Huang, Kexin
Zitnik, Marinka
Building a knowledge graph to enable precision medicine
title Building a knowledge graph to enable precision medicine
title_full Building a knowledge graph to enable precision medicine
title_fullStr Building a knowledge graph to enable precision medicine
title_full_unstemmed Building a knowledge graph to enable precision medicine
title_short Building a knowledge graph to enable precision medicine
title_sort building a knowledge graph to enable precision medicine
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9893183/
https://www.ncbi.nlm.nih.gov/pubmed/36732524
http://dx.doi.org/10.1038/s41597-023-01960-3
work_keys_str_mv AT chandakpayal buildingaknowledgegraphtoenableprecisionmedicine
AT huangkexin buildingaknowledgegraphtoenableprecisionmedicine
AT zitnikmarinka buildingaknowledgegraphtoenableprecisionmedicine