Cargando…

PO2RDF: representation of real-world data for precision oncology using resource description framework

BACKGROUND: Next-generation sequencing provides comprehensive information about individuals’ genetic makeup and is commonplace in precision oncology practice. Due to the heterogeneity of individual patient’s disease conditions and treatment journeys, not all targeted therapies were initiated despite...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Yiqing, Dimou, Anastasios, Shen, Feichen, Zong, Nansu, Davila, Jaime I., Liu, Hongfang, Wang, Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9338627/
https://www.ncbi.nlm.nih.gov/pubmed/35907849
http://dx.doi.org/10.1186/s12920-022-01314-9
_version_ 1784760012192612352
author Zhao, Yiqing
Dimou, Anastasios
Shen, Feichen
Zong, Nansu
Davila, Jaime I.
Liu, Hongfang
Wang, Chen
author_facet Zhao, Yiqing
Dimou, Anastasios
Shen, Feichen
Zong, Nansu
Davila, Jaime I.
Liu, Hongfang
Wang, Chen
author_sort Zhao, Yiqing
collection PubMed
description BACKGROUND: Next-generation sequencing provides comprehensive information about individuals’ genetic makeup and is commonplace in precision oncology practice. Due to the heterogeneity of individual patient’s disease conditions and treatment journeys, not all targeted therapies were initiated despite actionable mutations. To better understand and support the clinical decision-making process in precision oncology, there is a need to examine real-world associations between patients’ genetic information and treatment choices. METHODS: To fill the gap of insufficient use of real-world data (RWD) in electronic health records (EHRs), we generated a single Resource Description Framework (RDF) resource, called PO2RDF (precision oncology to RDF), by integrating information regarding genes, variants, diseases, and drugs from genetic reports and EHRs. RESULTS: There are a total 2,309,014 triples contained in the PO2RDF. Among them, 32,815 triples are related to Gene, 34,695 triples are related to Variant, 8,787 triples are related to Disease, 26,154 triples are related to Drug. We performed two use case analyses to demonstrate the usability of the PO2RDF: (1) we examined real-world associations between EGFR mutations and targeted therapies to confirm existing knowledge and detect off-label use. (2) We examined differences in prognosis for lung cancer patients with/without TP53 mutations. CONCLUSIONS: In conclusion, our work proposed to use RDF to organize and distribute clinical RWD that is otherwise inaccessible externally. Our work serves as a pilot study that will lead to new clinical applications and could ultimately stimulate progress in the field of precision oncology.
format Online
Article
Text
id pubmed-9338627
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93386272022-07-31 PO2RDF: representation of real-world data for precision oncology using resource description framework Zhao, Yiqing Dimou, Anastasios Shen, Feichen Zong, Nansu Davila, Jaime I. Liu, Hongfang Wang, Chen BMC Med Genomics Technical Advance BACKGROUND: Next-generation sequencing provides comprehensive information about individuals’ genetic makeup and is commonplace in precision oncology practice. Due to the heterogeneity of individual patient’s disease conditions and treatment journeys, not all targeted therapies were initiated despite actionable mutations. To better understand and support the clinical decision-making process in precision oncology, there is a need to examine real-world associations between patients’ genetic information and treatment choices. METHODS: To fill the gap of insufficient use of real-world data (RWD) in electronic health records (EHRs), we generated a single Resource Description Framework (RDF) resource, called PO2RDF (precision oncology to RDF), by integrating information regarding genes, variants, diseases, and drugs from genetic reports and EHRs. RESULTS: There are a total 2,309,014 triples contained in the PO2RDF. Among them, 32,815 triples are related to Gene, 34,695 triples are related to Variant, 8,787 triples are related to Disease, 26,154 triples are related to Drug. We performed two use case analyses to demonstrate the usability of the PO2RDF: (1) we examined real-world associations between EGFR mutations and targeted therapies to confirm existing knowledge and detect off-label use. (2) We examined differences in prognosis for lung cancer patients with/without TP53 mutations. CONCLUSIONS: In conclusion, our work proposed to use RDF to organize and distribute clinical RWD that is otherwise inaccessible externally. Our work serves as a pilot study that will lead to new clinical applications and could ultimately stimulate progress in the field of precision oncology. BioMed Central 2022-07-30 /pmc/articles/PMC9338627/ /pubmed/35907849 http://dx.doi.org/10.1186/s12920-022-01314-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Technical Advance
Zhao, Yiqing
Dimou, Anastasios
Shen, Feichen
Zong, Nansu
Davila, Jaime I.
Liu, Hongfang
Wang, Chen
PO2RDF: representation of real-world data for precision oncology using resource description framework
title PO2RDF: representation of real-world data for precision oncology using resource description framework
title_full PO2RDF: representation of real-world data for precision oncology using resource description framework
title_fullStr PO2RDF: representation of real-world data for precision oncology using resource description framework
title_full_unstemmed PO2RDF: representation of real-world data for precision oncology using resource description framework
title_short PO2RDF: representation of real-world data for precision oncology using resource description framework
title_sort po2rdf: representation of real-world data for precision oncology using resource description framework
topic Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9338627/
https://www.ncbi.nlm.nih.gov/pubmed/35907849
http://dx.doi.org/10.1186/s12920-022-01314-9
work_keys_str_mv AT zhaoyiqing po2rdfrepresentationofrealworlddataforprecisiononcologyusingresourcedescriptionframework
AT dimouanastasios po2rdfrepresentationofrealworlddataforprecisiononcologyusingresourcedescriptionframework
AT shenfeichen po2rdfrepresentationofrealworlddataforprecisiononcologyusingresourcedescriptionframework
AT zongnansu po2rdfrepresentationofrealworlddataforprecisiononcologyusingresourcedescriptionframework
AT davilajaimei po2rdfrepresentationofrealworlddataforprecisiononcologyusingresourcedescriptionframework
AT liuhongfang po2rdfrepresentationofrealworlddataforprecisiononcologyusingresourcedescriptionframework
AT wangchen po2rdfrepresentationofrealworlddataforprecisiononcologyusingresourcedescriptionframework