Cargando…
OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review
The current generation of sequencing technologies has led to significant advances in identifying novel disease-associated mutations and generated large amounts of data in a high-throughput manner. Such data in conjunction with clinical routine data are proven to be highly useful in deriving populati...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9569469/ https://www.ncbi.nlm.nih.gov/pubmed/36233137 http://dx.doi.org/10.3390/ijms231911834 |
_version_ | 1784809861040570368 |
---|---|
author | Ahmadi, Najia Peng, Yuan Wolfien, Markus Zoch, Michéle Sedlmayr, Martin |
author_facet | Ahmadi, Najia Peng, Yuan Wolfien, Markus Zoch, Michéle Sedlmayr, Martin |
author_sort | Ahmadi, Najia |
collection | PubMed |
description | The current generation of sequencing technologies has led to significant advances in identifying novel disease-associated mutations and generated large amounts of data in a high-throughput manner. Such data in conjunction with clinical routine data are proven to be highly useful in deriving population-level and patient-level predictions, especially in the field of cancer precision medicine. However, data harmonization across multiple national and international clinical sites is an essential step for the assessment of events and outcomes associated with patients, which is currently not adequately addressed. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) is an internationally established research data repository introduced by the Observational Health Data Science and Informatics (OHDSI) community to overcome this issue. To address the needs of cancer research, the genomic vocabulary extension was introduced in 2020 to support the standardization of subsequent data analysis. In this review, we evaluate the current potential of the OMOP CDM to be applicable in cancer prediction and how comprehensively the genomic vocabulary extension of the OMOP can serve current needs of AI-based predictions. For this, we systematically screened the literature for articles that use the OMOP CDM in predictive analyses in cancer and investigated the underlying predictive models/tools. Interestingly, we found 248 articles, of which most use the OMOP for harmonizing their data, but only 5 make use of predictive algorithms on OMOP-based data and fulfill our criteria. The studies present multicentric investigations, in which the OMOP played an essential role in discovering and optimizing machine learning (ML)-based models. Ultimately, the use of the OMOP CDM leads to standardized data-driven studies for multiple clinical sites and enables a more solid basis utilizing, e.g., ML models that can be reused and combined in early prediction, diagnosis, and improvement of personalized cancer care and biomarker discovery. |
format | Online Article Text |
id | pubmed-9569469 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95694692022-10-17 OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review Ahmadi, Najia Peng, Yuan Wolfien, Markus Zoch, Michéle Sedlmayr, Martin Int J Mol Sci Review The current generation of sequencing technologies has led to significant advances in identifying novel disease-associated mutations and generated large amounts of data in a high-throughput manner. Such data in conjunction with clinical routine data are proven to be highly useful in deriving population-level and patient-level predictions, especially in the field of cancer precision medicine. However, data harmonization across multiple national and international clinical sites is an essential step for the assessment of events and outcomes associated with patients, which is currently not adequately addressed. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) is an internationally established research data repository introduced by the Observational Health Data Science and Informatics (OHDSI) community to overcome this issue. To address the needs of cancer research, the genomic vocabulary extension was introduced in 2020 to support the standardization of subsequent data analysis. In this review, we evaluate the current potential of the OMOP CDM to be applicable in cancer prediction and how comprehensively the genomic vocabulary extension of the OMOP can serve current needs of AI-based predictions. For this, we systematically screened the literature for articles that use the OMOP CDM in predictive analyses in cancer and investigated the underlying predictive models/tools. Interestingly, we found 248 articles, of which most use the OMOP for harmonizing their data, but only 5 make use of predictive algorithms on OMOP-based data and fulfill our criteria. The studies present multicentric investigations, in which the OMOP played an essential role in discovering and optimizing machine learning (ML)-based models. Ultimately, the use of the OMOP CDM leads to standardized data-driven studies for multiple clinical sites and enables a more solid basis utilizing, e.g., ML models that can be reused and combined in early prediction, diagnosis, and improvement of personalized cancer care and biomarker discovery. MDPI 2022-10-05 /pmc/articles/PMC9569469/ /pubmed/36233137 http://dx.doi.org/10.3390/ijms231911834 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Review Ahmadi, Najia Peng, Yuan Wolfien, Markus Zoch, Michéle Sedlmayr, Martin OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review |
title | OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review |
title_full | OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review |
title_fullStr | OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review |
title_full_unstemmed | OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review |
title_short | OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review |
title_sort | omop cdm can facilitate data-driven studies for cancer prediction: a systematic review |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9569469/ https://www.ncbi.nlm.nih.gov/pubmed/36233137 http://dx.doi.org/10.3390/ijms231911834 |
work_keys_str_mv | AT ahmadinajia omopcdmcanfacilitatedatadrivenstudiesforcancerpredictionasystematicreview AT pengyuan omopcdmcanfacilitatedatadrivenstudiesforcancerpredictionasystematicreview AT wolfienmarkus omopcdmcanfacilitatedatadrivenstudiesforcancerpredictionasystematicreview AT zochmichele omopcdmcanfacilitatedatadrivenstudiesforcancerpredictionasystematicreview AT sedlmayrmartin omopcdmcanfacilitatedatadrivenstudiesforcancerpredictionasystematicreview |