Cargando…
Pharmacogenomics driven decision support prototype with machine learning: A framework for improving patient care
INTRODUCTION: A growing number of healthcare providers make complex treatment decisions guided by electronic health record (EHR) software interfaces. Many interfaces integrate multiple sources of data (e.g., labs, pharmacy, diagnoses) successfully, though relatively few have incorporated genetic dat...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9705957/ https://www.ncbi.nlm.nih.gov/pubmed/36458283 http://dx.doi.org/10.3389/fdata.2022.1059088 |
_version_ | 1784840399780577280 |
---|---|
author | Kidwai-Khan, Farah Rentsch, Christopher T. Pulk, Rebecca Alcorn, Charles Brandt, Cynthia A. Justice, Amy C. |
author_facet | Kidwai-Khan, Farah Rentsch, Christopher T. Pulk, Rebecca Alcorn, Charles Brandt, Cynthia A. Justice, Amy C. |
author_sort | Kidwai-Khan, Farah |
collection | PubMed |
description | INTRODUCTION: A growing number of healthcare providers make complex treatment decisions guided by electronic health record (EHR) software interfaces. Many interfaces integrate multiple sources of data (e.g., labs, pharmacy, diagnoses) successfully, though relatively few have incorporated genetic data. METHOD: This study utilizes informatics methods with predictive modeling to create and validate algorithms to enable informed pharmacogenomic decision-making at the point of care in near real-time. The proposed framework integrates EHR and genetic data relevant to the patient's current medications including decision support mechanisms based on predictive modeling. We created a prototype with EHR and linked genetic data from the Department of Veterans Affairs (VA), the largest integrated healthcare system in the US. The EHR data included diagnoses, medication fills, and outpatient clinic visits for 2,600 people with HIV and matched uninfected controls linked to prototypic genetic data (variations in single or multiple positions in the DNA sequence). We then mapped the medications that patients were prescribed to medications defined in the drug-gene interaction mapping of the Clinical Pharmacogenomics Implementation Consortium's (CPIC) level A (i.e., sufficient evidence for at least one prescribing action) guidelines that predict adverse events. CPIC is a National Institute of Health funded group of experts who develop evidence based pharmacogenomic guidelines. Preventable adverse events (PAE) can be defined as a harmful outcome from an intervention that could have been prevented. For this study, we focused on potential PAEs resulting from a medication-gene interaction. RESULTS: The final model showed AUC scores of 0.972 with an F1 score of 0.97 with genetic data as compared to 0.766 and 0.73 respectively, without genetic data integration. DISCUSSION: Over 98% of people in the cohort were on at least one medication with CPIC level a guideline in their lifetime. We compared predictive power of machine learning models to detect a PAE between five modeling methods: Random Forest, Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), K Nearest neighbors (KNN), and Decision Tree. We found that XGBoost performed best for the prototype when genetic data was added to the framework and improved prediction of PAE. We compared area under the curve (AUC) between the models in the testing dataset. |
format | Online Article Text |
id | pubmed-9705957 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97059572022-11-30 Pharmacogenomics driven decision support prototype with machine learning: A framework for improving patient care Kidwai-Khan, Farah Rentsch, Christopher T. Pulk, Rebecca Alcorn, Charles Brandt, Cynthia A. Justice, Amy C. Front Big Data Big Data INTRODUCTION: A growing number of healthcare providers make complex treatment decisions guided by electronic health record (EHR) software interfaces. Many interfaces integrate multiple sources of data (e.g., labs, pharmacy, diagnoses) successfully, though relatively few have incorporated genetic data. METHOD: This study utilizes informatics methods with predictive modeling to create and validate algorithms to enable informed pharmacogenomic decision-making at the point of care in near real-time. The proposed framework integrates EHR and genetic data relevant to the patient's current medications including decision support mechanisms based on predictive modeling. We created a prototype with EHR and linked genetic data from the Department of Veterans Affairs (VA), the largest integrated healthcare system in the US. The EHR data included diagnoses, medication fills, and outpatient clinic visits for 2,600 people with HIV and matched uninfected controls linked to prototypic genetic data (variations in single or multiple positions in the DNA sequence). We then mapped the medications that patients were prescribed to medications defined in the drug-gene interaction mapping of the Clinical Pharmacogenomics Implementation Consortium's (CPIC) level A (i.e., sufficient evidence for at least one prescribing action) guidelines that predict adverse events. CPIC is a National Institute of Health funded group of experts who develop evidence based pharmacogenomic guidelines. Preventable adverse events (PAE) can be defined as a harmful outcome from an intervention that could have been prevented. For this study, we focused on potential PAEs resulting from a medication-gene interaction. RESULTS: The final model showed AUC scores of 0.972 with an F1 score of 0.97 with genetic data as compared to 0.766 and 0.73 respectively, without genetic data integration. DISCUSSION: Over 98% of people in the cohort were on at least one medication with CPIC level a guideline in their lifetime. We compared predictive power of machine learning models to detect a PAE between five modeling methods: Random Forest, Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), K Nearest neighbors (KNN), and Decision Tree. We found that XGBoost performed best for the prototype when genetic data was added to the framework and improved prediction of PAE. We compared area under the curve (AUC) between the models in the testing dataset. Frontiers Media S.A. 2022-11-15 /pmc/articles/PMC9705957/ /pubmed/36458283 http://dx.doi.org/10.3389/fdata.2022.1059088 Text en Copyright © 2022 Kidwai-Khan, Rentsch, Pulk, Alcorn, Brandt and Justice. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Big Data Kidwai-Khan, Farah Rentsch, Christopher T. Pulk, Rebecca Alcorn, Charles Brandt, Cynthia A. Justice, Amy C. Pharmacogenomics driven decision support prototype with machine learning: A framework for improving patient care |
title | Pharmacogenomics driven decision support prototype with machine learning: A framework for improving patient care |
title_full | Pharmacogenomics driven decision support prototype with machine learning: A framework for improving patient care |
title_fullStr | Pharmacogenomics driven decision support prototype with machine learning: A framework for improving patient care |
title_full_unstemmed | Pharmacogenomics driven decision support prototype with machine learning: A framework for improving patient care |
title_short | Pharmacogenomics driven decision support prototype with machine learning: A framework for improving patient care |
title_sort | pharmacogenomics driven decision support prototype with machine learning: a framework for improving patient care |
topic | Big Data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9705957/ https://www.ncbi.nlm.nih.gov/pubmed/36458283 http://dx.doi.org/10.3389/fdata.2022.1059088 |
work_keys_str_mv | AT kidwaikhanfarah pharmacogenomicsdrivendecisionsupportprototypewithmachinelearningaframeworkforimprovingpatientcare AT rentschchristophert pharmacogenomicsdrivendecisionsupportprototypewithmachinelearningaframeworkforimprovingpatientcare AT pulkrebecca pharmacogenomicsdrivendecisionsupportprototypewithmachinelearningaframeworkforimprovingpatientcare AT alcorncharles pharmacogenomicsdrivendecisionsupportprototypewithmachinelearningaframeworkforimprovingpatientcare AT brandtcynthiaa pharmacogenomicsdrivendecisionsupportprototypewithmachinelearningaframeworkforimprovingpatientcare AT justiceamyc pharmacogenomicsdrivendecisionsupportprototypewithmachinelearningaframeworkforimprovingpatientcare |