Cargando…

Machine Learning Approach for Active Vaccine Safety Monitoring

BACKGROUND: Vaccine safety surveillance is important because it is related to vaccine hesitancy, which affects vaccination rate. To increase confidence in vaccination, the active monitoring of vaccine adverse events is important. For effective active surveillance, we developed and verified a machine...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Yujeong, Jang, Jong-Hwan, Park, Namgi, Jeong, Na-Young, Lim, Eunsun, Kim, Soyun, Choi, Nam-Kyong, Yoon, Dukyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Korean Academy of Medical Sciences 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352788/
https://www.ncbi.nlm.nih.gov/pubmed/34402232
http://dx.doi.org/10.3346/jkms.2021.36.e198
_version_ 1783736260965171200
author Kim, Yujeong
Jang, Jong-Hwan
Park, Namgi
Jeong, Na-Young
Lim, Eunsun
Kim, Soyun
Choi, Nam-Kyong
Yoon, Dukyong
author_facet Kim, Yujeong
Jang, Jong-Hwan
Park, Namgi
Jeong, Na-Young
Lim, Eunsun
Kim, Soyun
Choi, Nam-Kyong
Yoon, Dukyong
author_sort Kim, Yujeong
collection PubMed
description BACKGROUND: Vaccine safety surveillance is important because it is related to vaccine hesitancy, which affects vaccination rate. To increase confidence in vaccination, the active monitoring of vaccine adverse events is important. For effective active surveillance, we developed and verified a machine learning-based active surveillance system using national claim data. METHODS: We used two databases, one from the Korea Disease Control and Prevention Agency, which contains flu vaccination records for the elderly, and another from the National Health Insurance Service, which contains the claim data of vaccinated people. We developed a case-crossover design based machine learning model to predict the health outcome of interest events (anaphylaxis and agranulocytosis) using a random forest. Feature importance values were evaluated to determine candidate associations with each outcome. We investigated the relationship of the features to each event via a literature review, comparison with the Side Effect Resource, and using the Local Interpretable Model-agnostic Explanation method. RESULTS: The trained model predicted each health outcome of interest with a high accuracy (approximately 70%). We found literature supporting our results, and most of the important drug-related features were listed in the Side Effect Resource database as inducing the health outcome of interest. For anaphylaxis, flu vaccination ranked high in our feature importance analysis and had a positive association in Local Interpretable Model-Agnostic Explanation analysis. Although the feature importance of vaccination was lower for agranulocytosis, it also had a positive relationship in the Local Interpretable Model-Agnostic Explanation analysis. CONCLUSION: We developed a machine learning-based active surveillance system for detecting possible factors that can induce adverse events using health claim and vaccination databases. The results of the study demonstrated a potentially useful application of two linked national health record databases. Our model can contribute to the establishment of a system for conducting active surveillance on vaccination.
format Online
Article
Text
id pubmed-8352788
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Korean Academy of Medical Sciences
record_format MEDLINE/PubMed
spelling pubmed-83527882021-08-12 Machine Learning Approach for Active Vaccine Safety Monitoring Kim, Yujeong Jang, Jong-Hwan Park, Namgi Jeong, Na-Young Lim, Eunsun Kim, Soyun Choi, Nam-Kyong Yoon, Dukyong J Korean Med Sci Original Article BACKGROUND: Vaccine safety surveillance is important because it is related to vaccine hesitancy, which affects vaccination rate. To increase confidence in vaccination, the active monitoring of vaccine adverse events is important. For effective active surveillance, we developed and verified a machine learning-based active surveillance system using national claim data. METHODS: We used two databases, one from the Korea Disease Control and Prevention Agency, which contains flu vaccination records for the elderly, and another from the National Health Insurance Service, which contains the claim data of vaccinated people. We developed a case-crossover design based machine learning model to predict the health outcome of interest events (anaphylaxis and agranulocytosis) using a random forest. Feature importance values were evaluated to determine candidate associations with each outcome. We investigated the relationship of the features to each event via a literature review, comparison with the Side Effect Resource, and using the Local Interpretable Model-agnostic Explanation method. RESULTS: The trained model predicted each health outcome of interest with a high accuracy (approximately 70%). We found literature supporting our results, and most of the important drug-related features were listed in the Side Effect Resource database as inducing the health outcome of interest. For anaphylaxis, flu vaccination ranked high in our feature importance analysis and had a positive association in Local Interpretable Model-Agnostic Explanation analysis. Although the feature importance of vaccination was lower for agranulocytosis, it also had a positive relationship in the Local Interpretable Model-Agnostic Explanation analysis. CONCLUSION: We developed a machine learning-based active surveillance system for detecting possible factors that can induce adverse events using health claim and vaccination databases. The results of the study demonstrated a potentially useful application of two linked national health record databases. Our model can contribute to the establishment of a system for conducting active surveillance on vaccination. The Korean Academy of Medical Sciences 2021-07-02 /pmc/articles/PMC8352788/ /pubmed/34402232 http://dx.doi.org/10.3346/jkms.2021.36.e198 Text en © 2021 The Korean Academy of Medical Sciences. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Kim, Yujeong
Jang, Jong-Hwan
Park, Namgi
Jeong, Na-Young
Lim, Eunsun
Kim, Soyun
Choi, Nam-Kyong
Yoon, Dukyong
Machine Learning Approach for Active Vaccine Safety Monitoring
title Machine Learning Approach for Active Vaccine Safety Monitoring
title_full Machine Learning Approach for Active Vaccine Safety Monitoring
title_fullStr Machine Learning Approach for Active Vaccine Safety Monitoring
title_full_unstemmed Machine Learning Approach for Active Vaccine Safety Monitoring
title_short Machine Learning Approach for Active Vaccine Safety Monitoring
title_sort machine learning approach for active vaccine safety monitoring
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352788/
https://www.ncbi.nlm.nih.gov/pubmed/34402232
http://dx.doi.org/10.3346/jkms.2021.36.e198
work_keys_str_mv AT kimyujeong machinelearningapproachforactivevaccinesafetymonitoring
AT jangjonghwan machinelearningapproachforactivevaccinesafetymonitoring
AT parknamgi machinelearningapproachforactivevaccinesafetymonitoring
AT jeongnayoung machinelearningapproachforactivevaccinesafetymonitoring
AT limeunsun machinelearningapproachforactivevaccinesafetymonitoring
AT kimsoyun machinelearningapproachforactivevaccinesafetymonitoring
AT choinamkyong machinelearningapproachforactivevaccinesafetymonitoring
AT yoondukyong machinelearningapproachforactivevaccinesafetymonitoring