Cargando…
Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias
In order to determine the site of origin (SOO) in outflow tract ventricular arrhythmias (OTVAs) before an ablation procedure, several algorithms based on manual identification of electrocardiogram (ECG) features, have been developed. However, the reported accuracy decreases when tested with differen...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9412034/ https://www.ncbi.nlm.nih.gov/pubmed/36035489 http://dx.doi.org/10.3389/fphys.2022.909372 |
_version_ | 1784775400613740544 |
---|---|
author | Doste, Ruben Lozano, Miguel Jimenez-Perez, Guillermo Mont, Lluis Berruezo, Antonio Penela, Diego Camara, Oscar Sebastian, Rafael |
author_facet | Doste, Ruben Lozano, Miguel Jimenez-Perez, Guillermo Mont, Lluis Berruezo, Antonio Penela, Diego Camara, Oscar Sebastian, Rafael |
author_sort | Doste, Ruben |
collection | PubMed |
description | In order to determine the site of origin (SOO) in outflow tract ventricular arrhythmias (OTVAs) before an ablation procedure, several algorithms based on manual identification of electrocardiogram (ECG) features, have been developed. However, the reported accuracy decreases when tested with different datasets. Machine learning algorithms can automatize the process and improve generalization, but their performance is hampered by the lack of large enough OTVA databases. We propose the use of detailed electrophysiological simulations of OTVAs to train a machine learning classification model to predict the ventricular origin of the SOO of ectopic beats. We generated a synthetic database of 12-lead ECGs (2,496 signals) by running multiple simulations from the most typical OTVA SOO in 16 patient-specific geometries. Two types of input data were considered in the classification, raw and feature ECG signals. From the simulated raw 12-lead ECG, we analyzed the contribution of each lead in the predictions, keeping the best ones for the training process. For feature-based analysis, we used entropy-based methods to rank the obtained features. A cross-validation process was included to evaluate the machine learning model. Following, two clinical OTVA databases from different hospitals, including ECGs from 365 patients, were used as test-sets to assess the generalization of the proposed approach. The results show that V2 was the best lead for classification. Prediction of the SOO in OTVA, using both raw signals or features for classification, presented high accuracy values (>0.96). Generalization of the network trained on simulated data was good for both patient datasets (accuracy of 0.86 and 0.84, respectively) and presented better values than using exclusively real ECGs for classification (accuracy of 0.84 and 0.76 for each dataset). The use of simulated ECG data for training machine learning-based classification algorithms is critical to obtain good SOO predictions in OTVA compared to real data alone. The fast implementation and generalization of the proposed methodology may contribute towards its application to a clinical routine. |
format | Online Article Text |
id | pubmed-9412034 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-94120342022-08-27 Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias Doste, Ruben Lozano, Miguel Jimenez-Perez, Guillermo Mont, Lluis Berruezo, Antonio Penela, Diego Camara, Oscar Sebastian, Rafael Front Physiol Physiology In order to determine the site of origin (SOO) in outflow tract ventricular arrhythmias (OTVAs) before an ablation procedure, several algorithms based on manual identification of electrocardiogram (ECG) features, have been developed. However, the reported accuracy decreases when tested with different datasets. Machine learning algorithms can automatize the process and improve generalization, but their performance is hampered by the lack of large enough OTVA databases. We propose the use of detailed electrophysiological simulations of OTVAs to train a machine learning classification model to predict the ventricular origin of the SOO of ectopic beats. We generated a synthetic database of 12-lead ECGs (2,496 signals) by running multiple simulations from the most typical OTVA SOO in 16 patient-specific geometries. Two types of input data were considered in the classification, raw and feature ECG signals. From the simulated raw 12-lead ECG, we analyzed the contribution of each lead in the predictions, keeping the best ones for the training process. For feature-based analysis, we used entropy-based methods to rank the obtained features. A cross-validation process was included to evaluate the machine learning model. Following, two clinical OTVA databases from different hospitals, including ECGs from 365 patients, were used as test-sets to assess the generalization of the proposed approach. The results show that V2 was the best lead for classification. Prediction of the SOO in OTVA, using both raw signals or features for classification, presented high accuracy values (>0.96). Generalization of the network trained on simulated data was good for both patient datasets (accuracy of 0.86 and 0.84, respectively) and presented better values than using exclusively real ECGs for classification (accuracy of 0.84 and 0.76 for each dataset). The use of simulated ECG data for training machine learning-based classification algorithms is critical to obtain good SOO predictions in OTVA compared to real data alone. The fast implementation and generalization of the proposed methodology may contribute towards its application to a clinical routine. Frontiers Media S.A. 2022-08-12 /pmc/articles/PMC9412034/ /pubmed/36035489 http://dx.doi.org/10.3389/fphys.2022.909372 Text en Copyright © 2022 Doste, Lozano, Jimenez-Perez, Mont, Berruezo, Penela, Camara and Sebastian. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Physiology Doste, Ruben Lozano, Miguel Jimenez-Perez, Guillermo Mont, Lluis Berruezo, Antonio Penela, Diego Camara, Oscar Sebastian, Rafael Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias |
title | Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias |
title_full | Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias |
title_fullStr | Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias |
title_full_unstemmed | Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias |
title_short | Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias |
title_sort | training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias |
topic | Physiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9412034/ https://www.ncbi.nlm.nih.gov/pubmed/36035489 http://dx.doi.org/10.3389/fphys.2022.909372 |
work_keys_str_mv | AT dosteruben trainingmachinelearningmodelswithsyntheticdataimprovesthepredictionofventricularorigininoutflowtractventriculararrhythmias AT lozanomiguel trainingmachinelearningmodelswithsyntheticdataimprovesthepredictionofventricularorigininoutflowtractventriculararrhythmias AT jimenezperezguillermo trainingmachinelearningmodelswithsyntheticdataimprovesthepredictionofventricularorigininoutflowtractventriculararrhythmias AT montlluis trainingmachinelearningmodelswithsyntheticdataimprovesthepredictionofventricularorigininoutflowtractventriculararrhythmias AT berruezoantonio trainingmachinelearningmodelswithsyntheticdataimprovesthepredictionofventricularorigininoutflowtractventriculararrhythmias AT peneladiego trainingmachinelearningmodelswithsyntheticdataimprovesthepredictionofventricularorigininoutflowtractventriculararrhythmias AT camaraoscar trainingmachinelearningmodelswithsyntheticdataimprovesthepredictionofventricularorigininoutflowtractventriculararrhythmias AT sebastianrafael trainingmachinelearningmodelswithsyntheticdataimprovesthepredictionofventricularorigininoutflowtractventriculararrhythmias |