Cargando…

Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data

Scientific progress has contributed to creating many devices to gather vast amounts of biomedical data over time. The goal of these devices is generally to monitor people's health conditions, diagnose, and prevent patients' diseases, for example, to discover cardiovascular disorders or pre...

Descripción completa

Detalles Bibliográficos
Autores principales: Maturo, Fabrizio, Verde, Rosanna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303904/
https://www.ncbi.nlm.nih.gov/pubmed/35184323
http://dx.doi.org/10.1002/sim.9353
_version_ 1784751980472696832
author Maturo, Fabrizio
Verde, Rosanna
author_facet Maturo, Fabrizio
Verde, Rosanna
author_sort Maturo, Fabrizio
collection PubMed
description Scientific progress has contributed to creating many devices to gather vast amounts of biomedical data over time. The goal of these devices is generally to monitor people's health conditions, diagnose, and prevent patients' diseases, for example, to discover cardiovascular disorders or predict epileptic seizures. A common way of investigating these data is classification, but these instruments generate signals often characterized by high dimensionality. Learning from these data is definitely a challenging task due to many issues, for example, the trade‐off between complexity and accuracy and the course of dimensionality. This study proposes a supervised classification method based on the joint use of functional data analysis, classification trees, and random forest to deal with massive biomedical data recorded over time. For this purpose, this research suggests different original tools to extract features and train functional classifiers, interpret the classification rules, assess leaves' quality and composition, avoid the classical drawbacks due to the COD, and improve the accuracy of the functional classifiers. Focusing on ECG data as a possible example, the final purpose of this study is to offer an original approach to identify and classify patients at risk using different types of biomedical signals. The results confirm that this line of research is exciting; indeed, the interpretative tools show evidence to be very useful for understanding classification rules. Furthermore, the performance of the proposed functional classifier, in terms of accuracy, is excellent because the latter breaks the previous classification record regarding a well‐known ECG dataset.
format Online
Article
Text
id pubmed-9303904
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-93039042022-07-28 Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data Maturo, Fabrizio Verde, Rosanna Stat Med Research Articles Scientific progress has contributed to creating many devices to gather vast amounts of biomedical data over time. The goal of these devices is generally to monitor people's health conditions, diagnose, and prevent patients' diseases, for example, to discover cardiovascular disorders or predict epileptic seizures. A common way of investigating these data is classification, but these instruments generate signals often characterized by high dimensionality. Learning from these data is definitely a challenging task due to many issues, for example, the trade‐off between complexity and accuracy and the course of dimensionality. This study proposes a supervised classification method based on the joint use of functional data analysis, classification trees, and random forest to deal with massive biomedical data recorded over time. For this purpose, this research suggests different original tools to extract features and train functional classifiers, interpret the classification rules, assess leaves' quality and composition, avoid the classical drawbacks due to the COD, and improve the accuracy of the functional classifiers. Focusing on ECG data as a possible example, the final purpose of this study is to offer an original approach to identify and classify patients at risk using different types of biomedical signals. The results confirm that this line of research is exciting; indeed, the interpretative tools show evidence to be very useful for understanding classification rules. Furthermore, the performance of the proposed functional classifier, in terms of accuracy, is excellent because the latter breaks the previous classification record regarding a well‐known ECG dataset. John Wiley and Sons Inc. 2022-02-20 2022-05-30 /pmc/articles/PMC9303904/ /pubmed/35184323 http://dx.doi.org/10.1002/sim.9353 Text en © 2022 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Maturo, Fabrizio
Verde, Rosanna
Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data
title Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data
title_full Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data
title_fullStr Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data
title_full_unstemmed Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data
title_short Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data
title_sort pooling random forest and functional data analysis for biomedical signals supervised classification: theory and application to electrocardiogram data
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303904/
https://www.ncbi.nlm.nih.gov/pubmed/35184323
http://dx.doi.org/10.1002/sim.9353
work_keys_str_mv AT maturofabrizio poolingrandomforestandfunctionaldataanalysisforbiomedicalsignalssupervisedclassificationtheoryandapplicationtoelectrocardiogramdata
AT verderosanna poolingrandomforestandfunctionaldataanalysisforbiomedicalsignalssupervisedclassificationtheoryandapplicationtoelectrocardiogramdata