Cargando…
Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data
Scientific progress has contributed to creating many devices to gather vast amounts of biomedical data over time. The goal of these devices is generally to monitor people's health conditions, diagnose, and prevent patients' diseases, for example, to discover cardiovascular disorders or pre...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303904/ https://www.ncbi.nlm.nih.gov/pubmed/35184323 http://dx.doi.org/10.1002/sim.9353 |
_version_ | 1784751980472696832 |
---|---|
author | Maturo, Fabrizio Verde, Rosanna |
author_facet | Maturo, Fabrizio Verde, Rosanna |
author_sort | Maturo, Fabrizio |
collection | PubMed |
description | Scientific progress has contributed to creating many devices to gather vast amounts of biomedical data over time. The goal of these devices is generally to monitor people's health conditions, diagnose, and prevent patients' diseases, for example, to discover cardiovascular disorders or predict epileptic seizures. A common way of investigating these data is classification, but these instruments generate signals often characterized by high dimensionality. Learning from these data is definitely a challenging task due to many issues, for example, the trade‐off between complexity and accuracy and the course of dimensionality. This study proposes a supervised classification method based on the joint use of functional data analysis, classification trees, and random forest to deal with massive biomedical data recorded over time. For this purpose, this research suggests different original tools to extract features and train functional classifiers, interpret the classification rules, assess leaves' quality and composition, avoid the classical drawbacks due to the COD, and improve the accuracy of the functional classifiers. Focusing on ECG data as a possible example, the final purpose of this study is to offer an original approach to identify and classify patients at risk using different types of biomedical signals. The results confirm that this line of research is exciting; indeed, the interpretative tools show evidence to be very useful for understanding classification rules. Furthermore, the performance of the proposed functional classifier, in terms of accuracy, is excellent because the latter breaks the previous classification record regarding a well‐known ECG dataset. |
format | Online Article Text |
id | pubmed-9303904 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93039042022-07-28 Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data Maturo, Fabrizio Verde, Rosanna Stat Med Research Articles Scientific progress has contributed to creating many devices to gather vast amounts of biomedical data over time. The goal of these devices is generally to monitor people's health conditions, diagnose, and prevent patients' diseases, for example, to discover cardiovascular disorders or predict epileptic seizures. A common way of investigating these data is classification, but these instruments generate signals often characterized by high dimensionality. Learning from these data is definitely a challenging task due to many issues, for example, the trade‐off between complexity and accuracy and the course of dimensionality. This study proposes a supervised classification method based on the joint use of functional data analysis, classification trees, and random forest to deal with massive biomedical data recorded over time. For this purpose, this research suggests different original tools to extract features and train functional classifiers, interpret the classification rules, assess leaves' quality and composition, avoid the classical drawbacks due to the COD, and improve the accuracy of the functional classifiers. Focusing on ECG data as a possible example, the final purpose of this study is to offer an original approach to identify and classify patients at risk using different types of biomedical signals. The results confirm that this line of research is exciting; indeed, the interpretative tools show evidence to be very useful for understanding classification rules. Furthermore, the performance of the proposed functional classifier, in terms of accuracy, is excellent because the latter breaks the previous classification record regarding a well‐known ECG dataset. John Wiley and Sons Inc. 2022-02-20 2022-05-30 /pmc/articles/PMC9303904/ /pubmed/35184323 http://dx.doi.org/10.1002/sim.9353 Text en © 2022 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Maturo, Fabrizio Verde, Rosanna Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data |
title | Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data |
title_full | Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data |
title_fullStr | Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data |
title_full_unstemmed | Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data |
title_short | Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data |
title_sort | pooling random forest and functional data analysis for biomedical signals supervised classification: theory and application to electrocardiogram data |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303904/ https://www.ncbi.nlm.nih.gov/pubmed/35184323 http://dx.doi.org/10.1002/sim.9353 |
work_keys_str_mv | AT maturofabrizio poolingrandomforestandfunctionaldataanalysisforbiomedicalsignalssupervisedclassificationtheoryandapplicationtoelectrocardiogramdata AT verderosanna poolingrandomforestandfunctionaldataanalysisforbiomedicalsignalssupervisedclassificationtheoryandapplicationtoelectrocardiogramdata |