Cargando…

Prediction of hematocrit through imbalanced dataset of blood spectra

In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced d...

Descripción completa

Detalles Bibliográficos
Autores principales: Decaro, Cristoforo, Montanari, Giovanni Battista, Bianconi, Marco, Bellanca, Gaetano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024026/
https://www.ncbi.nlm.nih.gov/pubmed/33850628
http://dx.doi.org/10.1049/htl2.12006
_version_ 1783675226691731456
author Decaro, Cristoforo
Montanari, Giovanni Battista
Bianconi, Marco
Bellanca, Gaetano
author_facet Decaro, Cristoforo
Montanari, Giovanni Battista
Bianconi, Marco
Bellanca, Gaetano
author_sort Decaro, Cristoforo
collection PubMed
description In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced dataset occurs when the distribution of data is not uniform. This makes harder the implementation of accurate models. In this paper, intelligent models are implemented to predict the hematocrit level of blood starting from visible spectral data. The aim of this work is to show the effects of two balancing techniques (SMOTE and SMOTE+ENN) on the imbalanced dataset of blood spectra. Four different machine learning systems are fitted with imbalanced and balanced datasets and their performances are compared showing an improvement, in terms of accuracy, due to the use of balancing.
format Online
Article
Text
id pubmed-8024026
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-80240262021-04-12 Prediction of hematocrit through imbalanced dataset of blood spectra Decaro, Cristoforo Montanari, Giovanni Battista Bianconi, Marco Bellanca, Gaetano Healthc Technol Lett Original Research Papers In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced dataset occurs when the distribution of data is not uniform. This makes harder the implementation of accurate models. In this paper, intelligent models are implemented to predict the hematocrit level of blood starting from visible spectral data. The aim of this work is to show the effects of two balancing techniques (SMOTE and SMOTE+ENN) on the imbalanced dataset of blood spectra. Four different machine learning systems are fitted with imbalanced and balanced datasets and their performances are compared showing an improvement, in terms of accuracy, due to the use of balancing. John Wiley and Sons Inc. 2021-04-06 /pmc/articles/PMC8024026/ /pubmed/33850628 http://dx.doi.org/10.1049/htl2.12006 Text en © 2021 The Authors. Healthcare Technology Letters published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research Papers
Decaro, Cristoforo
Montanari, Giovanni Battista
Bianconi, Marco
Bellanca, Gaetano
Prediction of hematocrit through imbalanced dataset of blood spectra
title Prediction of hematocrit through imbalanced dataset of blood spectra
title_full Prediction of hematocrit through imbalanced dataset of blood spectra
title_fullStr Prediction of hematocrit through imbalanced dataset of blood spectra
title_full_unstemmed Prediction of hematocrit through imbalanced dataset of blood spectra
title_short Prediction of hematocrit through imbalanced dataset of blood spectra
title_sort prediction of hematocrit through imbalanced dataset of blood spectra
topic Original Research Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024026/
https://www.ncbi.nlm.nih.gov/pubmed/33850628
http://dx.doi.org/10.1049/htl2.12006
work_keys_str_mv AT decarocristoforo predictionofhematocritthroughimbalanceddatasetofbloodspectra
AT montanarigiovannibattista predictionofhematocritthroughimbalanceddatasetofbloodspectra
AT bianconimarco predictionofhematocritthroughimbalanceddatasetofbloodspectra
AT bellancagaetano predictionofhematocritthroughimbalanceddatasetofbloodspectra