Cargando…
Prediction of hematocrit through imbalanced dataset of blood spectra
In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced d...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024026/ https://www.ncbi.nlm.nih.gov/pubmed/33850628 http://dx.doi.org/10.1049/htl2.12006 |
_version_ | 1783675226691731456 |
---|---|
author | Decaro, Cristoforo Montanari, Giovanni Battista Bianconi, Marco Bellanca, Gaetano |
author_facet | Decaro, Cristoforo Montanari, Giovanni Battista Bianconi, Marco Bellanca, Gaetano |
author_sort | Decaro, Cristoforo |
collection | PubMed |
description | In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced dataset occurs when the distribution of data is not uniform. This makes harder the implementation of accurate models. In this paper, intelligent models are implemented to predict the hematocrit level of blood starting from visible spectral data. The aim of this work is to show the effects of two balancing techniques (SMOTE and SMOTE+ENN) on the imbalanced dataset of blood spectra. Four different machine learning systems are fitted with imbalanced and balanced datasets and their performances are compared showing an improvement, in terms of accuracy, due to the use of balancing. |
format | Online Article Text |
id | pubmed-8024026 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-80240262021-04-12 Prediction of hematocrit through imbalanced dataset of blood spectra Decaro, Cristoforo Montanari, Giovanni Battista Bianconi, Marco Bellanca, Gaetano Healthc Technol Lett Original Research Papers In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced dataset occurs when the distribution of data is not uniform. This makes harder the implementation of accurate models. In this paper, intelligent models are implemented to predict the hematocrit level of blood starting from visible spectral data. The aim of this work is to show the effects of two balancing techniques (SMOTE and SMOTE+ENN) on the imbalanced dataset of blood spectra. Four different machine learning systems are fitted with imbalanced and balanced datasets and their performances are compared showing an improvement, in terms of accuracy, due to the use of balancing. John Wiley and Sons Inc. 2021-04-06 /pmc/articles/PMC8024026/ /pubmed/33850628 http://dx.doi.org/10.1049/htl2.12006 Text en © 2021 The Authors. Healthcare Technology Letters published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Research Papers Decaro, Cristoforo Montanari, Giovanni Battista Bianconi, Marco Bellanca, Gaetano Prediction of hematocrit through imbalanced dataset of blood spectra |
title | Prediction of hematocrit through imbalanced dataset of blood spectra |
title_full | Prediction of hematocrit through imbalanced dataset of blood spectra |
title_fullStr | Prediction of hematocrit through imbalanced dataset of blood spectra |
title_full_unstemmed | Prediction of hematocrit through imbalanced dataset of blood spectra |
title_short | Prediction of hematocrit through imbalanced dataset of blood spectra |
title_sort | prediction of hematocrit through imbalanced dataset of blood spectra |
topic | Original Research Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024026/ https://www.ncbi.nlm.nih.gov/pubmed/33850628 http://dx.doi.org/10.1049/htl2.12006 |
work_keys_str_mv | AT decarocristoforo predictionofhematocritthroughimbalanceddatasetofbloodspectra AT montanarigiovannibattista predictionofhematocritthroughimbalanceddatasetofbloodspectra AT bianconimarco predictionofhematocritthroughimbalanceddatasetofbloodspectra AT bellancagaetano predictionofhematocritthroughimbalanceddatasetofbloodspectra |