Cargando…
An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques
The rising risk of diabetes, particularly in emerging countries, highlights the importance of early detection. Manual prediction can be a challenging task, leading to the need for automatic approaches. The major challenge with biomedical datasets is data scarcity. Biomedical data is often difficult...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10544445/ https://www.ncbi.nlm.nih.gov/pubmed/37784049 http://dx.doi.org/10.1186/s12859-023-05488-6 |
_version_ | 1785114505995354112 |
---|---|
author | Patro, Kiran Kumar Allam, Jaya Prakash Sanapala, Umamaheswararao Marpu, Chaitanya Kumar Samee, Nagwan Abdel Alabdulhafith, Maali Plawiak, Pawel |
author_facet | Patro, Kiran Kumar Allam, Jaya Prakash Sanapala, Umamaheswararao Marpu, Chaitanya Kumar Samee, Nagwan Abdel Alabdulhafith, Maali Plawiak, Pawel |
author_sort | Patro, Kiran Kumar |
collection | PubMed |
description | The rising risk of diabetes, particularly in emerging countries, highlights the importance of early detection. Manual prediction can be a challenging task, leading to the need for automatic approaches. The major challenge with biomedical datasets is data scarcity. Biomedical data is often difficult to obtain in large quantities, which can limit the ability to train deep learning models effectively. Biomedical data can be noisy and inconsistent, which can make it difficult to train accurate models. To overcome the above-mentioned challenges, this work presents a new framework for data modeling that is based on correlation measures between features and can be used to process data effectively for predicting diabetes. The standard, publicly available Pima Indians Medical Diabetes (PIMA) dataset is utilized to verify the effectiveness of the proposed techniques. Experiments using the PIMA dataset showed that the proposed data modeling method improved the accuracy of machine learning models by an average of 9%, with deep convolutional neural network models achieving an accuracy of 96.13%. Overall, this study demonstrates the effectiveness of the proposed strategy in the early and reliable prediction of diabetes. |
format | Online Article Text |
id | pubmed-10544445 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-105444452023-10-03 An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques Patro, Kiran Kumar Allam, Jaya Prakash Sanapala, Umamaheswararao Marpu, Chaitanya Kumar Samee, Nagwan Abdel Alabdulhafith, Maali Plawiak, Pawel BMC Bioinformatics Research The rising risk of diabetes, particularly in emerging countries, highlights the importance of early detection. Manual prediction can be a challenging task, leading to the need for automatic approaches. The major challenge with biomedical datasets is data scarcity. Biomedical data is often difficult to obtain in large quantities, which can limit the ability to train deep learning models effectively. Biomedical data can be noisy and inconsistent, which can make it difficult to train accurate models. To overcome the above-mentioned challenges, this work presents a new framework for data modeling that is based on correlation measures between features and can be used to process data effectively for predicting diabetes. The standard, publicly available Pima Indians Medical Diabetes (PIMA) dataset is utilized to verify the effectiveness of the proposed techniques. Experiments using the PIMA dataset showed that the proposed data modeling method improved the accuracy of machine learning models by an average of 9%, with deep convolutional neural network models achieving an accuracy of 96.13%. Overall, this study demonstrates the effectiveness of the proposed strategy in the early and reliable prediction of diabetes. BioMed Central 2023-10-02 /pmc/articles/PMC10544445/ /pubmed/37784049 http://dx.doi.org/10.1186/s12859-023-05488-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Patro, Kiran Kumar Allam, Jaya Prakash Sanapala, Umamaheswararao Marpu, Chaitanya Kumar Samee, Nagwan Abdel Alabdulhafith, Maali Plawiak, Pawel An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques |
title | An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques |
title_full | An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques |
title_fullStr | An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques |
title_full_unstemmed | An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques |
title_short | An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques |
title_sort | effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10544445/ https://www.ncbi.nlm.nih.gov/pubmed/37784049 http://dx.doi.org/10.1186/s12859-023-05488-6 |
work_keys_str_mv | AT patrokirankumar aneffectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT allamjayaprakash aneffectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT sanapalaumamaheswararao aneffectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT marpuchaitanyakumar aneffectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT sameenagwanabdel aneffectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT alabdulhafithmaali aneffectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT plawiakpawel aneffectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT patrokirankumar effectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT allamjayaprakash effectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT sanapalaumamaheswararao effectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT marpuchaitanyakumar effectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT sameenagwanabdel effectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT alabdulhafithmaali effectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques AT plawiakpawel effectivecorrelationbaseddatamodelingframeworkforautomaticdiabetespredictionusingmachineanddeeplearningtechniques |