Cargando…

A full pipeline of diagnosis and prognosis the risk of chronic diseases using deep learning and Shapley values: The Ravansar county anthropometric cohort study

Anthropometry is a Greek word that consists of the two words “Anthropo” meaning human species and “metery” meaning measurement. It is a science that deals with the size of the body including the dimensions of different parts, the field of motion and the strength of the muscles of the body. Specific...

Descripción completa

Detalles Bibliográficos
Autores principales: Jafari, Habib, Shohaimi, Shamarina, Salari, Nader, Kiaei, Ali Akbar, Najafi, Farid, Khazaei, Soleiman, Niaparast, Mehrdad, Abdollahi, Anita, Mohammadi, Masoud
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8775210/
https://www.ncbi.nlm.nih.gov/pubmed/35051240
http://dx.doi.org/10.1371/journal.pone.0262701
_version_ 1784636529569693696
author Jafari, Habib
Shohaimi, Shamarina
Salari, Nader
Kiaei, Ali Akbar
Najafi, Farid
Khazaei, Soleiman
Niaparast, Mehrdad
Abdollahi, Anita
Mohammadi, Masoud
author_facet Jafari, Habib
Shohaimi, Shamarina
Salari, Nader
Kiaei, Ali Akbar
Najafi, Farid
Khazaei, Soleiman
Niaparast, Mehrdad
Abdollahi, Anita
Mohammadi, Masoud
author_sort Jafari, Habib
collection PubMed
description Anthropometry is a Greek word that consists of the two words “Anthropo” meaning human species and “metery” meaning measurement. It is a science that deals with the size of the body including the dimensions of different parts, the field of motion and the strength of the muscles of the body. Specific individual dimensions such as heights, widths, depths, distances, environments and curvatures are usually measured. In this article, we investigate the anthropometric characteristics of patients with chronic diseases (diabetes, hypertension, cardiovascular disease, heart attacks and strokes) and find the factors affecting these diseases and the extent of the impact of each to make the necessary planning. We have focused on cohort studies for 10047 qualified participants from Ravansar County. Machine learning provides opportunities to improve discrimination through the analysis of complex interactions between broad variables. Among the chronic diseases in this cohort study, we have used three deep neural network models for diagnosis and prognosis of the risk of type 2 diabetes mellitus (T2DM) as a case study. Usually in Artificial Intelligence for medicine tasks, Imbalanced data is an important issue in learning and ignoring that leads to false evaluation results. Also, the accuracy evaluation criterion was not appropriate for this task, because a simple model that is labeling all samples negatively has high accuracy. So, the evaluation criteria of precession, recall, AUC, and AUPRC were considered. Then, the importance of variables in general was examined to determine which features are more important in the risk of T2DM. Finally, personality feature was added, in which individual feature importance was examined. Performing by Shapley Values, the model is tuned for each patient so that it can be used for prognosis of T2DM risk for that patient. In this paper, we have focused and implemented a full pipeline of Data Creation, Data Preprocessing, Handling Imbalanced Data, Deep Learning model, true Evaluation method, Feature Importance and Individual Feature Importance. Through the results, the pipeline demonstrated competence in improving the Diagnosis and Prognosis the risk of T2DM with personalization capability.
format Online
Article
Text
id pubmed-8775210
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87752102022-01-21 A full pipeline of diagnosis and prognosis the risk of chronic diseases using deep learning and Shapley values: The Ravansar county anthropometric cohort study Jafari, Habib Shohaimi, Shamarina Salari, Nader Kiaei, Ali Akbar Najafi, Farid Khazaei, Soleiman Niaparast, Mehrdad Abdollahi, Anita Mohammadi, Masoud PLoS One Research Article Anthropometry is a Greek word that consists of the two words “Anthropo” meaning human species and “metery” meaning measurement. It is a science that deals with the size of the body including the dimensions of different parts, the field of motion and the strength of the muscles of the body. Specific individual dimensions such as heights, widths, depths, distances, environments and curvatures are usually measured. In this article, we investigate the anthropometric characteristics of patients with chronic diseases (diabetes, hypertension, cardiovascular disease, heart attacks and strokes) and find the factors affecting these diseases and the extent of the impact of each to make the necessary planning. We have focused on cohort studies for 10047 qualified participants from Ravansar County. Machine learning provides opportunities to improve discrimination through the analysis of complex interactions between broad variables. Among the chronic diseases in this cohort study, we have used three deep neural network models for diagnosis and prognosis of the risk of type 2 diabetes mellitus (T2DM) as a case study. Usually in Artificial Intelligence for medicine tasks, Imbalanced data is an important issue in learning and ignoring that leads to false evaluation results. Also, the accuracy evaluation criterion was not appropriate for this task, because a simple model that is labeling all samples negatively has high accuracy. So, the evaluation criteria of precession, recall, AUC, and AUPRC were considered. Then, the importance of variables in general was examined to determine which features are more important in the risk of T2DM. Finally, personality feature was added, in which individual feature importance was examined. Performing by Shapley Values, the model is tuned for each patient so that it can be used for prognosis of T2DM risk for that patient. In this paper, we have focused and implemented a full pipeline of Data Creation, Data Preprocessing, Handling Imbalanced Data, Deep Learning model, true Evaluation method, Feature Importance and Individual Feature Importance. Through the results, the pipeline demonstrated competence in improving the Diagnosis and Prognosis the risk of T2DM with personalization capability. Public Library of Science 2022-01-20 /pmc/articles/PMC8775210/ /pubmed/35051240 http://dx.doi.org/10.1371/journal.pone.0262701 Text en © 2022 Jafari et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Jafari, Habib
Shohaimi, Shamarina
Salari, Nader
Kiaei, Ali Akbar
Najafi, Farid
Khazaei, Soleiman
Niaparast, Mehrdad
Abdollahi, Anita
Mohammadi, Masoud
A full pipeline of diagnosis and prognosis the risk of chronic diseases using deep learning and Shapley values: The Ravansar county anthropometric cohort study
title A full pipeline of diagnosis and prognosis the risk of chronic diseases using deep learning and Shapley values: The Ravansar county anthropometric cohort study
title_full A full pipeline of diagnosis and prognosis the risk of chronic diseases using deep learning and Shapley values: The Ravansar county anthropometric cohort study
title_fullStr A full pipeline of diagnosis and prognosis the risk of chronic diseases using deep learning and Shapley values: The Ravansar county anthropometric cohort study
title_full_unstemmed A full pipeline of diagnosis and prognosis the risk of chronic diseases using deep learning and Shapley values: The Ravansar county anthropometric cohort study
title_short A full pipeline of diagnosis and prognosis the risk of chronic diseases using deep learning and Shapley values: The Ravansar county anthropometric cohort study
title_sort full pipeline of diagnosis and prognosis the risk of chronic diseases using deep learning and shapley values: the ravansar county anthropometric cohort study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8775210/
https://www.ncbi.nlm.nih.gov/pubmed/35051240
http://dx.doi.org/10.1371/journal.pone.0262701
work_keys_str_mv AT jafarihabib afullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT shohaimishamarina afullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT salarinader afullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT kiaeialiakbar afullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT najafifarid afullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT khazaeisoleiman afullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT niaparastmehrdad afullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT abdollahianita afullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT mohammadimasoud afullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT jafarihabib fullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT shohaimishamarina fullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT salarinader fullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT kiaeialiakbar fullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT najafifarid fullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT khazaeisoleiman fullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT niaparastmehrdad fullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT abdollahianita fullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy
AT mohammadimasoud fullpipelineofdiagnosisandprognosistheriskofchronicdiseasesusingdeeplearningandshapleyvaluestheravansarcountyanthropometriccohortstudy