Cargando…

Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques

SIMPLE SUMMARY: The study presents a thyroid disease prediction approach which utilizes random forest-based features to obtain high accuracy. The approach can obtain a 0.99 accuracy to predict ten thyroid diseases. ABSTRACT: Thyroid disease prediction has emerged as an important task recently. Despi...

Descripción completa

Detalles Bibliográficos
Autores principales: Chaganti, Rajasekhar, Rustam, Furqan, De La Torre Díez, Isabel, Mazón, Juan Luis Vidal, Rodríguez, Carmen Lili, Ashraf, Imran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9405591/
https://www.ncbi.nlm.nih.gov/pubmed/36010907
http://dx.doi.org/10.3390/cancers14163914
_version_ 1784773915294302208
author Chaganti, Rajasekhar
Rustam, Furqan
De La Torre Díez, Isabel
Mazón, Juan Luis Vidal
Rodríguez, Carmen Lili
Ashraf, Imran
author_facet Chaganti, Rajasekhar
Rustam, Furqan
De La Torre Díez, Isabel
Mazón, Juan Luis Vidal
Rodríguez, Carmen Lili
Ashraf, Imran
author_sort Chaganti, Rajasekhar
collection PubMed
description SIMPLE SUMMARY: The study presents a thyroid disease prediction approach which utilizes random forest-based features to obtain high accuracy. The approach can obtain a 0.99 accuracy to predict ten thyroid diseases. ABSTRACT: Thyroid disease prediction has emerged as an important task recently. Despite existing approaches for its diagnosis, often the target is binary classification, the used datasets are small-sized and results are not validated either. Predominantly, existing approaches focus on model optimization and the feature engineering part is less investigated. To overcome these limitations, this study presents an approach that investigates feature engineering for machine learning and deep learning models. Forward feature selection, backward feature elimination, bidirectional feature elimination, and machine learning-based feature selection using extra tree classifiers are adopted. The proposed approach can predict Hashimoto’s thyroiditis (primary hypothyroid), binding protein (increased binding protein), autoimmune thyroiditis (compensated hypothyroid), and non-thyroidal syndrome (NTIS) (concurrent non-thyroidal illness). Extensive experiments show that the extra tree classifier-based selected feature yields the best results with 0.99 accuracy and an F1 score when used with the random forest classifier. Results suggest that the machine learning models are a better choice for thyroid disease detection regarding the provided accuracy and the computational complexity. K-fold cross-validation and performance comparison with existing studies corroborate the superior performance of the proposed approach.
format Online
Article
Text
id pubmed-9405591
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94055912022-08-26 Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques Chaganti, Rajasekhar Rustam, Furqan De La Torre Díez, Isabel Mazón, Juan Luis Vidal Rodríguez, Carmen Lili Ashraf, Imran Cancers (Basel) Article SIMPLE SUMMARY: The study presents a thyroid disease prediction approach which utilizes random forest-based features to obtain high accuracy. The approach can obtain a 0.99 accuracy to predict ten thyroid diseases. ABSTRACT: Thyroid disease prediction has emerged as an important task recently. Despite existing approaches for its diagnosis, often the target is binary classification, the used datasets are small-sized and results are not validated either. Predominantly, existing approaches focus on model optimization and the feature engineering part is less investigated. To overcome these limitations, this study presents an approach that investigates feature engineering for machine learning and deep learning models. Forward feature selection, backward feature elimination, bidirectional feature elimination, and machine learning-based feature selection using extra tree classifiers are adopted. The proposed approach can predict Hashimoto’s thyroiditis (primary hypothyroid), binding protein (increased binding protein), autoimmune thyroiditis (compensated hypothyroid), and non-thyroidal syndrome (NTIS) (concurrent non-thyroidal illness). Extensive experiments show that the extra tree classifier-based selected feature yields the best results with 0.99 accuracy and an F1 score when used with the random forest classifier. Results suggest that the machine learning models are a better choice for thyroid disease detection regarding the provided accuracy and the computational complexity. K-fold cross-validation and performance comparison with existing studies corroborate the superior performance of the proposed approach. MDPI 2022-08-13 /pmc/articles/PMC9405591/ /pubmed/36010907 http://dx.doi.org/10.3390/cancers14163914 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chaganti, Rajasekhar
Rustam, Furqan
De La Torre Díez, Isabel
Mazón, Juan Luis Vidal
Rodríguez, Carmen Lili
Ashraf, Imran
Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques
title Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques
title_full Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques
title_fullStr Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques
title_full_unstemmed Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques
title_short Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques
title_sort thyroid disease prediction using selective features and machine learning techniques
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9405591/
https://www.ncbi.nlm.nih.gov/pubmed/36010907
http://dx.doi.org/10.3390/cancers14163914
work_keys_str_mv AT chagantirajasekhar thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques
AT rustamfurqan thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques
AT delatorrediezisabel thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques
AT mazonjuanluisvidal thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques
AT rodriguezcarmenlili thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques
AT ashrafimran thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques