Cargando…
Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques
SIMPLE SUMMARY: The study presents a thyroid disease prediction approach which utilizes random forest-based features to obtain high accuracy. The approach can obtain a 0.99 accuracy to predict ten thyroid diseases. ABSTRACT: Thyroid disease prediction has emerged as an important task recently. Despi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9405591/ https://www.ncbi.nlm.nih.gov/pubmed/36010907 http://dx.doi.org/10.3390/cancers14163914 |
_version_ | 1784773915294302208 |
---|---|
author | Chaganti, Rajasekhar Rustam, Furqan De La Torre Díez, Isabel Mazón, Juan Luis Vidal Rodríguez, Carmen Lili Ashraf, Imran |
author_facet | Chaganti, Rajasekhar Rustam, Furqan De La Torre Díez, Isabel Mazón, Juan Luis Vidal Rodríguez, Carmen Lili Ashraf, Imran |
author_sort | Chaganti, Rajasekhar |
collection | PubMed |
description | SIMPLE SUMMARY: The study presents a thyroid disease prediction approach which utilizes random forest-based features to obtain high accuracy. The approach can obtain a 0.99 accuracy to predict ten thyroid diseases. ABSTRACT: Thyroid disease prediction has emerged as an important task recently. Despite existing approaches for its diagnosis, often the target is binary classification, the used datasets are small-sized and results are not validated either. Predominantly, existing approaches focus on model optimization and the feature engineering part is less investigated. To overcome these limitations, this study presents an approach that investigates feature engineering for machine learning and deep learning models. Forward feature selection, backward feature elimination, bidirectional feature elimination, and machine learning-based feature selection using extra tree classifiers are adopted. The proposed approach can predict Hashimoto’s thyroiditis (primary hypothyroid), binding protein (increased binding protein), autoimmune thyroiditis (compensated hypothyroid), and non-thyroidal syndrome (NTIS) (concurrent non-thyroidal illness). Extensive experiments show that the extra tree classifier-based selected feature yields the best results with 0.99 accuracy and an F1 score when used with the random forest classifier. Results suggest that the machine learning models are a better choice for thyroid disease detection regarding the provided accuracy and the computational complexity. K-fold cross-validation and performance comparison with existing studies corroborate the superior performance of the proposed approach. |
format | Online Article Text |
id | pubmed-9405591 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94055912022-08-26 Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques Chaganti, Rajasekhar Rustam, Furqan De La Torre Díez, Isabel Mazón, Juan Luis Vidal Rodríguez, Carmen Lili Ashraf, Imran Cancers (Basel) Article SIMPLE SUMMARY: The study presents a thyroid disease prediction approach which utilizes random forest-based features to obtain high accuracy. The approach can obtain a 0.99 accuracy to predict ten thyroid diseases. ABSTRACT: Thyroid disease prediction has emerged as an important task recently. Despite existing approaches for its diagnosis, often the target is binary classification, the used datasets are small-sized and results are not validated either. Predominantly, existing approaches focus on model optimization and the feature engineering part is less investigated. To overcome these limitations, this study presents an approach that investigates feature engineering for machine learning and deep learning models. Forward feature selection, backward feature elimination, bidirectional feature elimination, and machine learning-based feature selection using extra tree classifiers are adopted. The proposed approach can predict Hashimoto’s thyroiditis (primary hypothyroid), binding protein (increased binding protein), autoimmune thyroiditis (compensated hypothyroid), and non-thyroidal syndrome (NTIS) (concurrent non-thyroidal illness). Extensive experiments show that the extra tree classifier-based selected feature yields the best results with 0.99 accuracy and an F1 score when used with the random forest classifier. Results suggest that the machine learning models are a better choice for thyroid disease detection regarding the provided accuracy and the computational complexity. K-fold cross-validation and performance comparison with existing studies corroborate the superior performance of the proposed approach. MDPI 2022-08-13 /pmc/articles/PMC9405591/ /pubmed/36010907 http://dx.doi.org/10.3390/cancers14163914 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Chaganti, Rajasekhar Rustam, Furqan De La Torre Díez, Isabel Mazón, Juan Luis Vidal Rodríguez, Carmen Lili Ashraf, Imran Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques |
title | Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques |
title_full | Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques |
title_fullStr | Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques |
title_full_unstemmed | Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques |
title_short | Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques |
title_sort | thyroid disease prediction using selective features and machine learning techniques |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9405591/ https://www.ncbi.nlm.nih.gov/pubmed/36010907 http://dx.doi.org/10.3390/cancers14163914 |
work_keys_str_mv | AT chagantirajasekhar thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques AT rustamfurqan thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques AT delatorrediezisabel thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques AT mazonjuanluisvidal thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques AT rodriguezcarmenlili thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques AT ashrafimran thyroiddiseasepredictionusingselectivefeaturesandmachinelearningtechniques |