Cargando…

Using k-dependence causal forest to mine the most significant dependency relationships among clinical variables for thyroid disease diagnosis

Numerous data mining models have been proposed to construct computer-aided medical expert systems. Bayesian network classifiers (BNCs) are more distinct and understandable than other models. To graphically describe the dependency relationships among clinical variables for thyroid disease diagnosis a...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, LiMin, Cao, FangYuan, Wang, ShuangCheng, Sun, MingHui, Dong, LiYan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5560694/
https://www.ncbi.nlm.nih.gov/pubmed/28817592
http://dx.doi.org/10.1371/journal.pone.0182070
_version_ 1783257705903816704
author Wang, LiMin
Cao, FangYuan
Wang, ShuangCheng
Sun, MingHui
Dong, LiYan
author_facet Wang, LiMin
Cao, FangYuan
Wang, ShuangCheng
Sun, MingHui
Dong, LiYan
author_sort Wang, LiMin
collection PubMed
description Numerous data mining models have been proposed to construct computer-aided medical expert systems. Bayesian network classifiers (BNCs) are more distinct and understandable than other models. To graphically describe the dependency relationships among clinical variables for thyroid disease diagnosis and ensure the rationality of the diagnosis results, the proposed k-dependence causal forest (KCF) model generates a series of submodels in the framework of maximum spanning tree (MST) and demonstrates stronger dependence representation. Friedman test on 12 UCI datasets shows that KCF has classification accuracy advantage over the other state-of-the-art BNCs, such as Naive Bayes, tree augmented Naive Bayes, and k-dependence Bayesian classifier. Our extensive experimental comparison on 4 medical datasets also proves the feasibility and effectiveness of KCF in terms of sensitivity and specificity.
format Online
Article
Text
id pubmed-5560694
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55606942017-08-25 Using k-dependence causal forest to mine the most significant dependency relationships among clinical variables for thyroid disease diagnosis Wang, LiMin Cao, FangYuan Wang, ShuangCheng Sun, MingHui Dong, LiYan PLoS One Research Article Numerous data mining models have been proposed to construct computer-aided medical expert systems. Bayesian network classifiers (BNCs) are more distinct and understandable than other models. To graphically describe the dependency relationships among clinical variables for thyroid disease diagnosis and ensure the rationality of the diagnosis results, the proposed k-dependence causal forest (KCF) model generates a series of submodels in the framework of maximum spanning tree (MST) and demonstrates stronger dependence representation. Friedman test on 12 UCI datasets shows that KCF has classification accuracy advantage over the other state-of-the-art BNCs, such as Naive Bayes, tree augmented Naive Bayes, and k-dependence Bayesian classifier. Our extensive experimental comparison on 4 medical datasets also proves the feasibility and effectiveness of KCF in terms of sensitivity and specificity. Public Library of Science 2017-08-17 /pmc/articles/PMC5560694/ /pubmed/28817592 http://dx.doi.org/10.1371/journal.pone.0182070 Text en © 2017 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, LiMin
Cao, FangYuan
Wang, ShuangCheng
Sun, MingHui
Dong, LiYan
Using k-dependence causal forest to mine the most significant dependency relationships among clinical variables for thyroid disease diagnosis
title Using k-dependence causal forest to mine the most significant dependency relationships among clinical variables for thyroid disease diagnosis
title_full Using k-dependence causal forest to mine the most significant dependency relationships among clinical variables for thyroid disease diagnosis
title_fullStr Using k-dependence causal forest to mine the most significant dependency relationships among clinical variables for thyroid disease diagnosis
title_full_unstemmed Using k-dependence causal forest to mine the most significant dependency relationships among clinical variables for thyroid disease diagnosis
title_short Using k-dependence causal forest to mine the most significant dependency relationships among clinical variables for thyroid disease diagnosis
title_sort using k-dependence causal forest to mine the most significant dependency relationships among clinical variables for thyroid disease diagnosis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5560694/
https://www.ncbi.nlm.nih.gov/pubmed/28817592
http://dx.doi.org/10.1371/journal.pone.0182070
work_keys_str_mv AT wanglimin usingkdependencecausalforesttominethemostsignificantdependencyrelationshipsamongclinicalvariablesforthyroiddiseasediagnosis
AT caofangyuan usingkdependencecausalforesttominethemostsignificantdependencyrelationshipsamongclinicalvariablesforthyroiddiseasediagnosis
AT wangshuangcheng usingkdependencecausalforesttominethemostsignificantdependencyrelationshipsamongclinicalvariablesforthyroiddiseasediagnosis
AT sunminghui usingkdependencecausalforesttominethemostsignificantdependencyrelationshipsamongclinicalvariablesforthyroiddiseasediagnosis
AT dongliyan usingkdependencecausalforesttominethemostsignificantdependencyrelationshipsamongclinicalvariablesforthyroiddiseasediagnosis