Cargando…

Evaluation of Modified Categorical Data Fuzzy Clustering Algorithm on the Wisconsin Breast Cancer Dataset

The early diagnosis of breast cancer is an important step in a fight against the disease. Machine learning techniques have shown promise in improving our understanding of the disease. As medical datasets consist of data points which cannot be precisely assigned to a class, fuzzy methods have been us...

Descripción completa

Detalles Bibliográficos
Autor principal: Ahmad, Amir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4783556/
https://www.ncbi.nlm.nih.gov/pubmed/27022504
http://dx.doi.org/10.1155/2016/4273813
_version_ 1782420129457373184
author Ahmad, Amir
author_facet Ahmad, Amir
author_sort Ahmad, Amir
collection PubMed
description The early diagnosis of breast cancer is an important step in a fight against the disease. Machine learning techniques have shown promise in improving our understanding of the disease. As medical datasets consist of data points which cannot be precisely assigned to a class, fuzzy methods have been useful for studying of these datasets. Sometimes breast cancer datasets are described by categorical features. Many fuzzy clustering algorithms have been developed for categorical datasets. However, in most of these methods Hamming distance is used to define the distance between the two categorical feature values. In this paper, we use a probabilistic distance measure for the distance computation among a pair of categorical feature values. Experiments demonstrate that the distance measure performs better than Hamming distance for Wisconsin breast cancer data.
format Online
Article
Text
id pubmed-4783556
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-47835562016-03-28 Evaluation of Modified Categorical Data Fuzzy Clustering Algorithm on the Wisconsin Breast Cancer Dataset Ahmad, Amir Scientifica (Cairo) Research Article The early diagnosis of breast cancer is an important step in a fight against the disease. Machine learning techniques have shown promise in improving our understanding of the disease. As medical datasets consist of data points which cannot be precisely assigned to a class, fuzzy methods have been useful for studying of these datasets. Sometimes breast cancer datasets are described by categorical features. Many fuzzy clustering algorithms have been developed for categorical datasets. However, in most of these methods Hamming distance is used to define the distance between the two categorical feature values. In this paper, we use a probabilistic distance measure for the distance computation among a pair of categorical feature values. Experiments demonstrate that the distance measure performs better than Hamming distance for Wisconsin breast cancer data. Hindawi Publishing Corporation 2016 2016-02-24 /pmc/articles/PMC4783556/ /pubmed/27022504 http://dx.doi.org/10.1155/2016/4273813 Text en Copyright © 2016 Amir Ahmad. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ahmad, Amir
Evaluation of Modified Categorical Data Fuzzy Clustering Algorithm on the Wisconsin Breast Cancer Dataset
title Evaluation of Modified Categorical Data Fuzzy Clustering Algorithm on the Wisconsin Breast Cancer Dataset
title_full Evaluation of Modified Categorical Data Fuzzy Clustering Algorithm on the Wisconsin Breast Cancer Dataset
title_fullStr Evaluation of Modified Categorical Data Fuzzy Clustering Algorithm on the Wisconsin Breast Cancer Dataset
title_full_unstemmed Evaluation of Modified Categorical Data Fuzzy Clustering Algorithm on the Wisconsin Breast Cancer Dataset
title_short Evaluation of Modified Categorical Data Fuzzy Clustering Algorithm on the Wisconsin Breast Cancer Dataset
title_sort evaluation of modified categorical data fuzzy clustering algorithm on the wisconsin breast cancer dataset
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4783556/
https://www.ncbi.nlm.nih.gov/pubmed/27022504
http://dx.doi.org/10.1155/2016/4273813
work_keys_str_mv AT ahmadamir evaluationofmodifiedcategoricaldatafuzzyclusteringalgorithmonthewisconsinbreastcancerdataset