Cargando…

Prediction of cancer using customised fuzzy rough machine learning approaches

This Letter proposes a customised approach for attribute selection applied to the fuzzy rough quick reduct algorithm. The unbalanced data is balanced using synthetic minority oversampling technique. The huge dimensionality of the cancer data is reduced using a correlation-based filter. The dimension...

Descripción completa

Detalles Bibliográficos
Autores principales: Arunkumar, Chinnaswamy, Ramakrishnan, Srinivasan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Institution of Engineering and Technology 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6407447/
https://www.ncbi.nlm.nih.gov/pubmed/30881694
http://dx.doi.org/10.1049/htl.2018.5055
_version_ 1783401553435033600
author Arunkumar, Chinnaswamy
Ramakrishnan, Srinivasan
author_facet Arunkumar, Chinnaswamy
Ramakrishnan, Srinivasan
author_sort Arunkumar, Chinnaswamy
collection PubMed
description This Letter proposes a customised approach for attribute selection applied to the fuzzy rough quick reduct algorithm. The unbalanced data is balanced using synthetic minority oversampling technique. The huge dimensionality of the cancer data is reduced using a correlation-based filter. The dimensionality reduced balanced attribute gene subset is used to compute the final minimal reduct set using a customised fuzzy triangular norm operator on the fuzzy rough quick reduct algorithm. The customised fuzzy triangular norm operator is used with a Lukasiewicz fuzzy implicator to compute the fuzzy approximation. The customised operator selects the least number of informative feature genes from the dimensionality reduced datasets. Classification accuracy using leave-one-out cross validation of 94.85, 76.54, 98.11, and 99.13% is obtained using a customised function for Lukasiewicz triangular norm operator on leukemia, central nervous system, lung, and ovarian datasets, respectively. Performance analysis of the conventional fuzzy rough quick reduct and the proposed method are performed using parameters such as classification accuracy, precision, recall, F-measure, scatter plots, receiver operating characteristic area, McNemar test, chi-squared test, Matthew's correlation coefficient and false discovery rate that are used to prove that the proposed approach performs better than available methods in the literature.
format Online
Article
Text
id pubmed-6407447
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher The Institution of Engineering and Technology
record_format MEDLINE/PubMed
spelling pubmed-64074472019-03-16 Prediction of cancer using customised fuzzy rough machine learning approaches Arunkumar, Chinnaswamy Ramakrishnan, Srinivasan Healthc Technol Lett Article This Letter proposes a customised approach for attribute selection applied to the fuzzy rough quick reduct algorithm. The unbalanced data is balanced using synthetic minority oversampling technique. The huge dimensionality of the cancer data is reduced using a correlation-based filter. The dimensionality reduced balanced attribute gene subset is used to compute the final minimal reduct set using a customised fuzzy triangular norm operator on the fuzzy rough quick reduct algorithm. The customised fuzzy triangular norm operator is used with a Lukasiewicz fuzzy implicator to compute the fuzzy approximation. The customised operator selects the least number of informative feature genes from the dimensionality reduced datasets. Classification accuracy using leave-one-out cross validation of 94.85, 76.54, 98.11, and 99.13% is obtained using a customised function for Lukasiewicz triangular norm operator on leukemia, central nervous system, lung, and ovarian datasets, respectively. Performance analysis of the conventional fuzzy rough quick reduct and the proposed method are performed using parameters such as classification accuracy, precision, recall, F-measure, scatter plots, receiver operating characteristic area, McNemar test, chi-squared test, Matthew's correlation coefficient and false discovery rate that are used to prove that the proposed approach performs better than available methods in the literature. The Institution of Engineering and Technology 2018-12-24 /pmc/articles/PMC6407447/ /pubmed/30881694 http://dx.doi.org/10.1049/htl.2018.5055 Text en http://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open access article published by the IET under the Creative Commons Attribution-NonCommercial-NoDerivs License (http://creativecommons.org/licenses/by-nc-nd/3.0/)
spellingShingle Article
Arunkumar, Chinnaswamy
Ramakrishnan, Srinivasan
Prediction of cancer using customised fuzzy rough machine learning approaches
title Prediction of cancer using customised fuzzy rough machine learning approaches
title_full Prediction of cancer using customised fuzzy rough machine learning approaches
title_fullStr Prediction of cancer using customised fuzzy rough machine learning approaches
title_full_unstemmed Prediction of cancer using customised fuzzy rough machine learning approaches
title_short Prediction of cancer using customised fuzzy rough machine learning approaches
title_sort prediction of cancer using customised fuzzy rough machine learning approaches
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6407447/
https://www.ncbi.nlm.nih.gov/pubmed/30881694
http://dx.doi.org/10.1049/htl.2018.5055
work_keys_str_mv AT arunkumarchinnaswamy predictionofcancerusingcustomisedfuzzyroughmachinelearningapproaches
AT ramakrishnansrinivasan predictionofcancerusingcustomisedfuzzyroughmachinelearningapproaches