Cargando…

Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC)

Associative classification mining (ACM) can be used to provide predictive models with high accuracy as well as interpretability. However, traditional ACM ignores the difference of significances among the features used for mining. Although weighted associative classification mining (WACM) addresses t...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Pulan, Wild, David J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3515483/
https://www.ncbi.nlm.nih.gov/pubmed/23227228
http://dx.doi.org/10.1371/journal.pone.0051018
_version_ 1782252190573789184
author Yu, Pulan
Wild, David J.
author_facet Yu, Pulan
Wild, David J.
author_sort Yu, Pulan
collection PubMed
description Associative classification mining (ACM) can be used to provide predictive models with high accuracy as well as interpretability. However, traditional ACM ignores the difference of significances among the features used for mining. Although weighted associative classification mining (WACM) addresses this issue by assigning different weights to features, most implementations can only be utilized when pre-assigned weights are available. In this paper, we propose a link-based approach to automatically derive weight information from a dataset using link-based models which treat the dataset as a bipartite model. By combining this link-based feature weighting method with a traditional ACM method–classification based on associations (CBA), a Link-based Associative Classifier (LAC) is developed. We then demonstrate the application of LAC to biomedical datasets for association discovery between chemical compounds and bioactivities or diseases. The results indicate that the novel link-based weighting method is comparable to support vector machine (SVM) and RELIEF method, and is capable of capturing significant features. Additionally, LAC is shown to produce models with high accuracies and discover interesting associations which may otherwise remain unrevealed by traditional ACM.
format Online
Article
Text
id pubmed-3515483
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35154832012-12-07 Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC) Yu, Pulan Wild, David J. PLoS One Research Article Associative classification mining (ACM) can be used to provide predictive models with high accuracy as well as interpretability. However, traditional ACM ignores the difference of significances among the features used for mining. Although weighted associative classification mining (WACM) addresses this issue by assigning different weights to features, most implementations can only be utilized when pre-assigned weights are available. In this paper, we propose a link-based approach to automatically derive weight information from a dataset using link-based models which treat the dataset as a bipartite model. By combining this link-based feature weighting method with a traditional ACM method–classification based on associations (CBA), a Link-based Associative Classifier (LAC) is developed. We then demonstrate the application of LAC to biomedical datasets for association discovery between chemical compounds and bioactivities or diseases. The results indicate that the novel link-based weighting method is comparable to support vector machine (SVM) and RELIEF method, and is capable of capturing significant features. Additionally, LAC is shown to produce models with high accuracies and discover interesting associations which may otherwise remain unrevealed by traditional ACM. Public Library of Science 2012-12-05 /pmc/articles/PMC3515483/ /pubmed/23227228 http://dx.doi.org/10.1371/journal.pone.0051018 Text en © 2012 Yu, Wild http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Yu, Pulan
Wild, David J.
Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC)
title Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC)
title_full Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC)
title_fullStr Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC)
title_full_unstemmed Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC)
title_short Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC)
title_sort discovering associations in biomedical datasets by link-based associative classifier (lac)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3515483/
https://www.ncbi.nlm.nih.gov/pubmed/23227228
http://dx.doi.org/10.1371/journal.pone.0051018
work_keys_str_mv AT yupulan discoveringassociationsinbiomedicaldatasetsbylinkbasedassociativeclassifierlac
AT wilddavidj discoveringassociationsinbiomedicaldatasetsbylinkbasedassociativeclassifierlac