Cargando…
Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC)
Associative classification mining (ACM) can be used to provide predictive models with high accuracy as well as interpretability. However, traditional ACM ignores the difference of significances among the features used for mining. Although weighted associative classification mining (WACM) addresses t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3515483/ https://www.ncbi.nlm.nih.gov/pubmed/23227228 http://dx.doi.org/10.1371/journal.pone.0051018 |
_version_ | 1782252190573789184 |
---|---|
author | Yu, Pulan Wild, David J. |
author_facet | Yu, Pulan Wild, David J. |
author_sort | Yu, Pulan |
collection | PubMed |
description | Associative classification mining (ACM) can be used to provide predictive models with high accuracy as well as interpretability. However, traditional ACM ignores the difference of significances among the features used for mining. Although weighted associative classification mining (WACM) addresses this issue by assigning different weights to features, most implementations can only be utilized when pre-assigned weights are available. In this paper, we propose a link-based approach to automatically derive weight information from a dataset using link-based models which treat the dataset as a bipartite model. By combining this link-based feature weighting method with a traditional ACM method–classification based on associations (CBA), a Link-based Associative Classifier (LAC) is developed. We then demonstrate the application of LAC to biomedical datasets for association discovery between chemical compounds and bioactivities or diseases. The results indicate that the novel link-based weighting method is comparable to support vector machine (SVM) and RELIEF method, and is capable of capturing significant features. Additionally, LAC is shown to produce models with high accuracies and discover interesting associations which may otherwise remain unrevealed by traditional ACM. |
format | Online Article Text |
id | pubmed-3515483 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-35154832012-12-07 Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC) Yu, Pulan Wild, David J. PLoS One Research Article Associative classification mining (ACM) can be used to provide predictive models with high accuracy as well as interpretability. However, traditional ACM ignores the difference of significances among the features used for mining. Although weighted associative classification mining (WACM) addresses this issue by assigning different weights to features, most implementations can only be utilized when pre-assigned weights are available. In this paper, we propose a link-based approach to automatically derive weight information from a dataset using link-based models which treat the dataset as a bipartite model. By combining this link-based feature weighting method with a traditional ACM method–classification based on associations (CBA), a Link-based Associative Classifier (LAC) is developed. We then demonstrate the application of LAC to biomedical datasets for association discovery between chemical compounds and bioactivities or diseases. The results indicate that the novel link-based weighting method is comparable to support vector machine (SVM) and RELIEF method, and is capable of capturing significant features. Additionally, LAC is shown to produce models with high accuracies and discover interesting associations which may otherwise remain unrevealed by traditional ACM. Public Library of Science 2012-12-05 /pmc/articles/PMC3515483/ /pubmed/23227228 http://dx.doi.org/10.1371/journal.pone.0051018 Text en © 2012 Yu, Wild http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Yu, Pulan Wild, David J. Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC) |
title | Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC) |
title_full | Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC) |
title_fullStr | Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC) |
title_full_unstemmed | Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC) |
title_short | Discovering Associations in Biomedical Datasets by Link-based Associative Classifier (LAC) |
title_sort | discovering associations in biomedical datasets by link-based associative classifier (lac) |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3515483/ https://www.ncbi.nlm.nih.gov/pubmed/23227228 http://dx.doi.org/10.1371/journal.pone.0051018 |
work_keys_str_mv | AT yupulan discoveringassociationsinbiomedicaldatasetsbylinkbasedassociativeclassifierlac AT wilddavidj discoveringassociationsinbiomedicaldatasetsbylinkbasedassociativeclassifierlac |