Cargando…
Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data
Introduction: Association rule mining (ARM) is a powerful tool for exploring the informative relationships among multiple items (genes) in any dataset. The main problem of ARM is that it generates many rules containing different rule-informative values, which becomes a challenge for the user to choo...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10415913/ https://www.ncbi.nlm.nih.gov/pubmed/37576714 http://dx.doi.org/10.3389/fbinf.2023.1182176 |
_version_ | 1785087653082824704 |
---|---|
author | Mallik, Saurav Seth, Soumita Si, Amalendu Bhadra , Tapas Zhao, Zhongming |
author_facet | Mallik, Saurav Seth, Soumita Si, Amalendu Bhadra , Tapas Zhao, Zhongming |
author_sort | Mallik, Saurav |
collection | PubMed |
description | Introduction: Association rule mining (ARM) is a powerful tool for exploring the informative relationships among multiple items (genes) in any dataset. The main problem of ARM is that it generates many rules containing different rule-informative values, which becomes a challenge for the user to choose the effective rules. In addition, few works have been performed on the integration of multiple biological datasets and variable cutoff values in ARM. Methods: To solve all these problems, in this article, we developed a novel framework MOOVARM (multi-objective optimized variable cutoff-based association rule mining) for multi-omics profiles. Results: In this regard, we identified the positive ideal solution (PIS), which maximized the profit and minimized the loss, and negative ideal solution (NIS), which minimized the profit and maximized the loss for all gene sets (item sets), belonging to each extracted rule. Thereafter, we computed the distance (d +) from PIS and distance (d −) from NIS for each gene set or product. These two distances played an important role in determining the optimized associations among various pairs of genes in the multi-omics dataset. We then globally estimated the relative closeness to PIS for ranking the gene sets. When the relative closeness score of the rule is greater than or equal to the pre-defined threshold value, the rule can be considered a final resultant rule. Moreover, MOOVARM evaluated the relative score of the rule based on the status of all genes instead of individual genes. Conclusions: MOOVARM produced the final rank of the extracted (multi-objective optimized) rules of correlated genes which had better disease classification than the state-of-the-art algorithms on gene signature identification. |
format | Online Article Text |
id | pubmed-10415913 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-104159132023-08-12 Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data Mallik, Saurav Seth, Soumita Si, Amalendu Bhadra , Tapas Zhao, Zhongming Front Bioinform Bioinformatics Introduction: Association rule mining (ARM) is a powerful tool for exploring the informative relationships among multiple items (genes) in any dataset. The main problem of ARM is that it generates many rules containing different rule-informative values, which becomes a challenge for the user to choose the effective rules. In addition, few works have been performed on the integration of multiple biological datasets and variable cutoff values in ARM. Methods: To solve all these problems, in this article, we developed a novel framework MOOVARM (multi-objective optimized variable cutoff-based association rule mining) for multi-omics profiles. Results: In this regard, we identified the positive ideal solution (PIS), which maximized the profit and minimized the loss, and negative ideal solution (NIS), which minimized the profit and maximized the loss for all gene sets (item sets), belonging to each extracted rule. Thereafter, we computed the distance (d +) from PIS and distance (d −) from NIS for each gene set or product. These two distances played an important role in determining the optimized associations among various pairs of genes in the multi-omics dataset. We then globally estimated the relative closeness to PIS for ranking the gene sets. When the relative closeness score of the rule is greater than or equal to the pre-defined threshold value, the rule can be considered a final resultant rule. Moreover, MOOVARM evaluated the relative score of the rule based on the status of all genes instead of individual genes. Conclusions: MOOVARM produced the final rank of the extracted (multi-objective optimized) rules of correlated genes which had better disease classification than the state-of-the-art algorithms on gene signature identification. Frontiers Media S.A. 2023-07-27 /pmc/articles/PMC10415913/ /pubmed/37576714 http://dx.doi.org/10.3389/fbinf.2023.1182176 Text en Copyright © 2023 Mallik, Seth, Si, Bhadra and Zhao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioinformatics Mallik, Saurav Seth, Soumita Si, Amalendu Bhadra , Tapas Zhao, Zhongming Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data |
title | Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data |
title_full | Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data |
title_fullStr | Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data |
title_full_unstemmed | Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data |
title_short | Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data |
title_sort | optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10415913/ https://www.ncbi.nlm.nih.gov/pubmed/37576714 http://dx.doi.org/10.3389/fbinf.2023.1182176 |
work_keys_str_mv | AT malliksaurav optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata AT sethsoumita optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata AT siamalendu optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata AT bhadratapas optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata AT zhaozhongming optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata |