Cargando…

Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data

Introduction: Association rule mining (ARM) is a powerful tool for exploring the informative relationships among multiple items (genes) in any dataset. The main problem of ARM is that it generates many rules containing different rule-informative values, which becomes a challenge for the user to choo...

Descripción completa

Detalles Bibliográficos
Autores principales: Mallik, Saurav, Seth, Soumita, Si, Amalendu, Bhadra , Tapas, Zhao, Zhongming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10415913/
https://www.ncbi.nlm.nih.gov/pubmed/37576714
http://dx.doi.org/10.3389/fbinf.2023.1182176
_version_ 1785087653082824704
author Mallik, Saurav
Seth, Soumita
Si, Amalendu
Bhadra , Tapas
Zhao, Zhongming
author_facet Mallik, Saurav
Seth, Soumita
Si, Amalendu
Bhadra , Tapas
Zhao, Zhongming
author_sort Mallik, Saurav
collection PubMed
description Introduction: Association rule mining (ARM) is a powerful tool for exploring the informative relationships among multiple items (genes) in any dataset. The main problem of ARM is that it generates many rules containing different rule-informative values, which becomes a challenge for the user to choose the effective rules. In addition, few works have been performed on the integration of multiple biological datasets and variable cutoff values in ARM. Methods: To solve all these problems, in this article, we developed a novel framework MOOVARM (multi-objective optimized variable cutoff-based association rule mining) for multi-omics profiles. Results: In this regard, we identified the positive ideal solution (PIS), which maximized the profit and minimized the loss, and negative ideal solution (NIS), which minimized the profit and maximized the loss for all gene sets (item sets), belonging to each extracted rule. Thereafter, we computed the distance (d +) from PIS and distance (d −) from NIS for each gene set or product. These two distances played an important role in determining the optimized associations among various pairs of genes in the multi-omics dataset. We then globally estimated the relative closeness to PIS for ranking the gene sets. When the relative closeness score of the rule is greater than or equal to the pre-defined threshold value, the rule can be considered a final resultant rule. Moreover, MOOVARM evaluated the relative score of the rule based on the status of all genes instead of individual genes. Conclusions: MOOVARM produced the final rank of the extracted (multi-objective optimized) rules of correlated genes which had better disease classification than the state-of-the-art algorithms on gene signature identification.
format Online
Article
Text
id pubmed-10415913
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104159132023-08-12 Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data Mallik, Saurav Seth, Soumita Si, Amalendu Bhadra , Tapas Zhao, Zhongming Front Bioinform Bioinformatics Introduction: Association rule mining (ARM) is a powerful tool for exploring the informative relationships among multiple items (genes) in any dataset. The main problem of ARM is that it generates many rules containing different rule-informative values, which becomes a challenge for the user to choose the effective rules. In addition, few works have been performed on the integration of multiple biological datasets and variable cutoff values in ARM. Methods: To solve all these problems, in this article, we developed a novel framework MOOVARM (multi-objective optimized variable cutoff-based association rule mining) for multi-omics profiles. Results: In this regard, we identified the positive ideal solution (PIS), which maximized the profit and minimized the loss, and negative ideal solution (NIS), which minimized the profit and maximized the loss for all gene sets (item sets), belonging to each extracted rule. Thereafter, we computed the distance (d +) from PIS and distance (d −) from NIS for each gene set or product. These two distances played an important role in determining the optimized associations among various pairs of genes in the multi-omics dataset. We then globally estimated the relative closeness to PIS for ranking the gene sets. When the relative closeness score of the rule is greater than or equal to the pre-defined threshold value, the rule can be considered a final resultant rule. Moreover, MOOVARM evaluated the relative score of the rule based on the status of all genes instead of individual genes. Conclusions: MOOVARM produced the final rank of the extracted (multi-objective optimized) rules of correlated genes which had better disease classification than the state-of-the-art algorithms on gene signature identification. Frontiers Media S.A. 2023-07-27 /pmc/articles/PMC10415913/ /pubmed/37576714 http://dx.doi.org/10.3389/fbinf.2023.1182176 Text en Copyright © 2023 Mallik, Seth, Si, Bhadra  and Zhao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Mallik, Saurav
Seth, Soumita
Si, Amalendu
Bhadra , Tapas
Zhao, Zhongming
Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data
title Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data
title_full Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data
title_fullStr Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data
title_full_unstemmed Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data
title_short Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data
title_sort optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10415913/
https://www.ncbi.nlm.nih.gov/pubmed/37576714
http://dx.doi.org/10.3389/fbinf.2023.1182176
work_keys_str_mv AT malliksaurav optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata
AT sethsoumita optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata
AT siamalendu optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata
AT bhadratapas optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata
AT zhaozhongming optimalrankinganddirectionalsignatureclassificationusingtheintegralstrategyofmultiobjectiveoptimizationbasedassociationruleminingofmultiomicsdata