Cargando…

A business process clustering algorithm using incremental covering arrays to explore search space and balanced Bayesian information criterion to evaluate quality of solutions

The reuse of business processes (BPs) requires similarities between them to be suitably identified. Various approaches have been introduced to address this problem, but many of them feature a high computational cost and a low level of automation. This paper presents a clustering algorithm that group...

Descripción completa

Detalles Bibliográficos
Autores principales: Ordoñez, Hugo, Torres-Jimenez, Jose, Cobos, Carlos, Ordoñez, Armando, Herrera-Viedma, Enrique, Maldonado-Martinez, Gildardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6564420/
https://www.ncbi.nlm.nih.gov/pubmed/31194758
http://dx.doi.org/10.1371/journal.pone.0217686
_version_ 1783426649504612352
author Ordoñez, Hugo
Torres-Jimenez, Jose
Cobos, Carlos
Ordoñez, Armando
Herrera-Viedma, Enrique
Maldonado-Martinez, Gildardo
author_facet Ordoñez, Hugo
Torres-Jimenez, Jose
Cobos, Carlos
Ordoñez, Armando
Herrera-Viedma, Enrique
Maldonado-Martinez, Gildardo
author_sort Ordoñez, Hugo
collection PubMed
description The reuse of business processes (BPs) requires similarities between them to be suitably identified. Various approaches have been introduced to address this problem, but many of them feature a high computational cost and a low level of automation. This paper presents a clustering algorithm that groups business processes retrieved from a multimodal search system (based on textual and structural information). The algorithm is based on Incremental Covering Arrays (ICAs) with different alphabets to determine the possible number of groups to be created for each row of the ICA. The proposed algorithm also incorporates Balanced Bayesian Information Criterion to determine the optimal number of groups and the best solution for each query. Experimental evaluation shows that the use of ICAs with strength four (4) and different alphabets reduces the number of solutions needed to be evaluated and optimizes the number of clusters. The proposed algorithm outperforms other algorithms in various measures (precision, recall, and F-measure) by between 12% and 88%. Friedman and Wilcoxon non-parametric tests gave a 90–95% significance level to the obtained results. Better options of repository search for BPs help companies to reuse them. By thus reusing BPs, managers and analysts can more easily get to know the evolution and trajectory of the company processes, a situation that could be expected to lead to improved managerial and commercial decision making.
format Online
Article
Text
id pubmed-6564420
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-65644202019-06-20 A business process clustering algorithm using incremental covering arrays to explore search space and balanced Bayesian information criterion to evaluate quality of solutions Ordoñez, Hugo Torres-Jimenez, Jose Cobos, Carlos Ordoñez, Armando Herrera-Viedma, Enrique Maldonado-Martinez, Gildardo PLoS One Research Article The reuse of business processes (BPs) requires similarities between them to be suitably identified. Various approaches have been introduced to address this problem, but many of them feature a high computational cost and a low level of automation. This paper presents a clustering algorithm that groups business processes retrieved from a multimodal search system (based on textual and structural information). The algorithm is based on Incremental Covering Arrays (ICAs) with different alphabets to determine the possible number of groups to be created for each row of the ICA. The proposed algorithm also incorporates Balanced Bayesian Information Criterion to determine the optimal number of groups and the best solution for each query. Experimental evaluation shows that the use of ICAs with strength four (4) and different alphabets reduces the number of solutions needed to be evaluated and optimizes the number of clusters. The proposed algorithm outperforms other algorithms in various measures (precision, recall, and F-measure) by between 12% and 88%. Friedman and Wilcoxon non-parametric tests gave a 90–95% significance level to the obtained results. Better options of repository search for BPs help companies to reuse them. By thus reusing BPs, managers and analysts can more easily get to know the evolution and trajectory of the company processes, a situation that could be expected to lead to improved managerial and commercial decision making. Public Library of Science 2019-06-13 /pmc/articles/PMC6564420/ /pubmed/31194758 http://dx.doi.org/10.1371/journal.pone.0217686 Text en © 2019 Ordoñez et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ordoñez, Hugo
Torres-Jimenez, Jose
Cobos, Carlos
Ordoñez, Armando
Herrera-Viedma, Enrique
Maldonado-Martinez, Gildardo
A business process clustering algorithm using incremental covering arrays to explore search space and balanced Bayesian information criterion to evaluate quality of solutions
title A business process clustering algorithm using incremental covering arrays to explore search space and balanced Bayesian information criterion to evaluate quality of solutions
title_full A business process clustering algorithm using incremental covering arrays to explore search space and balanced Bayesian information criterion to evaluate quality of solutions
title_fullStr A business process clustering algorithm using incremental covering arrays to explore search space and balanced Bayesian information criterion to evaluate quality of solutions
title_full_unstemmed A business process clustering algorithm using incremental covering arrays to explore search space and balanced Bayesian information criterion to evaluate quality of solutions
title_short A business process clustering algorithm using incremental covering arrays to explore search space and balanced Bayesian information criterion to evaluate quality of solutions
title_sort business process clustering algorithm using incremental covering arrays to explore search space and balanced bayesian information criterion to evaluate quality of solutions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6564420/
https://www.ncbi.nlm.nih.gov/pubmed/31194758
http://dx.doi.org/10.1371/journal.pone.0217686
work_keys_str_mv AT ordonezhugo abusinessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT torresjimenezjose abusinessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT coboscarlos abusinessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT ordonezarmando abusinessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT herreraviedmaenrique abusinessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT maldonadomartinezgildardo abusinessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT ordonezhugo businessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT torresjimenezjose businessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT coboscarlos businessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT ordonezarmando businessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT herreraviedmaenrique businessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions
AT maldonadomartinezgildardo businessprocessclusteringalgorithmusingincrementalcoveringarraystoexploresearchspaceandbalancedbayesianinformationcriteriontoevaluatequalityofsolutions