Cargando…

Selected Data Mining Tools for Data Analysis in Distributed Environment

In this paper, we deal with distributed data represented either as a finite set [Formula: see text] of decision tables with equal sets of attributes or a finite set [Formula: see text] of information systems with equal sets of attributes. In the former case, we discuss a way to the study decision tr...

Descripción completa

Detalles Bibliográficos
Autores principales: Moshkov, Mikhail, Zielosko, Beata, Tetteh, Evans Teiko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602006/
https://www.ncbi.nlm.nih.gov/pubmed/37420421
http://dx.doi.org/10.3390/e24101401
_version_ 1784817205979906048
author Moshkov, Mikhail
Zielosko, Beata
Tetteh, Evans Teiko
author_facet Moshkov, Mikhail
Zielosko, Beata
Tetteh, Evans Teiko
author_sort Moshkov, Mikhail
collection PubMed
description In this paper, we deal with distributed data represented either as a finite set [Formula: see text] of decision tables with equal sets of attributes or a finite set [Formula: see text] of information systems with equal sets of attributes. In the former case, we discuss a way to the study decision trees common to all tables from the set [Formula: see text]: building a decision table in which the set of decision trees coincides with the set of decision trees common to all tables from [Formula: see text]. We show when we can build such a decision table and how to build it in a polynomial time. If we have such a table, we can apply various decision tree learning algorithms to it. We extend the considered approach to the study of test (reducts) and decision rules common to all tables from [Formula: see text]. In the latter case, we discuss a way to study the association rules common to all information systems from the set [Formula: see text]: building a joint information system for which the set of true association rules that are realizable for a given row [Formula: see text] and have a given attribute a on the right-hand side coincides with the set of association rules that are true for all information systems from [Formula: see text] , have the attribute a on the right-hand side, and are realizable for the row [Formula: see text]. We then show how to build a joint information system in a polynomial time. When we build such an information system, we can apply various association rule learning algorithms to it.
format Online
Article
Text
id pubmed-9602006
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96020062022-10-27 Selected Data Mining Tools for Data Analysis in Distributed Environment Moshkov, Mikhail Zielosko, Beata Tetteh, Evans Teiko Entropy (Basel) Article In this paper, we deal with distributed data represented either as a finite set [Formula: see text] of decision tables with equal sets of attributes or a finite set [Formula: see text] of information systems with equal sets of attributes. In the former case, we discuss a way to the study decision trees common to all tables from the set [Formula: see text]: building a decision table in which the set of decision trees coincides with the set of decision trees common to all tables from [Formula: see text]. We show when we can build such a decision table and how to build it in a polynomial time. If we have such a table, we can apply various decision tree learning algorithms to it. We extend the considered approach to the study of test (reducts) and decision rules common to all tables from [Formula: see text]. In the latter case, we discuss a way to study the association rules common to all information systems from the set [Formula: see text]: building a joint information system for which the set of true association rules that are realizable for a given row [Formula: see text] and have a given attribute a on the right-hand side coincides with the set of association rules that are true for all information systems from [Formula: see text] , have the attribute a on the right-hand side, and are realizable for the row [Formula: see text]. We then show how to build a joint information system in a polynomial time. When we build such an information system, we can apply various association rule learning algorithms to it. MDPI 2022-10-01 /pmc/articles/PMC9602006/ /pubmed/37420421 http://dx.doi.org/10.3390/e24101401 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Moshkov, Mikhail
Zielosko, Beata
Tetteh, Evans Teiko
Selected Data Mining Tools for Data Analysis in Distributed Environment
title Selected Data Mining Tools for Data Analysis in Distributed Environment
title_full Selected Data Mining Tools for Data Analysis in Distributed Environment
title_fullStr Selected Data Mining Tools for Data Analysis in Distributed Environment
title_full_unstemmed Selected Data Mining Tools for Data Analysis in Distributed Environment
title_short Selected Data Mining Tools for Data Analysis in Distributed Environment
title_sort selected data mining tools for data analysis in distributed environment
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602006/
https://www.ncbi.nlm.nih.gov/pubmed/37420421
http://dx.doi.org/10.3390/e24101401
work_keys_str_mv AT moshkovmikhail selecteddataminingtoolsfordataanalysisindistributedenvironment
AT zieloskobeata selecteddataminingtoolsfordataanalysisindistributedenvironment
AT tettehevansteiko selecteddataminingtoolsfordataanalysisindistributedenvironment