Cargando…
Selected Data Mining Tools for Data Analysis in Distributed Environment
In this paper, we deal with distributed data represented either as a finite set [Formula: see text] of decision tables with equal sets of attributes or a finite set [Formula: see text] of information systems with equal sets of attributes. In the former case, we discuss a way to the study decision tr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602006/ https://www.ncbi.nlm.nih.gov/pubmed/37420421 http://dx.doi.org/10.3390/e24101401 |
_version_ | 1784817205979906048 |
---|---|
author | Moshkov, Mikhail Zielosko, Beata Tetteh, Evans Teiko |
author_facet | Moshkov, Mikhail Zielosko, Beata Tetteh, Evans Teiko |
author_sort | Moshkov, Mikhail |
collection | PubMed |
description | In this paper, we deal with distributed data represented either as a finite set [Formula: see text] of decision tables with equal sets of attributes or a finite set [Formula: see text] of information systems with equal sets of attributes. In the former case, we discuss a way to the study decision trees common to all tables from the set [Formula: see text]: building a decision table in which the set of decision trees coincides with the set of decision trees common to all tables from [Formula: see text]. We show when we can build such a decision table and how to build it in a polynomial time. If we have such a table, we can apply various decision tree learning algorithms to it. We extend the considered approach to the study of test (reducts) and decision rules common to all tables from [Formula: see text]. In the latter case, we discuss a way to study the association rules common to all information systems from the set [Formula: see text]: building a joint information system for which the set of true association rules that are realizable for a given row [Formula: see text] and have a given attribute a on the right-hand side coincides with the set of association rules that are true for all information systems from [Formula: see text] , have the attribute a on the right-hand side, and are realizable for the row [Formula: see text]. We then show how to build a joint information system in a polynomial time. When we build such an information system, we can apply various association rule learning algorithms to it. |
format | Online Article Text |
id | pubmed-9602006 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96020062022-10-27 Selected Data Mining Tools for Data Analysis in Distributed Environment Moshkov, Mikhail Zielosko, Beata Tetteh, Evans Teiko Entropy (Basel) Article In this paper, we deal with distributed data represented either as a finite set [Formula: see text] of decision tables with equal sets of attributes or a finite set [Formula: see text] of information systems with equal sets of attributes. In the former case, we discuss a way to the study decision trees common to all tables from the set [Formula: see text]: building a decision table in which the set of decision trees coincides with the set of decision trees common to all tables from [Formula: see text]. We show when we can build such a decision table and how to build it in a polynomial time. If we have such a table, we can apply various decision tree learning algorithms to it. We extend the considered approach to the study of test (reducts) and decision rules common to all tables from [Formula: see text]. In the latter case, we discuss a way to study the association rules common to all information systems from the set [Formula: see text]: building a joint information system for which the set of true association rules that are realizable for a given row [Formula: see text] and have a given attribute a on the right-hand side coincides with the set of association rules that are true for all information systems from [Formula: see text] , have the attribute a on the right-hand side, and are realizable for the row [Formula: see text]. We then show how to build a joint information system in a polynomial time. When we build such an information system, we can apply various association rule learning algorithms to it. MDPI 2022-10-01 /pmc/articles/PMC9602006/ /pubmed/37420421 http://dx.doi.org/10.3390/e24101401 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Moshkov, Mikhail Zielosko, Beata Tetteh, Evans Teiko Selected Data Mining Tools for Data Analysis in Distributed Environment |
title | Selected Data Mining Tools for Data Analysis in Distributed Environment |
title_full | Selected Data Mining Tools for Data Analysis in Distributed Environment |
title_fullStr | Selected Data Mining Tools for Data Analysis in Distributed Environment |
title_full_unstemmed | Selected Data Mining Tools for Data Analysis in Distributed Environment |
title_short | Selected Data Mining Tools for Data Analysis in Distributed Environment |
title_sort | selected data mining tools for data analysis in distributed environment |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602006/ https://www.ncbi.nlm.nih.gov/pubmed/37420421 http://dx.doi.org/10.3390/e24101401 |
work_keys_str_mv | AT moshkovmikhail selecteddataminingtoolsfordataanalysisindistributedenvironment AT zieloskobeata selecteddataminingtoolsfordataanalysisindistributedenvironment AT tettehevansteiko selecteddataminingtoolsfordataanalysisindistributedenvironment |