Cargando…
Multilayer network based comparative document analysis (MUNCoDA)
The proposed multilayer network-based comparative document analysis (MUNCoDA) method supports the identification of the common points of a set of documents, which deal with the same subject area. As documents are transformed into networks of informative word-pairs, the collection of documents form a...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7226890/ https://www.ncbi.nlm.nih.gov/pubmed/32426247 http://dx.doi.org/10.1016/j.mex.2020.100902 |
_version_ | 1783534385505501184 |
---|---|
author | Sebestyén, Viktor Domokos, Endre Abonyi, János |
author_facet | Sebestyén, Viktor Domokos, Endre Abonyi, János |
author_sort | Sebestyén, Viktor |
collection | PubMed |
description | The proposed multilayer network-based comparative document analysis (MUNCoDA) method supports the identification of the common points of a set of documents, which deal with the same subject area. As documents are transformed into networks of informative word-pairs, the collection of documents form a multilayer network that allows the comparative evaluation of the texts. The multilayer network can be visualized and analyzed to highlight how the texts are structured. The topics of the documents can be clustered based on the developed similarity measures. By exploring the network centralities, topic importance values can be assigned. The method is fully automated by KNIME preprocessing tools and MATLAB/Octave code. • Networks can be formed based on informative word pairs of a multiple documents; • The analysis of the proposed multilayer networks provides information for multi-document summarization; • Words and documents can be clustered based on node similarity and edge overlap measures. |
format | Online Article Text |
id | pubmed-7226890 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-72268902020-05-18 Multilayer network based comparative document analysis (MUNCoDA) Sebestyén, Viktor Domokos, Endre Abonyi, János MethodsX Computer Science The proposed multilayer network-based comparative document analysis (MUNCoDA) method supports the identification of the common points of a set of documents, which deal with the same subject area. As documents are transformed into networks of informative word-pairs, the collection of documents form a multilayer network that allows the comparative evaluation of the texts. The multilayer network can be visualized and analyzed to highlight how the texts are structured. The topics of the documents can be clustered based on the developed similarity measures. By exploring the network centralities, topic importance values can be assigned. The method is fully automated by KNIME preprocessing tools and MATLAB/Octave code. • Networks can be formed based on informative word pairs of a multiple documents; • The analysis of the proposed multilayer networks provides information for multi-document summarization; • Words and documents can be clustered based on node similarity and edge overlap measures. Elsevier 2020-04-29 /pmc/articles/PMC7226890/ /pubmed/32426247 http://dx.doi.org/10.1016/j.mex.2020.100902 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Computer Science Sebestyén, Viktor Domokos, Endre Abonyi, János Multilayer network based comparative document analysis (MUNCoDA) |
title | Multilayer network based comparative document analysis (MUNCoDA) |
title_full | Multilayer network based comparative document analysis (MUNCoDA) |
title_fullStr | Multilayer network based comparative document analysis (MUNCoDA) |
title_full_unstemmed | Multilayer network based comparative document analysis (MUNCoDA) |
title_short | Multilayer network based comparative document analysis (MUNCoDA) |
title_sort | multilayer network based comparative document analysis (muncoda) |
topic | Computer Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7226890/ https://www.ncbi.nlm.nih.gov/pubmed/32426247 http://dx.doi.org/10.1016/j.mex.2020.100902 |
work_keys_str_mv | AT sebestyenviktor multilayernetworkbasedcomparativedocumentanalysismuncoda AT domokosendre multilayernetworkbasedcomparativedocumentanalysismuncoda AT abonyijanos multilayernetworkbasedcomparativedocumentanalysismuncoda |