Cargando…

Authorship attribution based on Life-Like Network Automata

The authorship attribution is a problem of considerable practical and technical interest. Several methods have been designed to infer the authorship of disputed documents in multiple contexts. While traditional statistical methods based solely on word counts and related measurements have provided a...

Descripción completa

Detalles Bibliográficos
Autores principales: Machicao, Jeaneth, Corrêa, Edilson A., Miranda, Gisele H. B., Amancio, Diego R., Bruno, Odemir M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5863954/
https://www.ncbi.nlm.nih.gov/pubmed/29566100
http://dx.doi.org/10.1371/journal.pone.0193703
_version_ 1783308459427495936
author Machicao, Jeaneth
Corrêa, Edilson A.
Miranda, Gisele H. B.
Amancio, Diego R.
Bruno, Odemir M.
author_facet Machicao, Jeaneth
Corrêa, Edilson A.
Miranda, Gisele H. B.
Amancio, Diego R.
Bruno, Odemir M.
author_sort Machicao, Jeaneth
collection PubMed
description The authorship attribution is a problem of considerable practical and technical interest. Several methods have been designed to infer the authorship of disputed documents in multiple contexts. While traditional statistical methods based solely on word counts and related measurements have provided a simple, yet effective solution in particular cases; they are prone to manipulation. Recently, texts have been successfully modeled as networks, where words are represented by nodes linked according to textual similarity measurements. Such models are useful to identify informative topological patterns for the authorship recognition task. However, there is no consensus on which measurements should be used. Thus, we proposed a novel method to characterize text networks, by considering both topological and dynamical aspects of networks. Using concepts and methods from cellular automata theory, we devised a strategy to grasp informative spatio-temporal patterns from this model. Our experiments revealed an outperformance over structural analysis relying only on topological measurements, such as clustering coefficient, betweenness and shortest paths. The optimized results obtained here pave the way for a better characterization of textual networks.
format Online
Article
Text
id pubmed-5863954
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58639542018-03-28 Authorship attribution based on Life-Like Network Automata Machicao, Jeaneth Corrêa, Edilson A. Miranda, Gisele H. B. Amancio, Diego R. Bruno, Odemir M. PLoS One Research Article The authorship attribution is a problem of considerable practical and technical interest. Several methods have been designed to infer the authorship of disputed documents in multiple contexts. While traditional statistical methods based solely on word counts and related measurements have provided a simple, yet effective solution in particular cases; they are prone to manipulation. Recently, texts have been successfully modeled as networks, where words are represented by nodes linked according to textual similarity measurements. Such models are useful to identify informative topological patterns for the authorship recognition task. However, there is no consensus on which measurements should be used. Thus, we proposed a novel method to characterize text networks, by considering both topological and dynamical aspects of networks. Using concepts and methods from cellular automata theory, we devised a strategy to grasp informative spatio-temporal patterns from this model. Our experiments revealed an outperformance over structural analysis relying only on topological measurements, such as clustering coefficient, betweenness and shortest paths. The optimized results obtained here pave the way for a better characterization of textual networks. Public Library of Science 2018-03-22 /pmc/articles/PMC5863954/ /pubmed/29566100 http://dx.doi.org/10.1371/journal.pone.0193703 Text en © 2018 Machicao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Machicao, Jeaneth
Corrêa, Edilson A.
Miranda, Gisele H. B.
Amancio, Diego R.
Bruno, Odemir M.
Authorship attribution based on Life-Like Network Automata
title Authorship attribution based on Life-Like Network Automata
title_full Authorship attribution based on Life-Like Network Automata
title_fullStr Authorship attribution based on Life-Like Network Automata
title_full_unstemmed Authorship attribution based on Life-Like Network Automata
title_short Authorship attribution based on Life-Like Network Automata
title_sort authorship attribution based on life-like network automata
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5863954/
https://www.ncbi.nlm.nih.gov/pubmed/29566100
http://dx.doi.org/10.1371/journal.pone.0193703
work_keys_str_mv AT machicaojeaneth authorshipattributionbasedonlifelikenetworkautomata
AT correaedilsona authorshipattributionbasedonlifelikenetworkautomata
AT mirandagiselehb authorshipattributionbasedonlifelikenetworkautomata
AT amanciodiegor authorshipattributionbasedonlifelikenetworkautomata
AT brunoodemirm authorshipattributionbasedonlifelikenetworkautomata