Cargando…

Network motifs for translator stylometry identification

Despite the extensive literature investigating stylometry analysis in authorship attribution research, translator stylometry is an understudied research area. The identification of translator stylometry contributes to many fields including education, intellectual property rights and forensic linguis...

Descripción completa

Detalles Bibliográficos
Autores principales: El-Fiqi, Heba, Petraki, Eleni, Abbass, Hussein A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6368295/
https://www.ncbi.nlm.nih.gov/pubmed/30735512
http://dx.doi.org/10.1371/journal.pone.0211809
_version_ 1783393962315218944
author El-Fiqi, Heba
Petraki, Eleni
Abbass, Hussein A.
author_facet El-Fiqi, Heba
Petraki, Eleni
Abbass, Hussein A.
author_sort El-Fiqi, Heba
collection PubMed
description Despite the extensive literature investigating stylometry analysis in authorship attribution research, translator stylometry is an understudied research area. The identification of translator stylometry contributes to many fields including education, intellectual property rights and forensic linguistics. In a two stage process, this paper first evaluates the use of existing lexical measures for the translator stylometry problem. Similar to previous research we found that using vocabulary richness in its traditional form as it has been used in the literature could not identify translator stylometry. This encouraged us to design an approach with the aim of identifying the distinctive patterns of a translator by employing network-motifs. Networks motifs are small sub-graphs which aim at capturing the local structure of a complex network. The proposed approach achieved an average accuracy of 83% in three-way classification. These results demonstrate that classic tools based on lexical features can be used for identifying translator stylometry if they get augmented with appropriate non-parametric scaling. Moreover, the use of complex network analysis and network motifs mining provided made it possible to design features that can solve translator stylometry analysis problems.
format Online
Article
Text
id pubmed-6368295
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63682952019-02-22 Network motifs for translator stylometry identification El-Fiqi, Heba Petraki, Eleni Abbass, Hussein A. PLoS One Research Article Despite the extensive literature investigating stylometry analysis in authorship attribution research, translator stylometry is an understudied research area. The identification of translator stylometry contributes to many fields including education, intellectual property rights and forensic linguistics. In a two stage process, this paper first evaluates the use of existing lexical measures for the translator stylometry problem. Similar to previous research we found that using vocabulary richness in its traditional form as it has been used in the literature could not identify translator stylometry. This encouraged us to design an approach with the aim of identifying the distinctive patterns of a translator by employing network-motifs. Networks motifs are small sub-graphs which aim at capturing the local structure of a complex network. The proposed approach achieved an average accuracy of 83% in three-way classification. These results demonstrate that classic tools based on lexical features can be used for identifying translator stylometry if they get augmented with appropriate non-parametric scaling. Moreover, the use of complex network analysis and network motifs mining provided made it possible to design features that can solve translator stylometry analysis problems. Public Library of Science 2019-02-08 /pmc/articles/PMC6368295/ /pubmed/30735512 http://dx.doi.org/10.1371/journal.pone.0211809 Text en © 2019 El-Fiqi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
El-Fiqi, Heba
Petraki, Eleni
Abbass, Hussein A.
Network motifs for translator stylometry identification
title Network motifs for translator stylometry identification
title_full Network motifs for translator stylometry identification
title_fullStr Network motifs for translator stylometry identification
title_full_unstemmed Network motifs for translator stylometry identification
title_short Network motifs for translator stylometry identification
title_sort network motifs for translator stylometry identification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6368295/
https://www.ncbi.nlm.nih.gov/pubmed/30735512
http://dx.doi.org/10.1371/journal.pone.0211809
work_keys_str_mv AT elfiqiheba networkmotifsfortranslatorstylometryidentification
AT petrakieleni networkmotifsfortranslatorstylometryidentification
AT abbasshusseina networkmotifsfortranslatorstylometryidentification