Cargando…
Network motifs for translator stylometry identification
Despite the extensive literature investigating stylometry analysis in authorship attribution research, translator stylometry is an understudied research area. The identification of translator stylometry contributes to many fields including education, intellectual property rights and forensic linguis...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6368295/ https://www.ncbi.nlm.nih.gov/pubmed/30735512 http://dx.doi.org/10.1371/journal.pone.0211809 |
_version_ | 1783393962315218944 |
---|---|
author | El-Fiqi, Heba Petraki, Eleni Abbass, Hussein A. |
author_facet | El-Fiqi, Heba Petraki, Eleni Abbass, Hussein A. |
author_sort | El-Fiqi, Heba |
collection | PubMed |
description | Despite the extensive literature investigating stylometry analysis in authorship attribution research, translator stylometry is an understudied research area. The identification of translator stylometry contributes to many fields including education, intellectual property rights and forensic linguistics. In a two stage process, this paper first evaluates the use of existing lexical measures for the translator stylometry problem. Similar to previous research we found that using vocabulary richness in its traditional form as it has been used in the literature could not identify translator stylometry. This encouraged us to design an approach with the aim of identifying the distinctive patterns of a translator by employing network-motifs. Networks motifs are small sub-graphs which aim at capturing the local structure of a complex network. The proposed approach achieved an average accuracy of 83% in three-way classification. These results demonstrate that classic tools based on lexical features can be used for identifying translator stylometry if they get augmented with appropriate non-parametric scaling. Moreover, the use of complex network analysis and network motifs mining provided made it possible to design features that can solve translator stylometry analysis problems. |
format | Online Article Text |
id | pubmed-6368295 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-63682952019-02-22 Network motifs for translator stylometry identification El-Fiqi, Heba Petraki, Eleni Abbass, Hussein A. PLoS One Research Article Despite the extensive literature investigating stylometry analysis in authorship attribution research, translator stylometry is an understudied research area. The identification of translator stylometry contributes to many fields including education, intellectual property rights and forensic linguistics. In a two stage process, this paper first evaluates the use of existing lexical measures for the translator stylometry problem. Similar to previous research we found that using vocabulary richness in its traditional form as it has been used in the literature could not identify translator stylometry. This encouraged us to design an approach with the aim of identifying the distinctive patterns of a translator by employing network-motifs. Networks motifs are small sub-graphs which aim at capturing the local structure of a complex network. The proposed approach achieved an average accuracy of 83% in three-way classification. These results demonstrate that classic tools based on lexical features can be used for identifying translator stylometry if they get augmented with appropriate non-parametric scaling. Moreover, the use of complex network analysis and network motifs mining provided made it possible to design features that can solve translator stylometry analysis problems. Public Library of Science 2019-02-08 /pmc/articles/PMC6368295/ /pubmed/30735512 http://dx.doi.org/10.1371/journal.pone.0211809 Text en © 2019 El-Fiqi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article El-Fiqi, Heba Petraki, Eleni Abbass, Hussein A. Network motifs for translator stylometry identification |
title | Network motifs for translator stylometry identification |
title_full | Network motifs for translator stylometry identification |
title_fullStr | Network motifs for translator stylometry identification |
title_full_unstemmed | Network motifs for translator stylometry identification |
title_short | Network motifs for translator stylometry identification |
title_sort | network motifs for translator stylometry identification |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6368295/ https://www.ncbi.nlm.nih.gov/pubmed/30735512 http://dx.doi.org/10.1371/journal.pone.0211809 |
work_keys_str_mv | AT elfiqiheba networkmotifsfortranslatorstylometryidentification AT petrakieleni networkmotifsfortranslatorstylometryidentification AT abbasshusseina networkmotifsfortranslatorstylometryidentification |