Cargando…

Automated identification of borrowings in multilingual wordlists

Although lexical borrowing is an important aspect of language evolution, there have been few attempts to automate the identification of borrowings in lexical datasets. Moreover, none of the solutions which have been proposed so far identify borrowings across multiple languages. This study proposes a...

Descripción completa

Detalles Bibliográficos
Autores principales: List, Johann-Mattis, Forkel, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445856/
https://www.ncbi.nlm.nih.gov/pubmed/37645101
http://dx.doi.org/10.12688/openreseurope.13843.3
_version_ 1785094271929417728
author List, Johann-Mattis
Forkel, Robert
author_facet List, Johann-Mattis
Forkel, Robert
author_sort List, Johann-Mattis
collection PubMed
description Although lexical borrowing is an important aspect of language evolution, there have been few attempts to automate the identification of borrowings in lexical datasets. Moreover, none of the solutions which have been proposed so far identify borrowings across multiple languages. This study proposes a new method for the task and tests it on a newly compiled large comparative dataset of 48 South-East Asian languages from Southern China. The method yields very promising results, while it is conceptually straightforward and easy to apply. This makes the approach a perfect candidate for computer-assisted exploratory studies on lexical borrowing in contact areas.
format Online
Article
Text
id pubmed-10445856
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-104458562023-08-29 Automated identification of borrowings in multilingual wordlists List, Johann-Mattis Forkel, Robert Open Res Eur Research Article Although lexical borrowing is an important aspect of language evolution, there have been few attempts to automate the identification of borrowings in lexical datasets. Moreover, none of the solutions which have been proposed so far identify borrowings across multiple languages. This study proposes a new method for the task and tests it on a newly compiled large comparative dataset of 48 South-East Asian languages from Southern China. The method yields very promising results, while it is conceptually straightforward and easy to apply. This makes the approach a perfect candidate for computer-assisted exploratory studies on lexical borrowing in contact areas. F1000 Research Limited 2022-03-23 /pmc/articles/PMC10445856/ /pubmed/37645101 http://dx.doi.org/10.12688/openreseurope.13843.3 Text en Copyright: © 2022 List JM and Forkel R https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
List, Johann-Mattis
Forkel, Robert
Automated identification of borrowings in multilingual wordlists
title Automated identification of borrowings in multilingual wordlists
title_full Automated identification of borrowings in multilingual wordlists
title_fullStr Automated identification of borrowings in multilingual wordlists
title_full_unstemmed Automated identification of borrowings in multilingual wordlists
title_short Automated identification of borrowings in multilingual wordlists
title_sort automated identification of borrowings in multilingual wordlists
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445856/
https://www.ncbi.nlm.nih.gov/pubmed/37645101
http://dx.doi.org/10.12688/openreseurope.13843.3
work_keys_str_mv AT listjohannmattis automatedidentificationofborrowingsinmultilingualwordlists
AT forkelrobert automatedidentificationofborrowingsinmultilingualwordlists