Automated identification of borrowings in multilingual wordlists
Although lexical borrowing is an important aspect of language evolution, there have been few attempts to automate the identification of borrowings in lexical datasets. Moreover, none of the solutions which have been proposed so far identify borrowings across multiple languages. This study proposes a...
Main Authors: | List, Johann-Mattis, Forkel, Robert |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | F1000 Research Limited 2022 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445856/ https://www.ncbi.nlm.nih.gov/pubmed/37645101 http://dx.doi.org/10.12688/openreseurope.13843.3 |
_version_ | 1785094271929417728 |
---|---|
author | List, Johann-Mattis Forkel, Robert |
author_facet | List, Johann-Mattis Forkel, Robert |
author_sort | List, Johann-Mattis |
collection | PubMed |
description | Although lexical borrowing is an important aspect of language evolution, there have been few attempts to automate the identification of borrowings in lexical datasets. Moreover, none of the solutions which have been proposed so far identify borrowings across multiple languages. This study proposes a new method for the task and tests it on a newly compiled large comparative dataset of 48 South-East Asian languages from Southern China. The method yields very promising results, while it is conceptually straightforward and easy to apply. This makes the approach a perfect candidate for computer-assisted exploratory studies on lexical borrowing in contact areas. |
format | Online Article Text |
id | pubmed-10445856 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-10445856 2023-08-29 Automated identification of borrowings in multilingual wordlists List, Johann-Mattis Forkel, Robert Open Res Eur Research Article Although lexical borrowing is an important aspect of language evolution, there have been few attempts to automate the identification of borrowings in lexical datasets. Moreover, none of the solutions which have been proposed so far identify borrowings across multiple languages. This study proposes a new method for the task and tests it on a newly compiled large comparative dataset of 48 South-East Asian languages from Southern China. The method yields very promising results, while it is conceptually straightforward and easy to apply. This makes the approach a perfect candidate for computer-assisted exploratory studies on lexical borrowing in contact areas. F1000 Research Limited 2022-03-23 /pmc/articles/PMC10445856/ /pubmed/37645101 http://dx.doi.org/10.12688/openreseurope.13843.3 Text en Copyright: © 2022 List JM and Forkel R https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article List, Johann-Mattis Forkel, Robert Automated identification of borrowings in multilingual wordlists |
title | Automated identification of borrowings in multilingual wordlists |
title_full | Automated identification of borrowings in multilingual wordlists |
title_fullStr | Automated identification of borrowings in multilingual wordlists |
title_full_unstemmed | Automated identification of borrowings in multilingual wordlists |
title_short | Automated identification of borrowings in multilingual wordlists |
title_sort | automated identification of borrowings in multilingual wordlists |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445856/ https://www.ncbi.nlm.nih.gov/pubmed/37645101 http://dx.doi.org/10.12688/openreseurope.13843.3 |
work_keys_str_mv | AT listjohannmattis automatedidentificationofborrowingsinmultilingualwordlists AT forkelrobert automatedidentificationofborrowingsinmultilingualwordlists |