Cargando…

Dataset for comparable evaluation of machine translation between 11 South African languages

This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel with four reference translations available for each...

Descripción completa

Detalles Bibliográficos
Autores principales: McKellar, Cindy A., Puttkammer, Martin J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992927/
https://www.ncbi.nlm.nih.gov/pubmed/32016149
http://dx.doi.org/10.1016/j.dib.2020.105146
_version_ 1783492931699605504
author McKellar, Cindy A.
Puttkammer, Martin J.
author_facet McKellar, Cindy A.
Puttkammer, Martin J.
author_sort McKellar, Cindy A.
collection PubMed
description This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel with four reference translations available for each of the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, Sepedi, Sesotho, Setswana, Siswati, Tshivenḓa and Xitsonga.
format Online
Article
Text
id pubmed-6992927
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-69929272020-02-03 Dataset for comparable evaluation of machine translation between 11 South African languages McKellar, Cindy A. Puttkammer, Martin J. Data Brief Arts and Humanity This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel with four reference translations available for each of the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, Sepedi, Sesotho, Setswana, Siswati, Tshivenḓa and Xitsonga. Elsevier 2020-01-14 /pmc/articles/PMC6992927/ /pubmed/32016149 http://dx.doi.org/10.1016/j.dib.2020.105146 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Arts and Humanity
McKellar, Cindy A.
Puttkammer, Martin J.
Dataset for comparable evaluation of machine translation between 11 South African languages
title Dataset for comparable evaluation of machine translation between 11 South African languages
title_full Dataset for comparable evaluation of machine translation between 11 South African languages
title_fullStr Dataset for comparable evaluation of machine translation between 11 South African languages
title_full_unstemmed Dataset for comparable evaluation of machine translation between 11 South African languages
title_short Dataset for comparable evaluation of machine translation between 11 South African languages
title_sort dataset for comparable evaluation of machine translation between 11 south african languages
topic Arts and Humanity
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992927/
https://www.ncbi.nlm.nih.gov/pubmed/32016149
http://dx.doi.org/10.1016/j.dib.2020.105146
work_keys_str_mv AT mckellarcindya datasetforcomparableevaluationofmachinetranslationbetween11southafricanlanguages
AT puttkammermartinj datasetforcomparableevaluationofmachinetranslationbetween11southafricanlanguages