Dataset for comparable evaluation of machine translation between 11 South African languages
This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel with four reference translations available for each...
Main Authors: | McKellar, Cindy A., Puttkammer, Martin J. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2020 |
Subjects: | Arts and Humanity |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992927/ https://www.ncbi.nlm.nih.gov/pubmed/32016149 http://dx.doi.org/10.1016/j.dib.2020.105146 |
_version_ | 1783492931699605504 |
---|---|
author | McKellar, Cindy A. Puttkammer, Martin J. |
author_facet | McKellar, Cindy A. Puttkammer, Martin J. |
author_sort | McKellar, Cindy A. |
collection | PubMed |
description | This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel with four reference translations available for each of the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, Sepedi, Sesotho, Setswana, Siswati, Tshivenḓa and Xitsonga. |
format | Online Article Text |
id | pubmed-6992927 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-69929272020-02-03 Dataset for comparable evaluation of machine translation between 11 South African languages McKellar, Cindy A. Puttkammer, Martin J. Data Brief Arts and Humanity This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel with four reference translations available for each of the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, Sepedi, Sesotho, Setswana, Siswati, Tshivenḓa and Xitsonga. Elsevier 2020-01-14 /pmc/articles/PMC6992927/ /pubmed/32016149 http://dx.doi.org/10.1016/j.dib.2020.105146 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Arts and Humanity McKellar, Cindy A. Puttkammer, Martin J. Dataset for comparable evaluation of machine translation between 11 South African languages |
title | Dataset for comparable evaluation of machine translation between 11 South African languages |
title_full | Dataset for comparable evaluation of machine translation between 11 South African languages |
title_fullStr | Dataset for comparable evaluation of machine translation between 11 South African languages |
title_full_unstemmed | Dataset for comparable evaluation of machine translation between 11 South African languages |
title_short | Dataset for comparable evaluation of machine translation between 11 South African languages |
title_sort | dataset for comparable evaluation of machine translation between 11 south african languages |
topic | Arts and Humanity |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992927/ https://www.ncbi.nlm.nih.gov/pubmed/32016149 http://dx.doi.org/10.1016/j.dib.2020.105146 |
work_keys_str_mv | AT mckellarcindya datasetforcomparableevaluationofmachinetranslationbetween11southafricanlanguages AT puttkammermartinj datasetforcomparableevaluationofmachinetranslationbetween11southafricanlanguages |