Cargando…

Dataset for comparable evaluation of machine translation between 11 South African languages

This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel with four reference translations available for each...

Descripción completa

Detalles Bibliográficos
Autores principales: McKellar, Cindy A., Puttkammer, Martin J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6992927/
https://www.ncbi.nlm.nih.gov/pubmed/32016149
http://dx.doi.org/10.1016/j.dib.2020.105146
Descripción
Sumario:This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel with four reference translations available for each of the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, Sepedi, Sesotho, Setswana, Siswati, Tshivenḓa and Xitsonga.