Cargando…

Unsupervised Quality Estimation Model for English to German Translation and Its Application in Extensive Supervised Evaluation

With the rapid development of machine translation (MT), the MT evaluation becomes very important to timely tell us whether the MT system makes any progress. The conventional MT evaluation methods tend to calculate the similarity between hypothesis translations offered by automatic translation system...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Aaron L.-F., Wong, Derek F., Chao, Lidia S., He, Liangye, Lu, Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4032676/
https://www.ncbi.nlm.nih.gov/pubmed/24892086
http://dx.doi.org/10.1155/2014/760301
_version_ 1782317678103363584
author Han, Aaron L.-F.
Wong, Derek F.
Chao, Lidia S.
He, Liangye
Lu, Yi
author_facet Han, Aaron L.-F.
Wong, Derek F.
Chao, Lidia S.
He, Liangye
Lu, Yi
author_sort Han, Aaron L.-F.
collection PubMed
description With the rapid development of machine translation (MT), the MT evaluation becomes very important to timely tell us whether the MT system makes any progress. The conventional MT evaluation methods tend to calculate the similarity between hypothesis translations offered by automatic translation systems and reference translations offered by professional translators. There are several weaknesses in existing evaluation metrics. Firstly, the designed incomprehensive factors result in language-bias problem, which means they perform well on some special language pairs but weak on other language pairs. Secondly, they tend to use no linguistic features or too many linguistic features, of which no usage of linguistic feature draws a lot of criticism from the linguists and too many linguistic features make the model weak in repeatability. Thirdly, the employed reference translations are very expensive and sometimes not available in the practice. In this paper, the authors propose an unsupervised MT evaluation metric using universal part-of-speech tagset without relying on reference translations. The authors also explore the performances of the designed metric on traditional supervised evaluation tasks. Both the supervised and unsupervised experiments show that the designed methods yield higher correlation scores with human judgments.
format Online
Article
Text
id pubmed-4032676
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-40326762014-06-02 Unsupervised Quality Estimation Model for English to German Translation and Its Application in Extensive Supervised Evaluation Han, Aaron L.-F. Wong, Derek F. Chao, Lidia S. He, Liangye Lu, Yi ScientificWorldJournal Research Article With the rapid development of machine translation (MT), the MT evaluation becomes very important to timely tell us whether the MT system makes any progress. The conventional MT evaluation methods tend to calculate the similarity between hypothesis translations offered by automatic translation systems and reference translations offered by professional translators. There are several weaknesses in existing evaluation metrics. Firstly, the designed incomprehensive factors result in language-bias problem, which means they perform well on some special language pairs but weak on other language pairs. Secondly, they tend to use no linguistic features or too many linguistic features, of which no usage of linguistic feature draws a lot of criticism from the linguists and too many linguistic features make the model weak in repeatability. Thirdly, the employed reference translations are very expensive and sometimes not available in the practice. In this paper, the authors propose an unsupervised MT evaluation metric using universal part-of-speech tagset without relying on reference translations. The authors also explore the performances of the designed metric on traditional supervised evaluation tasks. Both the supervised and unsupervised experiments show that the designed methods yield higher correlation scores with human judgments. Hindawi Publishing Corporation 2014 2014-04-28 /pmc/articles/PMC4032676/ /pubmed/24892086 http://dx.doi.org/10.1155/2014/760301 Text en Copyright © 2014 Aaron L.-F. Han et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Han, Aaron L.-F.
Wong, Derek F.
Chao, Lidia S.
He, Liangye
Lu, Yi
Unsupervised Quality Estimation Model for English to German Translation and Its Application in Extensive Supervised Evaluation
title Unsupervised Quality Estimation Model for English to German Translation and Its Application in Extensive Supervised Evaluation
title_full Unsupervised Quality Estimation Model for English to German Translation and Its Application in Extensive Supervised Evaluation
title_fullStr Unsupervised Quality Estimation Model for English to German Translation and Its Application in Extensive Supervised Evaluation
title_full_unstemmed Unsupervised Quality Estimation Model for English to German Translation and Its Application in Extensive Supervised Evaluation
title_short Unsupervised Quality Estimation Model for English to German Translation and Its Application in Extensive Supervised Evaluation
title_sort unsupervised quality estimation model for english to german translation and its application in extensive supervised evaluation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4032676/
https://www.ncbi.nlm.nih.gov/pubmed/24892086
http://dx.doi.org/10.1155/2014/760301
work_keys_str_mv AT hanaaronlf unsupervisedqualityestimationmodelforenglishtogermantranslationanditsapplicationinextensivesupervisedevaluation
AT wongderekf unsupervisedqualityestimationmodelforenglishtogermantranslationanditsapplicationinextensivesupervisedevaluation
AT chaolidias unsupervisedqualityestimationmodelforenglishtogermantranslationanditsapplicationinextensivesupervisedevaluation
AT heliangye unsupervisedqualityestimationmodelforenglishtogermantranslationanditsapplicationinextensivesupervisedevaluation
AT luyi unsupervisedqualityestimationmodelforenglishtogermantranslationanditsapplicationinextensivesupervisedevaluation