Cargando…

Evaluating prose style transfer with the Bible

In the prose style transfer task a system, provided with text input and a target prose style, produces output which preserves the meaning of the input text but alters the style. These systems require parallel data for evaluation of results and usually make use of parallel data for training. Currentl...

Descripción completa

Detalles Bibliográficos
Autores principales: Carlson, Keith, Riddell, Allen, Rockmore, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6227951/
https://www.ncbi.nlm.nih.gov/pubmed/30473797
http://dx.doi.org/10.1098/rsos.171920
_version_ 1783370011873640448
author Carlson, Keith
Riddell, Allen
Rockmore, Daniel
author_facet Carlson, Keith
Riddell, Allen
Rockmore, Daniel
author_sort Carlson, Keith
collection PubMed
description In the prose style transfer task a system, provided with text input and a target prose style, produces output which preserves the meaning of the input text but alters the style. These systems require parallel data for evaluation of results and usually make use of parallel data for training. Currently, there are few publicly available corpora for this task. In this work, we identify a high-quality source of aligned, stylistically distinct text in different versions of the Bible. We provide a standardized split, into training, development and testing data, of the public domain versions in our corpus. This corpus is highly parallel since many Bible versions are included. Sentences are aligned due to the presence of chapter and verse numbers within all versions of the text. In addition to the corpus, we present the results, as measured by the BLEU and PINC metrics, of several models trained on our data which can serve as baselines for future research. While we present these data as a style transfer corpus, we believe that it is of unmatched quality and may be useful for other natural language tasks as well.
format Online
Article
Text
id pubmed-6227951
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-62279512018-11-23 Evaluating prose style transfer with the Bible Carlson, Keith Riddell, Allen Rockmore, Daniel R Soc Open Sci Computer Science In the prose style transfer task a system, provided with text input and a target prose style, produces output which preserves the meaning of the input text but alters the style. These systems require parallel data for evaluation of results and usually make use of parallel data for training. Currently, there are few publicly available corpora for this task. In this work, we identify a high-quality source of aligned, stylistically distinct text in different versions of the Bible. We provide a standardized split, into training, development and testing data, of the public domain versions in our corpus. This corpus is highly parallel since many Bible versions are included. Sentences are aligned due to the presence of chapter and verse numbers within all versions of the text. In addition to the corpus, we present the results, as measured by the BLEU and PINC metrics, of several models trained on our data which can serve as baselines for future research. While we present these data as a style transfer corpus, we believe that it is of unmatched quality and may be useful for other natural language tasks as well. The Royal Society 2018-10-24 /pmc/articles/PMC6227951/ /pubmed/30473797 http://dx.doi.org/10.1098/rsos.171920 Text en © 2018 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Computer Science
Carlson, Keith
Riddell, Allen
Rockmore, Daniel
Evaluating prose style transfer with the Bible
title Evaluating prose style transfer with the Bible
title_full Evaluating prose style transfer with the Bible
title_fullStr Evaluating prose style transfer with the Bible
title_full_unstemmed Evaluating prose style transfer with the Bible
title_short Evaluating prose style transfer with the Bible
title_sort evaluating prose style transfer with the bible
topic Computer Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6227951/
https://www.ncbi.nlm.nih.gov/pubmed/30473797
http://dx.doi.org/10.1098/rsos.171920
work_keys_str_mv AT carlsonkeith evaluatingprosestyletransferwiththebible
AT riddellallen evaluatingprosestyletransferwiththebible
AT rockmoredaniel evaluatingprosestyletransferwiththebible