Cargando…
An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation
In computer vision, one of the common practices to augment the image dataset is by creating new images using geometric transformation preserving similarity. This data augmentation was one of the most significant factors for winning the Image Net competition in 2012 with vast neural networks. Unlike...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304040/ http://dx.doi.org/10.1007/978-3-030-50420-5_29 |
_version_ | 1783548186215841792 |
---|---|
author | Gupta, Varun Krzyżak, Adam |
author_facet | Gupta, Varun Krzyżak, Adam |
author_sort | Gupta, Varun |
collection | PubMed |
description | In computer vision, one of the common practices to augment the image dataset is by creating new images using geometric transformation preserving similarity. This data augmentation was one of the most significant factors for winning the Image Net competition in 2012 with vast neural networks. Unlike in computer vision and speech data, there have not been many techniques explored to augment data in natural language processing (NLP). The only technique explored in the text data is lexical substitution, which only focuses on replacing words by synonyms. In this paper, we investigate the use of different pointer networks with the sequence-to-sequence models, which have shown excellent results in neural machine translation (NMT) and text simplification tasks, in generating similar sentences using a sequence-to-sequence model and the paraphrase dataset (PPDB). The evaluation of these paraphrases is carried out by augmenting the training dataset of IMDb movie review dataset and comparing its performance with the baseline model. To our best knowledge, this is the first study on generating paraphrases using these models with the help of PPDB dataset. |
format | Online Article Text |
id | pubmed-7304040 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-73040402020-06-19 An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation Gupta, Varun Krzyżak, Adam Computational Science – ICCS 2020 Article In computer vision, one of the common practices to augment the image dataset is by creating new images using geometric transformation preserving similarity. This data augmentation was one of the most significant factors for winning the Image Net competition in 2012 with vast neural networks. Unlike in computer vision and speech data, there have not been many techniques explored to augment data in natural language processing (NLP). The only technique explored in the text data is lexical substitution, which only focuses on replacing words by synonyms. In this paper, we investigate the use of different pointer networks with the sequence-to-sequence models, which have shown excellent results in neural machine translation (NMT) and text simplification tasks, in generating similar sentences using a sequence-to-sequence model and the paraphrase dataset (PPDB). The evaluation of these paraphrases is carried out by augmenting the training dataset of IMDb movie review dataset and comparing its performance with the baseline model. To our best knowledge, this is the first study on generating paraphrases using these models with the help of PPDB dataset. 2020-05-22 /pmc/articles/PMC7304040/ http://dx.doi.org/10.1007/978-3-030-50420-5_29 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Gupta, Varun Krzyżak, Adam An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation |
title | An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation |
title_full | An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation |
title_fullStr | An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation |
title_full_unstemmed | An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation |
title_short | An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation |
title_sort | empirical evaluation of attention and pointer networks for paraphrase generation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304040/ http://dx.doi.org/10.1007/978-3-030-50420-5_29 |
work_keys_str_mv | AT guptavarun anempiricalevaluationofattentionandpointernetworksforparaphrasegeneration AT krzyzakadam anempiricalevaluationofattentionandpointernetworksforparaphrasegeneration AT guptavarun empiricalevaluationofattentionandpointernetworksforparaphrasegeneration AT krzyzakadam empiricalevaluationofattentionandpointernetworksforparaphrasegeneration |