Cargando…

An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation

In computer vision, one of the common practices to augment the image dataset is by creating new images using geometric transformation preserving similarity. This data augmentation was one of the most significant factors for winning the Image Net competition in 2012 with vast neural networks. Unlike...

Descripción completa

Detalles Bibliográficos
Autores principales: Gupta, Varun, Krzyżak, Adam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304040/
http://dx.doi.org/10.1007/978-3-030-50420-5_29
_version_ 1783548186215841792
author Gupta, Varun
Krzyżak, Adam
author_facet Gupta, Varun
Krzyżak, Adam
author_sort Gupta, Varun
collection PubMed
description In computer vision, one of the common practices to augment the image dataset is by creating new images using geometric transformation preserving similarity. This data augmentation was one of the most significant factors for winning the Image Net competition in 2012 with vast neural networks. Unlike in computer vision and speech data, there have not been many techniques explored to augment data in natural language processing (NLP). The only technique explored in the text data is lexical substitution, which only focuses on replacing words by synonyms. In this paper, we investigate the use of different pointer networks with the sequence-to-sequence models, which have shown excellent results in neural machine translation (NMT) and text simplification tasks, in generating similar sentences using a sequence-to-sequence model and the paraphrase dataset (PPDB). The evaluation of these paraphrases is carried out by augmenting the training dataset of IMDb movie review dataset and comparing its performance with the baseline model. To our best knowledge, this is the first study on generating paraphrases using these models with the help of PPDB dataset.
format Online
Article
Text
id pubmed-7304040
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73040402020-06-19 An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation Gupta, Varun Krzyżak, Adam Computational Science – ICCS 2020 Article In computer vision, one of the common practices to augment the image dataset is by creating new images using geometric transformation preserving similarity. This data augmentation was one of the most significant factors for winning the Image Net competition in 2012 with vast neural networks. Unlike in computer vision and speech data, there have not been many techniques explored to augment data in natural language processing (NLP). The only technique explored in the text data is lexical substitution, which only focuses on replacing words by synonyms. In this paper, we investigate the use of different pointer networks with the sequence-to-sequence models, which have shown excellent results in neural machine translation (NMT) and text simplification tasks, in generating similar sentences using a sequence-to-sequence model and the paraphrase dataset (PPDB). The evaluation of these paraphrases is carried out by augmenting the training dataset of IMDb movie review dataset and comparing its performance with the baseline model. To our best knowledge, this is the first study on generating paraphrases using these models with the help of PPDB dataset. 2020-05-22 /pmc/articles/PMC7304040/ http://dx.doi.org/10.1007/978-3-030-50420-5_29 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Gupta, Varun
Krzyżak, Adam
An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation
title An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation
title_full An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation
title_fullStr An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation
title_full_unstemmed An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation
title_short An Empirical Evaluation of Attention and Pointer Networks for Paraphrase Generation
title_sort empirical evaluation of attention and pointer networks for paraphrase generation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304040/
http://dx.doi.org/10.1007/978-3-030-50420-5_29
work_keys_str_mv AT guptavarun anempiricalevaluationofattentionandpointernetworksforparaphrasegeneration
AT krzyzakadam anempiricalevaluationofattentionandpointernetworksforparaphrasegeneration
AT guptavarun empiricalevaluationofattentionandpointernetworksforparaphrasegeneration
AT krzyzakadam empiricalevaluationofattentionandpointernetworksforparaphrasegeneration