Cargando…

Generating Fake but Realistic Headlines Using Deep Neural Networks

Social media platforms such as Twitter and Facebook implement filters to detect fake news as they foresee their transition from social media platform to primary sources of news. The robustness of such filters lies in the variety and the quality of the data used to train them. There is, therefore, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Dandekar, Ashish, Zen, Remmy A. M., Bressan, Stéphane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7121779/
http://dx.doi.org/10.1007/978-3-319-64471-4_34
_version_ 1783515277333364736
author Dandekar, Ashish
Zen, Remmy A. M.
Bressan, Stéphane
author_facet Dandekar, Ashish
Zen, Remmy A. M.
Bressan, Stéphane
author_sort Dandekar, Ashish
collection PubMed
description Social media platforms such as Twitter and Facebook implement filters to detect fake news as they foresee their transition from social media platform to primary sources of news. The robustness of such filters lies in the variety and the quality of the data used to train them. There is, therefore, a need for a tool that automatically generates fake but realistic news. In this paper, we propose a deep learning model that automatically generates news headlines. The model is trained with a corpus of existing headlines from different topics. Once trained, the model generates a fake but realistic headline given a seed and a topic. For example, given the seed “Kim Jong Un” and the topic “Business”, the model generates the headline “kim jong un says climate change is already making money”. In order to better capture and leverage the syntactic structure of the headlines for the task of synthetic headline generation, we extend the architecture - Contextual Long Short Term Memory, proposed by Ghosh et al. - to also learn a part-of-speech model. We empirically and comparatively evaluate the performance of the proposed model on a real corpora of headlines. We compare our proposed approach and its variants using Long Short Term Memory and Gated Recurrent Units as the building blocks. We evaluate and compare the topical coherence of the generated headlines using a state-of-the-art classifier. We, also, evaluate the quality of the generated headline using a machine translation quality metric and its novelty using a metric we propose for this purpose. We show that the proposed model is practical and competitively efficient and effective.
format Online
Article
Text
id pubmed-7121779
institution National Center for Biotechnology Information
language English
publishDate 2017
record_format MEDLINE/PubMed
spelling pubmed-71217792020-04-06 Generating Fake but Realistic Headlines Using Deep Neural Networks Dandekar, Ashish Zen, Remmy A. M. Bressan, Stéphane Database and Expert Systems Applications Article Social media platforms such as Twitter and Facebook implement filters to detect fake news as they foresee their transition from social media platform to primary sources of news. The robustness of such filters lies in the variety and the quality of the data used to train them. There is, therefore, a need for a tool that automatically generates fake but realistic news. In this paper, we propose a deep learning model that automatically generates news headlines. The model is trained with a corpus of existing headlines from different topics. Once trained, the model generates a fake but realistic headline given a seed and a topic. For example, given the seed “Kim Jong Un” and the topic “Business”, the model generates the headline “kim jong un says climate change is already making money”. In order to better capture and leverage the syntactic structure of the headlines for the task of synthetic headline generation, we extend the architecture - Contextual Long Short Term Memory, proposed by Ghosh et al. - to also learn a part-of-speech model. We empirically and comparatively evaluate the performance of the proposed model on a real corpora of headlines. We compare our proposed approach and its variants using Long Short Term Memory and Gated Recurrent Units as the building blocks. We evaluate and compare the topical coherence of the generated headlines using a state-of-the-art classifier. We, also, evaluate the quality of the generated headline using a machine translation quality metric and its novelty using a metric we propose for this purpose. We show that the proposed model is practical and competitively efficient and effective. 2017-07-04 /pmc/articles/PMC7121779/ http://dx.doi.org/10.1007/978-3-319-64471-4_34 Text en © Springer International Publishing AG 2017 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Dandekar, Ashish
Zen, Remmy A. M.
Bressan, Stéphane
Generating Fake but Realistic Headlines Using Deep Neural Networks
title Generating Fake but Realistic Headlines Using Deep Neural Networks
title_full Generating Fake but Realistic Headlines Using Deep Neural Networks
title_fullStr Generating Fake but Realistic Headlines Using Deep Neural Networks
title_full_unstemmed Generating Fake but Realistic Headlines Using Deep Neural Networks
title_short Generating Fake but Realistic Headlines Using Deep Neural Networks
title_sort generating fake but realistic headlines using deep neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7121779/
http://dx.doi.org/10.1007/978-3-319-64471-4_34
work_keys_str_mv AT dandekarashish generatingfakebutrealisticheadlinesusingdeepneuralnetworks
AT zenremmyam generatingfakebutrealisticheadlinesusingdeepneuralnetworks
AT bressanstephane generatingfakebutrealisticheadlinesusingdeepneuralnetworks