Cargando…
Generating Fake but Realistic Headlines Using Deep Neural Networks
Social media platforms such as Twitter and Facebook implement filters to detect fake news as they foresee their transition from social media platform to primary sources of news. The robustness of such filters lies in the variety and the quality of the data used to train them. There is, therefore, a...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7121779/ http://dx.doi.org/10.1007/978-3-319-64471-4_34 |
_version_ | 1783515277333364736 |
---|---|
author | Dandekar, Ashish Zen, Remmy A. M. Bressan, Stéphane |
author_facet | Dandekar, Ashish Zen, Remmy A. M. Bressan, Stéphane |
author_sort | Dandekar, Ashish |
collection | PubMed |
description | Social media platforms such as Twitter and Facebook implement filters to detect fake news as they foresee their transition from social media platform to primary sources of news. The robustness of such filters lies in the variety and the quality of the data used to train them. There is, therefore, a need for a tool that automatically generates fake but realistic news. In this paper, we propose a deep learning model that automatically generates news headlines. The model is trained with a corpus of existing headlines from different topics. Once trained, the model generates a fake but realistic headline given a seed and a topic. For example, given the seed “Kim Jong Un” and the topic “Business”, the model generates the headline “kim jong un says climate change is already making money”. In order to better capture and leverage the syntactic structure of the headlines for the task of synthetic headline generation, we extend the architecture - Contextual Long Short Term Memory, proposed by Ghosh et al. - to also learn a part-of-speech model. We empirically and comparatively evaluate the performance of the proposed model on a real corpora of headlines. We compare our proposed approach and its variants using Long Short Term Memory and Gated Recurrent Units as the building blocks. We evaluate and compare the topical coherence of the generated headlines using a state-of-the-art classifier. We, also, evaluate the quality of the generated headline using a machine translation quality metric and its novelty using a metric we propose for this purpose. We show that the proposed model is practical and competitively efficient and effective. |
format | Online Article Text |
id | pubmed-7121779 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
record_format | MEDLINE/PubMed |
spelling | pubmed-71217792020-04-06 Generating Fake but Realistic Headlines Using Deep Neural Networks Dandekar, Ashish Zen, Remmy A. M. Bressan, Stéphane Database and Expert Systems Applications Article Social media platforms such as Twitter and Facebook implement filters to detect fake news as they foresee their transition from social media platform to primary sources of news. The robustness of such filters lies in the variety and the quality of the data used to train them. There is, therefore, a need for a tool that automatically generates fake but realistic news. In this paper, we propose a deep learning model that automatically generates news headlines. The model is trained with a corpus of existing headlines from different topics. Once trained, the model generates a fake but realistic headline given a seed and a topic. For example, given the seed “Kim Jong Un” and the topic “Business”, the model generates the headline “kim jong un says climate change is already making money”. In order to better capture and leverage the syntactic structure of the headlines for the task of synthetic headline generation, we extend the architecture - Contextual Long Short Term Memory, proposed by Ghosh et al. - to also learn a part-of-speech model. We empirically and comparatively evaluate the performance of the proposed model on a real corpora of headlines. We compare our proposed approach and its variants using Long Short Term Memory and Gated Recurrent Units as the building blocks. We evaluate and compare the topical coherence of the generated headlines using a state-of-the-art classifier. We, also, evaluate the quality of the generated headline using a machine translation quality metric and its novelty using a metric we propose for this purpose. We show that the proposed model is practical and competitively efficient and effective. 2017-07-04 /pmc/articles/PMC7121779/ http://dx.doi.org/10.1007/978-3-319-64471-4_34 Text en © Springer International Publishing AG 2017 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Dandekar, Ashish Zen, Remmy A. M. Bressan, Stéphane Generating Fake but Realistic Headlines Using Deep Neural Networks |
title | Generating Fake but Realistic Headlines Using Deep Neural Networks |
title_full | Generating Fake but Realistic Headlines Using Deep Neural Networks |
title_fullStr | Generating Fake but Realistic Headlines Using Deep Neural Networks |
title_full_unstemmed | Generating Fake but Realistic Headlines Using Deep Neural Networks |
title_short | Generating Fake but Realistic Headlines Using Deep Neural Networks |
title_sort | generating fake but realistic headlines using deep neural networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7121779/ http://dx.doi.org/10.1007/978-3-319-64471-4_34 |
work_keys_str_mv | AT dandekarashish generatingfakebutrealisticheadlinesusingdeepneuralnetworks AT zenremmyam generatingfakebutrealisticheadlinesusingdeepneuralnetworks AT bressanstephane generatingfakebutrealisticheadlinesusingdeepneuralnetworks |