Cargando…

Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts

We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data prep...

Descripción completa

Detalles Bibliográficos
Autores principales: Gómez-Adorno, Helena, Markov, Ilia, Sidorov, Grigori, Posadas-Durán, Juan-Pablo, Sanchez-Perez, Miguel A., Chanona-Hernandez, Liliana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5066026/
https://www.ncbi.nlm.nih.gov/pubmed/27795703
http://dx.doi.org/10.1155/2016/1638936
_version_ 1782460411575009280
author Gómez-Adorno, Helena
Markov, Ilia
Sidorov, Grigori
Posadas-Durán, Juan-Pablo
Sanchez-Perez, Miguel A.
Chanona-Hernandez, Liliana
author_facet Gómez-Adorno, Helena
Markov, Ilia
Sidorov, Grigori
Posadas-Durán, Juan-Pablo
Sanchez-Perez, Miguel A.
Chanona-Hernandez, Liliana
author_sort Gómez-Adorno, Helena
collection PubMed
description We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available.
format Online
Article
Text
id pubmed-5066026
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-50660262016-10-30 Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts Gómez-Adorno, Helena Markov, Ilia Sidorov, Grigori Posadas-Durán, Juan-Pablo Sanchez-Perez, Miguel A. Chanona-Hernandez, Liliana Comput Intell Neurosci Research Article We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available. Hindawi Publishing Corporation 2016 2016-10-03 /pmc/articles/PMC5066026/ /pubmed/27795703 http://dx.doi.org/10.1155/2016/1638936 Text en Copyright © 2016 Helena Gómez-Adorno et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Gómez-Adorno, Helena
Markov, Ilia
Sidorov, Grigori
Posadas-Durán, Juan-Pablo
Sanchez-Perez, Miguel A.
Chanona-Hernandez, Liliana
Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts
title Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts
title_full Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts
title_fullStr Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts
title_full_unstemmed Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts
title_short Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts
title_sort improving feature representation based on a neural network for author profiling in social media texts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5066026/
https://www.ncbi.nlm.nih.gov/pubmed/27795703
http://dx.doi.org/10.1155/2016/1638936
work_keys_str_mv AT gomezadornohelena improvingfeaturerepresentationbasedonaneuralnetworkforauthorprofilinginsocialmediatexts
AT markovilia improvingfeaturerepresentationbasedonaneuralnetworkforauthorprofilinginsocialmediatexts
AT sidorovgrigori improvingfeaturerepresentationbasedonaneuralnetworkforauthorprofilinginsocialmediatexts
AT posadasduranjuanpablo improvingfeaturerepresentationbasedonaneuralnetworkforauthorprofilinginsocialmediatexts
AT sanchezperezmiguela improvingfeaturerepresentationbasedonaneuralnetworkforauthorprofilinginsocialmediatexts
AT chanonahernandezliliana improvingfeaturerepresentationbasedonaneuralnetworkforauthorprofilinginsocialmediatexts