Copyright © 2016 Helena Gómez-Adorno et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available.
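The kind of dictionary-based preprocessing described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the dictionary entries, the placeholder tags such as <smile>, and the function name normalize are all hypothetical and are not taken from the published resource.

```python
# Hypothetical entries for illustration only; the real resource provides
# much larger dictionaries for English, Spanish, Dutch, and Italian.
SLANG = {"gr8": "great", "u": "you"}
CONTRACTIONS = {"can't": "cannot", "i'm": "i am"}
ABBREVIATIONS = {"btw": "by the way"}
EMOTICONS = {":)": "<smile>", ":(": "<sad>"}


def normalize(text):
    """Replace emoticons with tags, then expand known tokens via the dictionaries."""
    # Emoticons are handled first as plain substrings, padded with spaces
    # so that each tag becomes its own whitespace-separated token.
    for emoticon, tag in EMOTICONS.items():
        text = text.replace(emoticon, f" {tag} ")
    # Merge the word-level dictionaries into a single lookup table.
    words = {**SLANG, **CONTRACTIONS, **ABBREVIATIONS}
    # Lowercase, split on whitespace, and substitute any token found in the table.
    tokens = text.lower().split()
    return " ".join(words.get(tok, tok) for tok in tokens)


print(normalize("btw I'm gr8 :)"))
```

Under these assumptions, the normalized text would then be passed to whatever feature extraction step follows; unknown tokens are left unchanged.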

Details

Title
Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts
Author
Gómez-Adorno, Helena; Markov, Ilia; Sidorov, Grigori; Posadas-Durán, Juan-Pablo; Sanchez-Perez, Miguel A; Chanona-Hernandez, Liliana
Publication year
2016
Publication date
2016
Publisher
John Wiley & Sons, Inc.
ISSN
1687-5265
e-ISSN
1687-5273
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1834479368