Back to Search Start Over

Increasing authorship identification through emotional analysis

Authors :
Martins, Ricardo
Almeida, J. J.
Henriques, Pedro Rangel
Novais, Paulo
Universidade do Minho
Publication Year :
2018
Publisher :
Springer Verlag, 2018.

Abstract

Writing style is considered the manner how an author expresses his thoughts, influenced by language characteristics of an individual, period, school, or nation. Most of the times, this writing style can identify the author. Yet, one of the most famous examples comes from 1914 in Portuguese literature, with Fernando Pessoa and his heteronyms Alberto Caeiro, Álvaro de Campos and Ricardo Reis, who had completely different writing styles and led people to believe that they were different individuals. So, the discussion about authorship identification already exists along a century. Currently, there are several alternatives to identify authors of text, however, these solutions do not consider the emotion contained in the text as source of information in the writing style. This paper is about a process to analyse the emotion contained in social media messages as Facebook (http://www.facebook.com) in order to identify the author’s emotional profile and use it to improve the ability to predict the authors of the messages. Using preprocessing techniques, lexicon-based approaches and machine learning, we achieved an authorship identification improvement around 5% in the whole dataset and more than 50% in specific authors, when considering the emotional profile on the writing style.<br />This work has been supported by COMPETE: POCI-01-0145-FEDER-0070 43 and FCT – Fundação para a Ciencia e Tecnologia within the Project Scope UID/CEC/00319/2013.

Details

Language :
English
Database :
OpenAIRE
Accession number :
edsair.od.......307..d771252fe7d8d9b6994375b8ab13883a