151. User Profiling based on Tweeter Data using WordNet and News Paper Archive
- Author
-
Alok Ranjan Pal and Antara Pal
- Subjects
Statement (computer science) ,Computer science ,business.industry ,InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL ,WordNet ,computer.software_genre ,Semantics ,Newspaper ,Semantic similarity ,Similarity (psychology) ,Profiling (information science) ,Artificial intelligence ,business ,computer ,Natural language ,Natural language processing - Abstract
In this paper, a method has been proposed for user profiling based on tweeter data. The sentiments of the tweets are retrieved programmatically with the help of WordNet and News Paper Archive. In this experiment, the English WordNet 2.1 has been used as an online semantic dictionary and machine readable version of the "Times of India" news paper has been used to generate a news paper archive. The algorithm is tested on a data set of 1000 tweets from four different categories which are initially tagged by their innate senses for validation of the derived result.First of all, the data set is evaluated with the help of newspaper archive by using lexical overlap and the accuracy in sense retrieval task is 48.7%. The reason behind this scenario is the varieties of representations of a single statement in natural language which creates a mare similarity between the lexical entities of the statements. To overcome this problem, the contexts of the tweets are expanded with the help of WordNet by considering the synonyms of every meaningful word of the tweets and after that the senses of these tweets are evaluated. As the contexts of the statements are expanded in this approach, semantic relatedness between the statements is resolved in an efficient way which leads the system towards a better performance.
- Published
- 2021