1. Feature Selection for Twitter Classification
- Author
-
Ostrowski David
- Subjects
business.industry ,Computer science ,Feature selection ,Mutual information ,Information theory ,computer.software_genre ,Identification (information) ,Market research ,Statistical classification ,Information extraction ,Preprocessor ,Artificial intelligence ,business ,computer ,Natural language processing - Abstract
Twitter-based messages have presented challenges in the identification of features as applied to classification. This paper explores filtering techniques for improved trend detection and information extraction. Starting with a pre-filtered source (Twitter), we will examine the application of both information theory and Natural Language Processing (NLP) based techniques as a means of preprocessing for classification. Results demonstrate that both means allow for improved results in classification among highly idiosyncratic data (Twitter).
- Published
- 2014
- Full Text
- View/download PDF