201. Enhancing deep learning sentiment analysis with ensemble techniques in social applications
- Author
-
Oscar Araque, Carlos A. Iglesias, J. Fernando Snchez-Rada, and Ignacio Corcuera-Platas
- Subjects
Ensemble forecasting ,Computer science ,business.industry ,Deep learning ,Sentiment analysis ,Feature extraction ,General Engineering ,020206 networking & telecommunications ,02 engineering and technology ,Machine learning ,computer.software_genre ,Ensemble learning ,Computer Science Applications ,Ensembles of classifiers ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Data mining ,Artificial intelligence ,business ,Classifier (UML) ,computer ,Engineering(all) - Abstract
A taxonomy that classifies ensemble models in the literature is presented.Surface and deep features integration is explored to improve classification.Several ensembles of classifiers and features are proposed and evaluated.Performance of the proposed models is evaluated on several sentiment datasets. Deep learning techniques for Sentiment Analysis have become very popular. They provide automatic feature extraction and both richer representation capabilities and better performance than traditional feature based techniques (i.e., surface methods). Traditional surface approaches are based on complex manually extracted features, and this extraction process is a fundamental question in feature driven methods. These long-established approaches can yield strong baselines, and their predictive capabilities can be used in conjunction with the arising deep learning methods. In this paper we seek to improve the performance of deep learning techniques integrating them with traditional surface approaches based on manually extracted features. The contributions of this paper are sixfold. First, we develop a deep learning based sentiment classifier using a word embeddings model and a linear machine learning algorithm. This classifier serves as a baseline to compare to subsequent results. Second, we propose two ensemble techniques which aggregate our baseline classifier with other surface classifiers widely used in Sentiment Analysis. Third, we also propose two models for combining both surface and deep features to merge information from several sources. Fourth, we introduce a taxonomy for classifying the different models found in the literature, as well as the ones we propose. Fifth, we conduct several experiments to compare the performance of these models with the deep learning baseline. For this, we use seven public datasets that were extracted from the microblogging and movie reviews domain. Finally, as a result, a statistical study confirms that the performance of these proposed models surpasses that of our original baseline on F1-Score.
- Published
- 2017
- Full Text
- View/download PDF