Back to Search
Start Over
Orthographic features for emotion classification in Chinese in informal short texts
- Source :
- Language Resources and Evaluation. 55:329-352
- Publication Year :
- 2020
- Publisher :
- Springer Science and Business Media LLC, 2020.
-
Abstract
- Informal short texts on the web are rich in emotions as they often reflect unfiltered immediate reactions to breaking news events. The emotion density, however, stands in contrast to its poverty of linguistic contexts and features for emotion classification. This paper tackles that challenge by proposing orthographic features based on orthographic code mixing and code-switching for both non-ML and ML approaches. Our results show that orthographic features routinely outperform grammatical features for emotion classification for short texts in all approaches as expected. Orthographic features were also shown to make more significant contributions, especially in terms of precision and in formal texts when state of the art deep learning algorithms are applied. This result confirms the effectiveness of the orthographic change feature to the task of emotion classification. These results are argued to be applicable to all languages because of the common code-shifting in languages with non-Latin orthographies, and the use of non-letter symbols in all languages.
- Subjects :
- 050101 languages & linguistics
Linguistics and Language
Computer science
Emotion classification
02 engineering and technology
Library and Information Sciences
computer.software_genre
Language and Linguistics
Education
Code-mixing
Task (project management)
0202 electrical engineering, electronic engineering, information engineering
Feature (machine learning)
0501 psychology and cognitive sciences
business.industry
Deep learning
05 social sciences
Orthographic projection
Contrast (statistics)
ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
020201 artificial intelligence & image processing
Artificial intelligence
Computational linguistics
business
computer
Natural language processing
Subjects
Details
- ISSN :
- 15740218 and 1574020X
- Volume :
- 55
- Database :
- OpenAIRE
- Journal :
- Language Resources and Evaluation
- Accession number :
- edsair.doi...........64d15e59bf1e4a5e5d2ddd763b1ebc4b