Back to Search Start Over

Zero-Shot Learning for Cross-Lingual News Sentiment Classification.

Authors :
Pelicon, Andraž
Pranjić, Marko
Miljković, Dragana
Škrlj, Blaž
Pollak, Senja
Source :
Applied Sciences (2076-3417); Sep2020, Vol. 10 Issue 17, p5993, 21p
Publication Year :
2020

Abstract

In this paper, we address the task of zero-shot cross-lingual news sentiment classification. Given the annotated dataset of positive, neutral, and negative news in Slovene, the aim is to develop a news classification system that assigns the sentiment category not only to Slovene news, but to news in another language without any training data required. Our system is based on the multilingual BERTmodel, while we test different approaches for handling long documents and propose a novel technique for sentiment enrichment of the BERT model as an intermediate training step. With the proposed approach, we achieve state-of-the-art performance on the sentiment analysis task on Slovenian news. We evaluate the zero-shot cross-lingual capabilities of our system on a novel news sentiment test set in Croatian. The results show that the cross-lingual approach also largely outperforms the majority classifier, as well as all settings without sentiment enrichment in pre-training. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
20763417
Volume :
10
Issue :
17
Database :
Complementary Index
Journal :
Applied Sciences (2076-3417)
Publication Type :
Academic Journal
Accession number :
145988861
Full Text :
https://doi.org/10.3390/app10175993