Back to Search
Start Over
EventDNA: a dataset for Dutch news event extraction as a basis for news diversification.
- Source :
- Language Resources & Evaluation; Mar2023, Vol. 57 Issue 1, p189-221, 33p
- Publication Year :
- 2023
-
Abstract
- News organizations increasingly tailor their news offering to the reader through personalized recommendation algorithms. However, automated recommendation algorithms reflect a commercial logic based on calculated relevance to the user, rather than aiming at a well-informed citizenry. In this paper, we introduce the EventDNA corpus, a dataset of 1773 Dutch-language news articles annotated with information on entities, news events and IPTC Media Topic codes, with the ultimate goal to outline a recommendation algorithm that uses news event diversity rather than previous reading behaviour as a key driver for personalized news recommendation. We describe the EventDNA annotation guidelines, which are inspired by the well-known ERE framework and conclude that it is not practical to apply a fixed event typology such as used in ERE to an unrestricted data context. The corpus and related source code is made available at https://github.com/NewsDNA-LT3/.github. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 1574020X
- Volume :
- 57
- Issue :
- 1
- Database :
- Complementary Index
- Journal :
- Language Resources & Evaluation
- Publication Type :
- Academic Journal
- Accession number :
- 162506769
- Full Text :
- https://doi.org/10.1007/s10579-022-09623-2