Back to Search Start Over

NEREL: A Russian Dataset with Nested Named Entities, Relations and Events

Authors :
Loukachevitch, Natalia
Artemova, Ekaterina
Batura, Tatiana
Braslavski, Pavel
Denisov, Ilia
Ivanov, Vladimir
Manandhar, Suresh
Pugachev, Alexander
Tutubalina, Elena
Publication Year :
2021

Abstract

In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL.<br />Comment: accepted to RANLP

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2108.13112
Document Type :
Working Paper