Back to Search Start Over

I beg to differ: how disagreement is handled in the annotation of legal machine learning data sets.

Authors :
Braun, Daniel
Source :
Artificial Intelligence & Law; Sep2024, Vol. 32 Issue 3, p839-862, 24p
Publication Year :
2024

Abstract

Legal documents, like contracts or laws, are subject to interpretation. Different people can have different interpretations of the very same document. Large parts of judicial branches all over the world are concerned with settling disagreements that arise, in part, from these different interpretations. In this context, it only seems natural that during the annotation of legal machine learning data sets, disagreement, how to report it, and how to handle it should play an important role. This article presents an analysis of the current state-of-the-art in the annotation of legal machine learning data sets. The results of the analysis show that all of the analysed data sets remove all traces of disagreement, instead of trying to utilise the information that might be contained in conflicting annotations. Additionally, the publications introducing the data sets often do provide little information about the process that derives the "gold standard" from the initial annotations, often making it difficult to judge the reliability of the annotation process. Based on the state-of-the-art, the article provides easily implementable suggestions on how to improve the handling and reporting of disagreement in the annotation of legal machine learning data sets. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09248463
Volume :
32
Issue :
3
Database :
Complementary Index
Journal :
Artificial Intelligence & Law
Publication Type :
Academic Journal
Accession number :
178778434
Full Text :
https://doi.org/10.1007/s10506-023-09369-4