Back to Search Start Over

On the Fragility of Active Learners

Authors :
Ghose, Abhishek
Nguyen, Emma Thuong
Ghose, Abhishek
Nguyen, Emma Thuong
Publication Year :
2024

Abstract

Active learning (AL) techniques aim to maximally utilize a labeling budget by iteratively selecting instances that are most likely to improve prediction accuracy. However, their benefit compared to random sampling has not been consistent across various setups, e.g., different datasets, classifiers. In this empirical study, we examine how a combination of different factors might obscure any gains from an AL technique. Focusing on text classification, we rigorously evaluate AL techniques over around 1000 experiments that vary wrt the dataset, batch size, text representation and the classifier. We show that AL is only effective in a narrow set of circumstances. We also address the problem of using metrics that are better aligned with real world expectations. The impact of this study is in its insights for a practitioner: (a) the choice of text representation and classifier is as important as that of an AL technique, (b) choice of the right metric is critical in assessment of the latter, and, finally, (c) reported AL results must be holistically interpreted, accounting for variables other than just the query strategy.

Details

Database :
OAIster
Publication Type :
Electronic Resource
Accession number :
edsoai.on1438539715
Document Type :
Electronic Resource