Back to Search Start Over

Deep active learning with simulated rationales for text classification

Authors :
Paul Guélorget
Bruno Grilheres
Titus Zaharia
Institut Polytechnique de Paris (IP Paris)
Airbus Defence and Space
Airbus Group
Département Advanced Research And Techniques For Multidimensional Imaging Systems (ARTEMIS)
Institut Mines-Télécom [Paris] (IMT)-Télécom SudParis (TSP)
ARMEDIA (ARMEDIA-SAMOVAR)
Services répartis, Architectures, MOdélisation, Validation, Administration des Réseaux (SAMOVAR)
Institut Mines-Télécom [Paris] (IMT)-Télécom SudParis (TSP)-Institut Mines-Télécom [Paris] (IMT)-Télécom SudParis (TSP)
Source :
Lecture Notes in Computer Science, vol 12068. Springer, Pattern Recognition and Artificial Intelligence: international conference, ICPRAI 2020, Zhongshan, China, October 19–23, 2020, proceedings, ICPRAI 2020: 2nd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2020: 2nd International Conference on Pattern Recognition and Artificial Intelligence:, Oct 2020, Zhongshan (online), China. pp.363-379, ⟨10.1007/978-3-030-59830-3_32⟩, Pattern Recognition and Artificial Intelligence ISBN: 9783030598297, ICPRAI
Publication Year :
2020
Publisher :
HAL CCSD, 2020.

Abstract

International audience; Neural networks have become a preferred tool for text classification tasks, demonstrating state of the art performances when trained on a large set of labeled data. However, in an early active learning setup, the scarcity of the ground-truth labels available severely penalizes the generalization capability of the neural network. In order to overcome such limitations, in this paper, we introduce a new learning strategy, which consist of inserting in the early stages of the learning process some additional, local and salient knowledge, presented under the form of simulated, human like rationales. We show how such knowledge can be automatically extracted from documents by analyzing the class activation maps of a convolutional neural network. The experimental results obtained demonstrate that the exploitation of such rationales permits to significantly speed-up the learning process, with a spectacular increase of the accuracy rates, starting from a very reduced number of documents (10–20).

Details

Language :
English
ISBN :
978-3-030-59829-7
ISBNs :
9783030598297
Database :
OpenAIRE
Journal :
Lecture Notes in Computer Science, vol 12068. Springer, Pattern Recognition and Artificial Intelligence: international conference, ICPRAI 2020, Zhongshan, China, October 19–23, 2020, proceedings, ICPRAI 2020: 2nd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2020: 2nd International Conference on Pattern Recognition and Artificial Intelligence:, Oct 2020, Zhongshan (online), China. pp.363-379, ⟨10.1007/978-3-030-59830-3_32⟩, Pattern Recognition and Artificial Intelligence ISBN: 9783030598297, ICPRAI
Accession number :
edsair.doi.dedup.....c7b9d9cbf97cbc03840bfcd361929c5c
Full Text :
https://doi.org/10.1007/978-3-030-59830-3_32⟩