Back to Search
Start Over
One Model is Not Enough: Ensembles for Isolated Sign Language Recognition
- Source :
- Sensors, Vol 22, Iss 13, p 5043 (2022)
- Publication Year :
- 2022
- Publisher :
- MDPI AG, 2022.
-
Abstract
- In this paper, we dive into sign language recognition, focusing on the recognition of isolated signs. The task is defined as a classification problem, where a sequence of frames (i.e., images) is recognized as one of the given sign language glosses. We analyze two appearance-based approaches, I3D and TimeSformer, and one pose-based approach, SPOTER. The appearance-based approaches are trained on a few different data modalities, whereas the performance of SPOTER is evaluated on different types of preprocessing. All the methods are tested on two publicly available datasets: AUTSL and WLASL300. We experiment with ensemble techniques to achieve new state-of-the-art results of 73.84% accuracy on the WLASL300 dataset by using the CMA-ES optimization method to find the best ensemble weight parameters. Furthermore, we present an ensembling technique based on the Transformer model, which we call Neural Ensembler.
- Subjects :
- sign language recognition
CNN
Transformer
ensemble
Chemical technology
TP1-1185
Subjects
Details
- Language :
- English
- ISSN :
- 14248220
- Volume :
- 22
- Issue :
- 13
- Database :
- Directory of Open Access Journals
- Journal :
- Sensors
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.7b78a784ec7a4db7ab39ca6e958d2272
- Document Type :
- article
- Full Text :
- https://doi.org/10.3390/s22135043