1. A ROBUST ENSEMBLE MODEL FOR SPOKEN LANGUAGE RECOGNITION
- Author
- Nancy WOODS and Gideon BABATUNDE
- Subjects
spoken language recognition, computer vision, image recognition, CNN, Information technology, T58.5-58.64, Electronic computers. Computer science, QA75.5-76.95
- Abstract
Identifying the language being spoken has been tackled over the years with statistical models applied to audio samples. A drawback of these approaches is that phonetically transcribed data is not available for all languages. This work proposes an approach based on image classification that uses image representations of audio samples. Our model uses neural networks and deep learning algorithms to analyse and classify three languages. The input to our network is a spectrogram, which is processed through the networks to extract local visual and temporal features for language prediction. The model achieved 95.56% accuracy on the test samples from the three languages.
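To illustrate what an image representation of an audio sample can look like, here is a minimal sketch of a log-magnitude spectrogram built with NumPy. It is not the authors' pipeline: this record does not state how their spectrograms were produced, so the function name `log_spectrogram`, the frame length, the hop size, the Hann window, and the synthetic 440 Hz test tone are all assumptions chosen for illustration.

```python
import numpy as np


def log_spectrogram(signal, frame_len=256, hop=128):
    """Return a log-magnitude spectrogram of shape (freq_bins, frames).

    Hypothetical helper: frame_len and hop are illustrative defaults,
    not values taken from the paper.
    """
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size < frame_len:
        raise ValueError("signal is shorter than one frame")

    # Slice the signal into overlapping frames and taper each one.
    window = np.hanning(frame_len)
    n_frames = 1 + (signal.size - frame_len) // hop
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])

    # One real FFT per frame gives frame_len // 2 + 1 frequency bins.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))

    # Compress the dynamic range and put frequency on the first axis,
    # so the array can be treated like a single-channel image.
    return np.log1p(magnitude).T


# Synthetic input (assumption): one second of a 440 Hz tone at 8 kHz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)

spec = log_spectrogram(tone)
peak_bin = int(spec.mean(axis=1).argmax())
peak_hz = peak_bin * sample_rate / 256  # centre frequency of the strongest bin
print(spec.shape, peak_bin, peak_hz)
```

The resulting 2-D array has frequency along one axis and time along the other, which is the kind of input an image classifier such as a convolutional network could consume. A real system would read recorded speech rather than a synthetic tone.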
- Published
- 2020