Investigation of Spoken-Language Detection and Classification in Broadcasted Audio Content

Authors :
Rigas Kotsakis
Maria Matsiola
George Kalliris
Charalampos Dimoulas
Source :
Information, Vol 11, Iss 4, p 211 (2020)
Publication Year :
2020
Publisher :
MDPI AG, 2020.

Abstract

This paper investigates spoken-language classification in audio broadcasting content. The approach reflects a real-world scenario encountered in modern media/monitoring organizations, where semi-automated indexing/documentation is deployed and could be facilitated by the proposed language detection preprocessing. Multilingual audio recordings of specific radio streams are formed into a small dataset, which is used for the adaptive classification experiments, without seeking, at this step, a generic language recognition model. Specifically, hierarchical discrimination schemes are followed to separate voice signals before classifying the spoken languages. Supervised and unsupervised machine learning is utilized at various windowing configurations to test the validity of our hypothesis. Besides the analysis of the achieved recognition scores (partial and overall), late integration models are proposed for semi-automatic annotation of new audio recordings. Hence, data augmentation mechanisms are offered, aiming at gradually formulating a Generic Audio Language Classification Repository. This database constitutes a program-adaptive collection that, besides the self-indexing metadata mechanisms, could facilitate generic language classification models in the future through state-of-the-art techniques like deep learning. This approach matches the investigatory inception of the project, which seeks indicators that could be applied in a second step with a larger dataset and/or an already pre-trained model, with the purpose of delivering overall results.

Details

Language :
English
ISSN :
11040211 and 20782489
Volume :
11
Issue :
4
Database :
Directory of Open Access Journals
Journal :
Information
Publication Type :
Academic Journal
Accession number :
edsdoj.8d6f82600d124c93b625183ce651665f
Document Type :
article
Full Text :
https://doi.org/10.3390/info11040211