1. A Morphological Model for Simulating Acoustic Scenes and Its Application to Sound Event Detection
- Author
-
Axel Roebel, Grégoire Lafay, Mathieu Lagrange, Mathias Rossignol, Emmanouil Benetos, ADTSI, Institut de Recherche en Communications et en Cybernétique de Nantes (IRCCyN), Mines Nantes (Mines Nantes)-École Centrale de Nantes (ECN)-Ecole Polytechnique de l'Université de Nantes (EPUN), Université de Nantes (UN)-Université de Nantes (UN)-PRES Université Nantes Angers Le Mans (UNAM)-Centre National de la Recherche Scientifique (CNRS)-Mines Nantes (Mines Nantes)-École Centrale de Nantes (ECN)-Ecole Polytechnique de l'Université de Nantes (EPUN), Université de Nantes (UN)-Université de Nantes (UN)-PRES Université Nantes Angers Le Mans (UNAM)-Centre National de la Recherche Scientifique (CNRS), Analyse et synthèse sonores [Paris], Sciences et Technologies de la Musique et du Son (STMS), Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS), Queen Mary University of London (QMUL), and ANR-11-JS03-0005,Houle,Apprentissage de structures hiérarchiques pour l'analyse computationnelle de scènes auditives(2011)
- Subjects
Auditory scene analysis ,Acoustics and Ultrasonics ,auditory scene analysis ,Computer science ,Speech recognition ,Morphological model ,acoustic event detection ,02 engineering and technology ,030507 speech-language pathology & audiology ,03 medical and health sciences ,[STAT.ML]Statistics [stat]/Machine Learning [stat.ML] ,experimental validation ,Robustness (computer science) ,0202 electrical engineering, electronic engineering, information engineering ,Computer Science (miscellaneous) ,Computer vision ,Electrical and Electronic Engineering ,business.industry ,Acoustic model ,Speech processing ,Sound event detection ,Sound recording and reproduction ,Computational Mathematics ,Computational auditory scene analysis ,020201 artificial intelligence & image processing ,Artificial intelligence ,0305 other medical science ,business ,[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing - Abstract
International audience; This paper introduces a model of environmental acoustic scenes which adopts a morphological approach by ab-stracting temporal structures of acoustic scenes. To demonstrate its potential, this model is employed to evaluate the performance of a large set of acoustic events detection systems. This model allows us to explicitly control key morphological aspects of the acoustic scene and isolate their impact on the performance of the system under evaluation. Thus, more information can be gained on the behavior of evaluated systems, providing guidance for further improvements. The proposed model is validated using submitted systems from the IEEE DCASE Challenge; results indicate that the proposed scheme is able to successfully build datasets useful for evaluating some aspects the performance of event detection systems, more particularly their robustness to new listening conditions and the increasing level of background sounds.
- Published
- 2016
- Full Text
- View/download PDF