14 results on '"Gabrielli, Leonardo"'
Search Results
2. An advanced multimodal driver-assistance prototype for emergency-vehicle detection.
- Author
-
Gabrielli, Leonardo, Migliorelli, Lucia, Cantarini, Michela, Mancini, Adriano, and Squartini, Stefano
- Subjects
- *
DRIVER assistance systems , *EMERGENCY vehicles , *TRAFFIC safety , *VIDEO monitors , *SIGNAL processing - Abstract
In the automotive industry, intelligent monitoring systems for advanced human-vehicle interaction aimed at enhancing the safety of drivers and passengers represent a rapidly growing area of research. Safe driving behavior relies on the driver's awareness of the road context, enabling them to make appropriate decisions and act consistently in anomalous circumstances. A potentially dangerous situation can arise when an emergency vehicle rapidly approaches with sirens blaring. In such cases, it is crucial for the driver to perform the correct maneuvers to prioritize the emergency vehicle. For this purpose, an Advanced Driver Assistance System (ADAS) can provide timely alerts to the driver about an approaching emergency vehicle. In this work, we present a driver-assistance prototype that leverages multimodal information from an integrated audio and video monitoring system. In the initial stage, sound analysis technologies based on computational audio processing are employed to recognize the proximity of an emergency vehicle based on the sound of its siren. When such an event occurs, an in-vehicle monitoring system is activated, analyzing the driver's facial patterns using deep-learning-based algorithms to assess their awareness. This work illustrates the design of such a prototype, presenting the hardware technologies, the software architecture, and the deep-learning algorithms for audio and video data analysis that make the driver-assistance prototype operational in a commercial car. At this initial experimental stage, the algorithms for analyzing the audio and video data have yielded promising results. The area under the precision-recall curve for siren identification stands at 0.92, while the accuracy in evaluating driver gaze orientation reaches 0.97. In conclusion, engaging in research within this field has the potential to significantly improve road safety by increasing driver awareness and facilitating timely and well-informed reactions to crucial situations. This could substantially reduce risks and ultimately protect lives on the road. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Evolutionary tuning of filters coefficients for binaural audio equalization
- Author
-
Pepe, Giovanni, Gabrielli, Leonardo, Squartini, Stefano, and Cattani, Luca
- Published
- 2020
- Full Text
- View/download PDF
4. Once Again
- Author
-
Gabrielli, Leonardo
- Published
- 2017
5. A digital waveguide-based approach for Clavinet modeling and synthesis
- Author
-
Gabrielli, Leonardo, Välimäki, Vesa, Penttinen, Henri, Squartini, Stefano, and Bilbao, Stefan
- Published
- 2013
- Full Text
- View/download PDF
6. Adaptive Linear Prediction Filtering in DWT Domain for Real-Time Musical Onset Detection
- Author
-
Gabrielli, Leonardo, Piazza, Francesco, and Squartini, Stefano
- Published
- 2011
- Full Text
- View/download PDF
7. Few-Shot Emergency Siren Detection.
- Author
-
Cantarini, Michela, Gabrielli, Leonardo, and Squartini, Stefano
- Subjects
- *
AMBULANCES , *EMERGENCY vehicles , *SENSOR placement , *SUPERVISED learning , *DEEP learning , *NOISE control , *SHOT peening , *WORKFLOW - Abstract
It is a well-established practice to build a robust system for sound event detection by training supervised deep learning models on large datasets, but audio data collection and labeling are often challenging and require large amounts of effort. This paper proposes a workflow based on few-shot metric learning for emergency siren detection performed in steps: prototypical networks are trained on publicly available sources or synthetic data in multiple combinations, and at inference time, the best knowledge learned in associating a sound with its class representation is transferred to identify ambulance sirens, given only a few instances for the prototype computation. Performance is evaluated on siren recordings acquired by sensors inside and outside the cabin of an equipped car, investigating the contribution of filtering techniques for background noise reduction. The results show the effectiveness of the proposed approach, achieving AUPRC scores equal to 0.86 and 0.91 in unfiltered and filtered conditions, respectively, outperforming a convolutional baseline model with and without fine-tuning for domain adaptation. Extensive experiments conducted on several recording sensor placements prove that few-shot learning is a reliable technique even in real-world scenarios and gives valuable insights for developing an in-car emergency vehicle detection system. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
8. Special Issue on Deep Learning for Applications in Acoustics: Modeling, Synthesis, and Listening.
- Author
-
Gabrielli, Leonardo, Fazekas, György, and Nam, Juhan
- Subjects
ACOUSTICS ,GENERATIVE adversarial networks ,ARTIFICIAL neural networks ,DEEP learning ,COMPUTER vision ,MACHINE learning - Abstract
In the fields of audio analysis, processing and acoustic modelling, Deep Learning has been swiftly adopted, initially borrowing their methods from the image processing and computer vision field, and then finding creative and innovative solutions to suit domain-specific needs of acoustic research. In Navarro-Caceres et al. [[8]] the authors propose an assistive music composition system that is able to satisfy multiple criteria and find multiple candidates in chord sequence generation using an Artificial Immune System, a biologically inspired AI technique based on Genetic Programming for multi-objective optimisation. The generated chord progressions fit a desired tonal tension profile, taking for instance consonance or the melodic attraction of successive chords into account, thus aiding the composition process. [Extracted from the article]
- Published
- 2021
- Full Text
- View/download PDF
9. The Rhodes electric piano: Analysis and simulation of the inharmonic overtones.
- Author
-
Gabrielli, Leonardo, Cantarini, Michela, Castellini, Paolo, and Squartini, Stefano
- Subjects
- *
KEYBOARD instruments , *PIANO , *TUNING forks , *DATA mining , *INTERMODULATION - Abstract
The Rhodes piano is an electromechanical keyboard instrument, released for the first time in 1946 and subsequently manufactured for at least four decades, reaching an iconic status and being now generally referred to as the electric piano. A few academic works discuss its operating principle and propose different physical modeling strategies; however, the inharmonic modes that characterize the attack transient have not been subject of a dedicated study before. This study addresses this topic by first observing the spectrum at the pickup output, applying a psychoacoustic model to assess perceptual relevance, and then conducts a series of scanning laser Doppler vibrometry (SLDV) experiments on the Rhodes asymmetric tuning fork. This study compares the modes of the Rhodes piano to those of its individual parts, allowing for the extraction of important information regarding role and natural modes. On the basis of this study, numerical experiments are conducted that show the intermodulation of the modes due to the magnetic pickup and allow the tones produced by the Rhodes from the collected data to be closely matched. Finally, this study is able to extract the distribution of the most important modes found on the whole keyboard range of a Rhodes piano, which can be useful for sound synthesis. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
10. Designing Audio Equalization Filters by Deep Neural Networks.
- Author
-
Pepe, Giovanni, Gabrielli, Leonardo, Squartini, Stefano, and Cattani, Luca
- Subjects
ACOUSTIC filters ,ARTIFICIAL neural networks ,IMPULSE response ,DEEP learning ,EUCLIDEAN distance ,LOUDSPEAKERS - Abstract
Audio equalization is an active research topic aiming at improving the audio quality of a loudspeaker system by correcting the overall frequency response using linear filters. The estimation of their coefficients is not an easy task, especially in binaural and multipoint scenarios, due to the contribution of multiple impulse responses to each listening point. This paper presents a deep learning approach for tuning filter coefficients employing three different neural networks architectures—the Multilayer Perceptron, the Convolutional Neural Network, and the Convolutional Autoencoder. Suitable loss functions are proposed for each architecture, and are formulated in terms of spectral Euclidean distance. The experiments were conducted in the automotive scenario, considering several loudspeakers and microphones. The obtained results show that deep learning techniques give superior performance compared to baseline methods, achieving almost flat magnitude frequency response. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
11. Polyphonic Sound Event Detection by Using Capsule Neural Networks.
- Author
-
Vesperini, Fabio, Gabrielli, Leonardo, Principi, Emanuele, and Squartini, Stefano
- Abstract
Artificial sound event detection (SED) aims to mimic the human ability to perceive and understand what is happening in the surroundings. Nowadays, deep learning offers valuable techniques for this goal, such as convolutional neural networks (CNNs). The capsule neural network (CapsNet) architecture has been recently introduced in the image processing field with the intent to overcome some of the known limitations of CNNs, specifically regarding the scarce robustness to affine transformations (i.e., perspective, size, and orientation) and the detection of overlapped images. This motivated the authors to employ CapsNets to deal with the polyphonic SED task, in which multiple sound events occur simultaneously. Specifically, we propose to exploit the capsule units to represent a set of distinctive properties for each individual sound event. Capsule units are connected through a so-called dynamic routing that encourages learning part-whole relationships and improves the detection performance in a polyphonic context. This paper reports extensive evaluations carried out on three publicly available datasets, showing how the CapsNet-based algorithm not only outperforms standard CNNs but also achieves the best results with respect to the state-of-the-art algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
12. A Statistical Framework for Automatic Leakage Detection in Smart Water and Gas Grids.
- Author
-
Fagiani, Marco, Squartini, Stefano, Gabrielli, Leonardo, Severini, Marco, and Piazza, Francesco
- Subjects
LEAK detection ,WATER leakage ,GAS leakage ,COMPUTATIONAL intelligence ,GRID computing - Abstract
In the last few years, due to the technological improvement of advanced metering infrastructures, water and natural gas grids can be regarded as smart-grids, similarly to power ones. However, considering the number of studies related to the application of computational intelligence to distribution grids, the gap between power grids and water/gas grids is notably wide. For this purpose, in this paper, a framework for leakage identification is presented. The framework is composed of three sections aimed at the extraction and the selection of features and at the detection of leakages. A variation of the Sequential Feature Selection (SFS) algorithm is used to select the best performing features within a set, including, also, innovative temporal ones. The leakage identification is based on novelty detection and exploits the characterization of a normality model. Three statistical approaches, The Gaussian Mixture Model (GMM), Hidden Markov Model (HMM) and One-Class Support Vector Machine (OC-SVM), are adopted, under a comparative perspective. Both residential and office building environments are investigated by means of two datasets. One is the Almanac of Minutely Power dataset (AMPds), and it provides water and gas data consumption at 1, 10 and 30 min of time resolution; the other is the Department of International Development (DFID) dataset, and it provides water and gas data consumption at 30 min of time resolution. The achieved performance, computed by means of the Area Under the Curve (AUC), reaches 90% in the office building case study, thus confirming the suitability of the proposed approach for applications in smart water and gas grids. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
13. A Real-Time Dual-Channel Speech Reinforcement System for Intra-Cabin Communication.
- Author
-
FACCENDA, FRANCESCO, SQUARTINI, STEFANO, PRINCIPI, EMANUELE, GABRIELLI, LEONARDO, and PIAZZA, FRANCESCO
- Subjects
REAL-time computing ,PASSENGERS ,MICROPHONES ,ELECTRONIC amplifiers ,LOUDSPEAKERS ,FEEDBACK control systems ,ORAL communication - Abstract
In order to facilitate the communication experience in a cabin (as among passengers located at distant seats in big vehicles), suitable systems involving the presence of microphones, amplifiers, and loudspeakers are needed. However when the sound is acquired and reproduced within the same acoustic environment, instability problems can arise due to acoustic feedback. The main objective of Speech Reinforcement techniques consists in reducing the occurrence of feedback effects, thus allowing for a comfortable intracabin communication. In this work we focus on the PEM-AFROW algorithm, an effective technique for acoustic feedback control, recently appearing in the literature. Moreover a suppressor filter is included within the feedback loop in order to improve the overall system stability and increase its inherent robustness to higher gain values. The overall algorithmic scheme is able to deal with the Dual-Channel communication case study, i.e., in the presence of two communicating speakers and with the speech information flowing in both directions. This means that, in contrast to the Single-Channel case study, echo paths, introduced by the double microphone and loudspeaker, must be considered. In order to keep the latencies low and allow a real-time processing, the partitioned block frequency domain adaptive filter (PB-FDAF) algorithm has been adopted. Voice Activity and Double Talk Detectors have been also included into the algorithmic framework. Performed computer simulations in various acoustic conditions have shown the effectiveness of the approach. [ABSTRACT FROM AUTHOR]
- Published
- 2013
14. A3CarScene: An audio-visual dataset for driving scene understanding.
- Author
-
Cantarini M, Gabrielli L, Mancini A, Squartini S, and Longo R
- Abstract
Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene , a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor suite includes eight microphones installed inside and outside the passenger compartment and two dashcams mounted on the front and rear windows. Approximately 31 h of data for each device were collected during October and November 2022 by driving about 1500 km along diverse roads and landscapes, in variable weather conditions, in daytime and nighttime hours. All key information for the scene understanding process of automated vehicles has been accurately annotated. For each route, annotations with beginning and end timestamps report the type of road traveled ( motorway, trunk, primary, secondar y, tertiary, residential , and service roads), the degree of urbanization of the area ( city, town, suburban area, village, exurban and rural areas ), the weather conditions ( clear, cloudy, overcast , and rainy ), the level of lighting ( daytime, evening, night , and tunnel ), the type ( asphalt or cobblestones ) and moisture status ( dry or wet ) of the road pavement, and the state of the windows ( open or closed ). This large-scale dataset is valuable for developing new driving assistance technologies based on audio or video data alone or in a multimodal manner and for improving the performance of systems currently in use. The data acquisition process with sensors in multiple locations allows for the assessment of the best installation placement concerning the task. Deep learning engineers can use this dataset to build new baselines, as a comparative benchmark, and to extend existing databases for autonomous driving., Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (© 2023 The Authors.)
- Published
- 2023
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.