A model of auditory perception as front end for automatic speech recognition
- Source :
- The Journal of the Acoustical Society of America. 106:2040-2050
- Publication Year :
- 1999
- Publisher :
- Acoustical Society of America (ASA), 1999.
Abstract
- A front end for automatic speech recognizers, based on a quantitative model of "effective" peripheral auditory processing, is proposed and evaluated. The model simulates both spectral and temporal properties of sound processing in the auditory system that were found in psychoacoustical and physiological experiments. The robustness of the auditory-based representation of speech was evaluated in speaker-independent, isolated-word recognition experiments in different types of additive noise. The results show that the auditory front end is more robust in noise than common mel-scale cepstral feature extraction. In a second set of experiments, different processing stages of the auditory front end were modified to study in detail their contribution to robust speech signal representation. The adaptive compression stage, which enhances temporal changes of the input signal, appeared to be the most important processing stage for robust speech representation in noise. Low-pass filtering of the fast-fluctuating envelope in each frequency band further reduces the influence of noise on the auditory-based representation of speech.
- Subjects :
- Auditory perception
Time Factors
Auditory scene analysis
Acoustics and Ultrasonics
Computer science
Speech recognition
Models, Psychological
Models, Biological
Arts and Humanities (miscellaneous)
Cepstrum
medicine
Humans
Auditory system
Audio signal processing
Voice activity detection
Automatism
Filter (signal processing)
Speech processing
Computational auditory scene analysis
Word recognition
Auditory Perception
Speech Perception
Neurocomputational speech processing
Noise
computer
Details
- ISSN :
- 0001-4966
- Volume :
- 106
- Database :
- OpenAIRE
- Journal :
- The Journal of the Acoustical Society of America
- Accession number :
- edsair.doi.dedup.....b014ce0063e4f968570099a350acf0c7