Back to Search
Start Over
Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System
- Source :
- IEEE Access, Vol 8, Pp 78630-78638 (2020)
- Publication Year :
- 2020
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2020.
-
Abstract
- With the fast development of computer and information technology, multimedia data has become the most important form of information media. Auditory information plays an important role in information location, this comes from the fact that it can be difficult to find useful information. Thus audio classification becomes more important in audio analysis as it prepares for content-based audio retrieval. There is quite a bit of research on the topic of audio classification methods, audio feature analysis, and extraction based on audio classification. Many works of literature extract features of audio signals based on time or Fourier transform frequency domain. The emergence of the wavelet theory provides a time-frequency analysis tool for signal analysis. Wavelet transformation is a local transformation of the signal in time and frequency which can effectively extract information from the signal, and perform multi-scale refinement analysis on functions or signals through operations such as stretching and translation instead of the traditional Fourier transformation. In the time-frequency analysis of the signal, the wavelet analysis captures the local time and frequency characters of the signal which can improve the ability of signal analysis. It can also change certain locals of the signal without affecting other aspects of it. In this paper, the frequency domain features are combined with the wavelet domain features. At the same time that the MFCC features are extracted, the discrete wavelet transform is used to extract the features of the wavelet domain. Then the statistical features are extracted for each audio example, and the SVM model is used to realize the different forms of audio classification identification.
- Subjects :
- Discrete wavelet transform
General Computer Science
Computer science
02 engineering and technology
Signal
03 medical and health sciences
symbols.namesake
0302 clinical medicine
Wavelet
0202 electrical engineering, electronic engineering, information engineering
General Materials Science
030223 otorhinolaryngology
wavelet transform
Signal processing
Audio signal
business.industry
General Engineering
Pattern recognition
audio processing
Fourier transform
Frequency domain
Audio analyzer
symbols
020201 artificial intelligence & image processing
lcsh:Electrical engineering. Electronics. Nuclear engineering
Mel-frequency cepstrum
Artificial intelligence
business
audio feature
lcsh:TK1-9971
Content audio
Subjects
Details
- ISSN :
- 21693536
- Volume :
- 8
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....deda9ed9a3ac92d3607b786ef8ab86ca