Back to Search
Start Over
A review of infant cry analysis and classification
- Source :
- EURASIP Journal on Audio, Speech, and Music Processing, Vol 2021, Iss 1, Pp 1-17 (2021)
- Publication Year :
- 2021
- Publisher :
- Springer Science and Business Media LLC, 2021.
-
Abstract
- This paper reviews recent research works in infant cry signal analysis and classification tasks. A broad range of literatures are reviewed mainly from the aspects of data acquisition, cross domain signal processing techniques, and machine learning classification methods. We introduce pre-processing approaches and describe a diversity of features such as MFCC, spectrogram, and fundamental frequency, etc. Both acoustic features and prosodic features extracted from different domains can discriminate frame-based signals from one another and can be used to train machine learning classifiers. Together with traditional machine learning classifiers such as KNN, SVM, and GMM, newly developed neural network architectures such as CNN and RNN are applied in infant cry research. We present some significant experimental results on pathological cry identification, cry reason classification, and cry sound detection with some typical databases. This survey systematically studies the previous research in all relevant areas of infant cry and provides an insight on the current cutting-edge works in infant cry signal analysis and classification. We also propose future research directions in data processing, feature extraction, and neural network classification fields to better understand, interpret, and process infant cry signals.
- Subjects :
- Infant cry detection
020205 medical informatics
Acoustics and Ultrasonics
Computer science
Pathological cry
Speech recognition
Feature extraction
lcsh:QC221-246
02 engineering and technology
lcsh:QA75.5-76.95
Machine learning
0202 electrical engineering, electronic engineering, information engineering
Electrical and Electronic Engineering
Signal processing
Artificial neural network
Infant cry
Support vector machine
Identification (information)
Statistical classification
ComputingMethodologies_PATTERNRECOGNITION
lcsh:Acoustics. Sound
Infant cry classification
Spectrogram
020201 artificial intelligence & image processing
lcsh:Electronic computers. Computer science
Mel-frequency cepstrum
Subjects
Details
- ISSN :
- 16874722
- Volume :
- 2021
- Database :
- OpenAIRE
- Journal :
- EURASIP Journal on Audio, Speech, and Music Processing
- Accession number :
- edsair.doi.dedup.....9654436f617c0889f08b720ff902411b
- Full Text :
- https://doi.org/10.1186/s13636-021-00197-5