Back to Search Start Over

A review of infant cry analysis and classification

Authors :
Thosini Bamunu Mudiyanselage
Yutong Gao
Yi Pan
Chunyan Ji
Source :
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2021, Iss 1, Pp 1-17 (2021)
Publication Year :
2021
Publisher :
Springer Science and Business Media LLC, 2021.

Abstract

This paper reviews recent research works in infant cry signal analysis and classification tasks. A broad range of literatures are reviewed mainly from the aspects of data acquisition, cross domain signal processing techniques, and machine learning classification methods. We introduce pre-processing approaches and describe a diversity of features such as MFCC, spectrogram, and fundamental frequency, etc. Both acoustic features and prosodic features extracted from different domains can discriminate frame-based signals from one another and can be used to train machine learning classifiers. Together with traditional machine learning classifiers such as KNN, SVM, and GMM, newly developed neural network architectures such as CNN and RNN are applied in infant cry research. We present some significant experimental results on pathological cry identification, cry reason classification, and cry sound detection with some typical databases. This survey systematically studies the previous research in all relevant areas of infant cry and provides an insight on the current cutting-edge works in infant cry signal analysis and classification. We also propose future research directions in data processing, feature extraction, and neural network classification fields to better understand, interpret, and process infant cry signals.

Details

ISSN :
16874722
Volume :
2021
Database :
OpenAIRE
Journal :
EURASIP Journal on Audio, Speech, and Music Processing
Accession number :
edsair.doi.dedup.....9654436f617c0889f08b720ff902411b
Full Text :
https://doi.org/10.1186/s13636-021-00197-5