17 results on '"Automatic age estimation"'
Search Results
2. Facial Age Estimation Using Machine Learning Techniques: An Overview.
- Author
-
ELKarazle, Khaled, Raman, Valliappan, and Then, Patrick
- Subjects
MACHINE learning ,HUMAN-computer interaction ,AGE ,ACCESS control ,ENGINEERING standards - Abstract
Automatic age estimation from facial images is an exciting machine learning topic that has attracted researchers' attention over the past several years. Numerous human–computer interaction applications, such as targeted marketing, content access control, or soft-biometrics systems, employ age estimation models to carry out secondary tasks such as user filtering or identification. Despite the vast array of applications that could benefit from automatic age estimation, building an automatic age estimation system comes with issues such as data disparity, the unique ageing pattern of each individual, and facial photo quality. This paper provides a survey on the standard methods of building automatic age estimation models, the benchmark datasets for building these models, and some of the latest proposed pieces of literature that introduce new age estimation methods. Finally, we present and discuss the standard evaluation metrics used to assess age estimation models. In addition to the survey, we discuss the identified gaps in the reviewed literature and present recommendations for future research. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
3. Facial Age Estimation Using Machine Learning Techniques: An Overview
- Author
-
Khaled ELKarazle, Valliappan Raman, and Patrick Then
- Subjects
automatic age estimation ,age estimation review ,deep learning ,facial recognition ,features extraction ,image processing ,Technology - Abstract
Automatic age estimation from facial images is an exciting machine learning topic that has attracted researchers’ attention over the past several years. Numerous human–computer interaction applications, such as targeted marketing, content access control, or soft-biometrics systems, employ age estimation models to carry out secondary tasks such as user filtering or identification. Despite the vast array of applications that could benefit from automatic age estimation, building an automatic age estimation system comes with issues such as data disparity, the unique ageing pattern of each individual, and facial photo quality. This paper provides a survey on the standard methods of building automatic age estimation models, the benchmark datasets for building these models, and some of the latest proposed pieces of literature that introduce new age estimation methods. Finally, we present and discuss the standard evaluation metrics used to assess age estimation models. In addition to the survey, we discuss the identified gaps in the reviewed literature and present recommendations for future research.
- Published
- 2022
- Full Text
- View/download PDF
4. Towards Accuracy Enhancement of Age Group Classification Using Generative Adversarial Networks.
- Author
-
ELKarazle, Khaled, Raman, Valliappan, and Then, Patrick
- Subjects
- *
GENERATIVE adversarial networks , *PROBABILISTIC generative models , *AGE groups , *CONVOLUTIONAL neural networks , *ELECTRONIC data processing , *MACHINE learning - Abstract
Age estimation models can be employed in many applications, including soft biometrics, content access control, targeted advertising, and many more. However, as some facial images are taken in unrestrained conditions, the quality relegates, which results in the loss of several essential ageing features. This study investigates how introducing a new layer of data processing based on a super-resolution generative adversarial network (SRGAN) model can influence the accuracy of age estimation by enhancing the quality of both the training and testing samples. Additionally, we introduce a novel convolutional neural network (CNN) classifier to distinguish between several age classes. We train one of our classifiers on a reconstructed version of the original dataset and compare its performance with an identical classifier trained on the original version of the same dataset. Our findings reveal that the classifier which trains on the reconstructed dataset produces better classification accuracy, opening the door for more research into building data-centric machine learning systems. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
5. Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks
- Author
-
Ruben Zazo, Phani Sankar Nidadavolu, Nanxin Chen, Joaquin Gonzalez-Rodriguez, and Najim Dehak
- Subjects
Automatic age estimation ,LSTM ,RNN ,DNN ,NIST ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Age estimation from speech has recently received increased interest as it is useful for many applications such as user-profiling, targeted marketing, or personalized call-routing. This kind of applications need to quickly estimate the age of the speaker and might greatly benefit from real-time capabilities. Long short-term memory (LSTM) recurrent neural networks (RNN) have shown to outperform state-of-the-art approaches in related speech-based tasks, such as language identification or voice activity detection, especially when an accurate real-time response is required. In this paper, we propose a novel age estimation system based on LSTM-RNNs. This system is able to deal with short utterances (from 3 to 10 s) and it can be easily deployed in a real-time architecture. The proposed system has been tested and compared with a state-of-the-art i-vector approach using data from NIST speaker recognition evaluation 2008 and 2010 data sets. Experiments on short duration utterances show a relative improvement up to 28% in terms of mean absolute error of this new approach over the baseline system.
- Published
- 2018
- Full Text
- View/download PDF
6. Aging Progression of Elderly People Using Image Morphing
- Author
-
Kumari, L. L. Gayani, Dharmaratne, Anuja T., Kim, Tai-hoon, editor, Adeli, Hojjat, editor, Ramos, Carlos, editor, and Kang, Byeong-Ho, editor
- Published
- 2011
- Full Text
- View/download PDF
7. A multi-agent system for the classification of gender and age from images.
- Author
-
González-Briones, Alfonso, Villarrubia, Gabriel, De Paz, Juan F., and Corchado, Juan M.
- Subjects
IMAGE processing ,RADIANCE ,SMART television devices ,PATTERN recognition systems ,REAL-time computing - Abstract
Highlights • Multi-agent system for the classification of people by their age and gender. • The system integrates different techniques for the acquisition and preprocessing of images. • Evaluation of different techniques (Fisherfaces, Eigenfaces, LBP, ANN) and their combination with filters (Gabor, Sobel). • The system operates in environments with uncontrolled conditions (brightness, contrast, saturation or sharpness). Abstract The automatic classification of human images on the basis of age range and gender can be used in audiovisual content adaptation for Smart TVs or marquee advertising. Knowledge about users is used by publishing agencies and departments regulating TV content; on the basis of this information (age, gender) they are able to provide content that suits the interests of users. To this end, the creation of a highly precise image pattern recognition system is necessary, this may be one of the greatest challenges faced by computer technology in the last decades. These recognition systems must apply different pattern recognition techniques, in order to distinct gender and age in the images. In this work, we propose a multi-agent system that integrates different techniques for the acquisition, preprocessing and processing of images for the classification of age and gender. The system has been tested in an office building. Thanks to the use of a multi-agent system which allows to apply different workflows simultaneously, the performance of different methods could be compared (each flow with a different configuration). Experimental results have confirmed that a good preprocessing stage is necessary if we want the classification methods to perform well (Fisherfaces, Eigenfaces, Local Binary Patterns, Multilayer perceptron). The Fisherfaces method has proved to be more effective than MLP and the training time was shorter. In terms of the classification of age, Fisherfaces offers the best results in comparison to the rest of the system's classifiers. The use of filters has allowed to reduce dimensionality, as a result the workload was reduced, a great advantage in a system that performs classification in real time. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
8. Age estimation from faces using deep learning:a comparative analysis
- Author
-
Othmani, A. (Alice), Taleb, A. R. (Abdul Rahman), Abdelkawy, H. (Hazem), Hadid, A. (Abdenour), Othmani, A. (Alice), Taleb, A. R. (Abdul Rahman), Abdelkawy, H. (Hazem), and Hadid, A. (Abdenour)
- Abstract
Automatic Age Estimation (AAE) has attracted attention due to the wide variety of possible applications. However, it is a challenging task because of the large variation of facial appearance and several other extrinsic and intrinsic factors. Most of the proposed approaches in the literature use hand-crafted features to encode ageing patterns. Deeply learned features extracted by Convolutional Neural Networks (CNNs) algorithms usually perform better than hand-crafted features. The main contribution of this paper is an extensive comparative analysis of several frameworks for real AAE based on deep learning architectures. Different well-known CNN architectures are considered and their performances are compared. MORPH, FG-NET, FACES, PubFig and CASIA-web Face datasets are used in our experiments. The robustness of the best deep estimator is evaluated under noise, expression changes, “crossing” ethnicity and “crossing” gender. The experimental results demonstrate the high performances of the popular CNNs frameworks against the state-of-art methods of automatic age estimation. A Layer-wise transfer learning evaluation is done to study the optimal number of layers to fine-tune on AAE task. An evaluation framework of Knowledge transfer from face recognition task across AAE is performed. We have made our best-performing CNNs models publicly available that would allow one to duplicate the results and for further research on the use of CNNs for AAE from face images.
- Published
- 2020
9. Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks
- Author
-
Najim Dehak, Joaquin Gonzalez-Rodriguez, Phani Sankar Nidadavolu, Nanxin Chen, and Ruben Zazo
- Subjects
Voice activity detection ,General Computer Science ,Language identification ,Computer science ,Speech recognition ,General Engineering ,020206 networking & telecommunications ,02 engineering and technology ,Speaker recognition ,RNN ,Recurrent neural network ,NIST ,Age estimation ,Automatic age estimation ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,General Materials Science ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,Electrical and Electronic Engineering ,LSTM ,lcsh:TK1-9971 ,DNN - Abstract
Age estimation from speech has recently received increased interest as it is useful for many applications such as user-profiling, targeted marketing, or personalized call-routing. This kind of applications need to quickly estimate the age of the speaker and might greatly benefit from real-time capabilities. Long short-term memory (LSTM) recurrent neural networks (RNN) have shown to outperform state-of-the-art approaches in related speech-based tasks, such as language identification or voice activity detection, especially when an accurate real-time response is required. In this paper, we propose a novel age estimation system based on LSTM-RNNs. This system is able to deal with short utterances (from 3 to 10 s) and it can be easily deployed in a real-time architecture. The proposed system has been tested and compared with a state-of-the-art i-vector approach using data from NIST speaker recognition evaluation 2008 and 2010 data sets. Experiments on short duration utterances show a relative improvement up to 28% in terms of mean absolute error of this new approach over the baseline system.
- Published
- 2018
10. Age estimation from faces using deep learning: A comparative analysis
- Author
-
Abdenour Hadid, Hazem Abdelkawy, Alice Othmani, and Abdul Rahman Taleb
- Subjects
comparative analysis ,Computer science ,convolutional neural network ,02 engineering and technology ,Convolutional neural network ,Facial recognition system ,deep ageing patterns learning ,Robustness (computer science) ,0202 electrical engineering, electronic engineering, information engineering ,business.industry ,Deep learning ,020207 software engineering ,Pattern recognition ,knowledge transfer ,Expression (mathematics) ,cross-domain age estimation ,Face (geometry) ,Automatic age estimation ,Signal Processing ,020201 artificial intelligence & image processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Noise (video) ,business ,Transfer of learning ,Software - Abstract
Automatic Age Estimation (AAE) has attracted attention due to the wide variety of possible applications. However, it is a challenging task because of the large variation of facial appearance and several other extrinsic and intrinsic factors. Most of the proposed approaches in the literature use hand-crafted features to encode ageing patterns. Deeply learned features extracted by Convolutional Neural Networks (CNNs) algorithms usually perform better than hand-crafted features. The main contribution of this paper is an extensive comparative analysis of several frameworks for real AAE based on deep learning architectures. Different well-known CNN architectures are considered and their performances are compared. MORPH, FG-NET, FACES, PubFig and CASIA-web Face datasets are used in our experiments. The robustness of the best deep estimator is evaluated under noise, expression changes, “crossing” ethnicity and “crossing” gender. The experimental results demonstrate the high performances of the popular CNNs frameworks against the state-of-art methods of automatic age estimation. A Layer-wise transfer learning evaluation is done to study the optimal number of layers to fine-tune on AAE task. An evaluation framework of Knowledge transfer from face recognition task across AAE is performed. We have made our best-performing CNNs models publicly available that would allow one to duplicate the results and for further research on the use of CNNs for AAE from face images.
- Published
- 2020
11. A multi-agent system for the classification of gender and age from images
- Author
-
Juan M. Corchado, Alfonso González-Briones, Gabriel Villarrubia, and Juan F. De Paz
- Subjects
Local binary patterns ,Computer science ,02 engineering and technology ,Machine learning ,computer.software_genre ,Facial recognition system ,0202 electrical engineering, electronic engineering, information engineering ,1203.17 Informática ,Preprocessing of images ,business.industry ,Multi-agent system ,020206 networking & telecommunications ,Content adaptation ,Facial recognition ,Automatic gender estimation ,Eigenface ,Multilayer perceptron ,Signal Processing ,Pattern recognition (psychology) ,Automatic age estimation ,020201 artificial intelligence & image processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,computer ,Software ,Computer technology - Abstract
The automatic classification of human images on the basis of age range and gender can be used in audiovisual content adaptation for Smart TVs or marquee advertising. Knowledge about users is used by publishing agencies and departments regulating TV content; on the basis of this information (age, gender) they are able to provide content that suits the interests of users. To this end, the creation of a highly precise image pattern recognition system is necessary, this may be one of the greatest challenges faced by computer technology in the last decades. These recognition systems must apply different pattern recognition techniques, in order to distinct gender and age in the images. In this work, we propose a multi-agent system that integrates different techniques for the acquisition, preprocessing and processing of images for the classification of age and gender. The system has been tested in an office building. Thanks to the use of a multi-agent system which allows to apply different workflows simultaneously, the performance of different methods could be compared (each flow with a different configuration). Experimental results have confirmed that a good preprocessing stage is necessary if we want the classification methods to perform well (Fisherfaces, Eigenfaces, Local Binary Patterns, Multilayer perceptron). The Fisherfaces method has proved to be more effective than MLP and the training time was shorter. In terms of the classification of age, Fisherfaces offers the best results in comparison to the rest of the system’s classifiers. The use of filters has allowed to reduce dimensionality, as a result the workload was reduced, a great advantage in a system that performs classification in real time.
- Published
- 2018
12. Age estimation from faces using deep learning: A comparative analysis.
- Author
-
Othmani, Alice, Taleb, Abdul Rahman, Abdelkawy, Hazem, and Hadid, Abdenour
- Subjects
CONVOLUTIONAL neural networks ,DEEP learning ,COMPARATIVE studies ,HUMAN facial recognition software ,KNOWLEDGE transfer ,FUSIFORM gyrus - Abstract
Automatic Age Estimation (AAE) has attracted attention due to the wide variety of possible applications. However, it is a challenging task because of the large variation of facial appearance and several other extrinsic and intrinsic factors. Most of the proposed approaches in the literature use hand-crafted features to encode ageing patterns. Deeply learned features extracted by Convolutional Neural Networks (CNNs) algorithms usually perform better than hand-crafted features. The main contribution of this paper is an extensive comparative analysis of several frameworks for real AAE based on deep learning architectures. Different well-known CNN architectures are considered and their performances are compared. MORPH, FG-NET, FACES, PubFig and CASIA-web Face datasets are used in our experiments. The robustness of the best deep estimator is evaluated under noise, expression changes, "crossing" ethnicity and "crossing" gender. The experimental results demonstrate the high performances of the popular CNNs frameworks against the state-of-art methods of automatic age estimation. A Layer-wise transfer learning evaluation is done to study the optimal number of layers to fine-tune on AAE task. An evaluation framework of Knowledge transfer from face recognition task across AAE is performed. We have made our best-performing CNNs models publicly available that would allow one to duplicate the results and for further research on the use of CNNs for AAE from face images. • An up-to-date literature overview of Age Estimation (AE) methods • A comparative analysis of several frameworks for AE based on deep learning • Different well-known CNNs are considered and their performances are compared • The robustness of the best deep AE is evaluated under intrinsic and extrinsic factors • An evaluation framework of Knowledge transfer for AAE is performed, [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
13. Client-specific A-stack model for adult face verification across ageing
- Author
-
Weifeng Li and Andrzej Drygajlo
- Subjects
Biometrics ,Computer science ,02 engineering and technology ,Machine learning ,computer.software_genre ,Principle component analysis ,Face verification ,0202 electrical engineering, electronic engineering, information engineering ,Stacked generalization ,Electrical and Electronic Engineering ,Automatic Age Estimation ,business.industry ,Aging progression ,Age progression ,020206 networking & telecommunications ,Classification ,Recognition ,Multiple baseline design ,Signal Processing ,Principal component analysis ,Decision boundary ,Local ternary patterns ,Images ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Classifier (UML) - Abstract
The problem of time validity of biometric models has received only a marginal attention from researchers. In this paper, we propose to manage the aging influence on the adult face verification system by an A-stack age modeling technique, which uses the age as a class-independent metadata quality measure together with scores from a single or multiple baseline classifiers, in order to obtain better face verification performance. This allows for improved long-term class separation by introducing a dynamically changing decision boundary across the age progression in the scores-age space using a short-term enrollment model. This new method, based on the concept of classifier stacking and age-aware decision boundary, compares favorably with the conventional face verification approach, which uses age-independent decision threshold calculated only in the score space at the time of enrollment. Our experiments on the YouTube and MORPH data show that the use of the proposed approach allows for improving the identification accuracy as opposed to the baseline classifier.
- Published
- 2011
- Full Text
- View/download PDF
14. Aging in biometrics: an experimental analysis on on-line signature
- Author
-
Javier Galbally, Julian Fierrez, Marcos Martinez-Diaz, UAM. Departamento de Tecnología Electrónica y de las Comunicaciones, and Análisis y Tratamiento de Voz y Señales Biométricas (ING EPS-002)
- Subjects
Aging ,Handwriting ,Anatomy and Physiology ,Databases, Factual ,Image Processing ,lcsh:Medicine ,Bioinformatics ,Social and Behavioral Sciences ,Facial recognition system ,Infographics ,Pattern Recognition, Automated ,Engineering ,Statistical Signal Processing ,Hidden Markov model ,lcsh:Science ,Aged, 80 and over ,Telecomunicaciones ,Multidisciplinary ,Middle Aged ,Signature (logic) ,Line (geometry) ,Automatic age estimation ,Trait ,Medicine ,Research Article ,Adult ,Biometry ,Biometrics ,Adolescent ,Verification systems ,Biology ,Pattern Recognition ,Neurological System ,Young Adult ,Humans ,Signature recognition ,Aged ,Motor Systems ,Population Biology ,business.industry ,lcsh:R ,Pattern recognition ,Communications ,Recognition ,Face (geometry) ,Signal Processing ,lcsh:Q ,Artificial intelligence ,business ,Physiological Processes - Abstract
Galbally J, Martinez-Diaz M, Fierrez J (2013) Aging in Biometrics: An Experimental Analysis on On-Line Signature. PLoS ONE 8(7): e69897. doi:10.1371/journal.pone.0069897, The first consistent and reproducible evaluation of the effect of aging on dynamic signature is reported. Experiments are carried out on a database generated from two previous datasets which were acquired, under very similar conditions, in 6 sessions distributed in a 15-month time span. Three different systems, representing the current most popular approaches in signature recognition, are used in the experiments, proving the degradation suffered by this trait with the passing of time. Several template update strategies are also studied as possible measures to reduce the impact of aging on the system’s performance. Different results regarding the way in which signatures tend to change with time, and their most and least stable features, are also given., This work has been partially supported by projects Contexts (S2009/TIC-1485) from CAM, by Bio-Challenge (TEC2009-11186) and Bio-Shield (TEC2012-34881) from Spanish MECD, by Cátedra UAM-Telefónica, and by the Spanish Dirección General de la Guardia Civil.
- Published
- 2013
15. Learning from Facial Aging Patterns for Automatic Age Estimation
- Author
-
Geng, Xin, Zhou, Zhihua, Zhang, Yu, Li, Gang, Dai, Honghua, Geng, Xin, Zhou, Zhihua, Zhang, Yu, Li, Gang, and Dai, Honghua
- Abstract
Age Specific Human-Computer Interaction (ASHCI) has vast potential applications in daily life. However, automatic age estimation technique is still underdeveloped. One of the main reasons is that the aging effects on human faces present several unique characteristics which make age estimation a challenging task that requires non-standard classification approaches. According to the speciality of the facial aging effects, this paper proposes the AGES (AGing pattErn Subspace) method for automatic age estimation. The basic idea is to model the aging pattern, which is defined as a sequence of personal aging face images, by learning a representative subspace. The proper aging pattern for an unseen face image is then determined by the projection in the subspace that can best reconstruct the face image, while the position of the face image in that aging pattern will indicate its age. The AGES method has shown encouraging performance in the comparative experiments either as an age estimator or as an age range estimator. Copyright 2006 ACM.
- Published
- 2006
16. Global And Local Feature Based Multi-Classifier A-Stack Model For Aging Face Identification
- Author
-
Weifeng Li and Andrzej Drygajlo
- Subjects
Biometrics ,Contextual image classification ,stacked generalization ,business.industry ,Computer science ,Local Ternary Patterns (LTPs) ,Pattern recognition ,Machine learning ,computer.software_genre ,Facial recognition system ,Face identification ,symbols.namesake ,Gaussian mixture model (GMM) ,Principal component analysis ,symbols ,Artificial intelligence ,business ,Classifier (UML) ,computer ,Gaussian process ,Principal Component Analysis (PCA) ,Automatic Age Estimation - Abstract
The problem of time validity of biometric models has received only a marginal attention from researchers. Actual and up-to-date at the time of their creation, extracted features and models relevant to a person's face may eventually become outdated, leading to a failure in the face identification task. If physical characteristics of the individual change over time, their classification model has to be updated. In this paper we present a mutli-classifier A-stack scheme, which is based on the concept of classifier stacking and makes use of the age information and scores of multiple baseline classifiers, in order to improve the identification performance during age progression. Our experiments on the MORPH database show that the use of the proposed multi-classifier stacking fusion allows for improving the identification accuracy as opposed to the baseline classifier and single-classifier A-stack method.
17. Speaker age estimation for elderly speech recognition in European Portuguese
- Author
-
Annika Hämäläinen, Vahid Hedayati, Miguel Sales Dias, Isabel Trancoso, Thomas Pellegrini, Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Université Toulouse III - Paul Sabatier (UT3), Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa (INESC-ID), Instituto Superior Técnico, Universidade Técnica de Lisboa (IST)-Instituto de Engenharia de Sistemas e Computadores (INESC), Universidade de Lisboa = University of Lisbon (ULISBOA), Instituto Universitário de Lisboa (ISCTE-IUL), FCT Fundacao para a Cîencia e a Tecnologia, under projects PTDC/EEA-PLP/121111/2010, PTDC/EIA-CCO/122542/2010, and PEst-OE/EEI/LA0021/2013, QREN 5329 Fala Global project, which is co-funded by Microsoft and the European Structural Funds for Portugal (FEDER) through POR Lisboa (Regional Operational Programme of Lisbon), as part of the National Strategic Reference Framework (QREN), the national program of incentives for Portuguese businesses and industry, International Speech Communication Association (ISCA), European Project: 611396,EC:FP7:ICT,FP7-ICT-2013-SME-DCA,SPEDIAL(2013), Chng E.S. , Li H., Meng H., Ma B. and Xie L, Centre National de la Recherche Scientifique - CNRS (FRANCE), Instituto de Engenharia de Sistemas e Computadores - Investigação e Desenvolvimento - INESC-ID (PORTUGAL), Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE), Instituto Universitario de Lisboa - ISCTE IUL (PORTUGAL), Université Toulouse III - Paul Sabatier - UT3 (FRANCE), Université Toulouse - Jean Jaurès - UT2J (FRANCE), Université Toulouse 1 Capitole - UT1 (FRANCE), Microsoft (USA), and Universidade de Lisboa - ULisboa (PORTUGAL)
- Subjects
Speech production ,Computer science ,Speech recognition ,Automatic speech recognition ,[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV] ,Vision par ordinateur et reconnaissance de formes ,Intelligence artificielle ,Elderly speech ,Ciências Naturais::Ciências Físicas [Domínio/Área Científica] ,I-vector extraction ,[INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR] ,language.human_language ,[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI] ,Traitement des images ,[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing ,European Portuguese ,Transcription (linguistics) ,Age estimation ,[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV] ,Automatic age estimation ,language ,Traitement du signal et de l'image ,Synthèse d'image et réalité virtuelle - Abstract
Phone-like acoustic models (AMs) used in large-vocabulary automatic speech recognition (ASR) systems are usually trained with speech collected from young adult speakers. Using such models, ASR performance may decrease by about 10% absolute when transcribing elderly speech. Ageing is known to alter speech production in ways that require ASR systems to be adapted, in particular at the level of acoustic modeling. In this study, we investigated automatic age estimation in order to select age-specific adapted AMs. A large corpus of read speech from European Portuguese speakers aged 60 or over was used. Age estimation (AE) based on i-vectors and support vector regression achieved mean error rates of about 4.2 and 4.5 years for males and females, respectively. Compared with a baseline ASR system with AMs trained using young adult speech and a WER of 13.9%, the selection of five-year-range adapted AMs, based on the estimated age of the speakers, led to a decrease in WER of about 9.3% relative (1.3% absolute). Comparable gains in ASR performance were observed when considering two larger age ranges (60-75 and 76-90) instead of six five-year ranges, suggesting that it would be sufficient to use the two large ranges only. info:eu-repo/semantics/acceptedVersion
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.