Back to Search Start Over

A Speech Command Control-Based Recognition System for Dysarthric Patients Based on Deep Learning Technology

Authors :
Yu-Yi Lin
Wei-Zhong Zheng
Wei Chung Chu
Ji-Yan Han
Ying-Hsiu Hung
Guan-Min Ho
Chia-Yuan Chang
Ying-Hui Lai
Source :
Applied Sciences, Vol 11, Iss 6, p 2477 (2021)
Publication Year :
2021
Publisher :
MDPI AG, 2021.

Abstract

Voice control is an important way of controlling mobile devices; however, using it remains a challenge for dysarthric patients. Currently, there are many approaches, such as automatic speech recognition (ASR) systems, being used to help dysarthric patients control mobile devices. However, the large computation power requirement for the ASR system increases implementation costs. To alleviate this problem, this study proposed a convolution neural network (CNN) with a phonetic posteriorgram (PPG) speech feature system to recognize speech commands, called CNN–PPG; meanwhile, the CNN model with Mel-frequency cepstral coefficient (CNN–MFCC model) and ASR-based systems were used for comparison. The experiment results show that the CNN–PPG system provided 93.49% accuracy, better than the CNN–MFCC (65.67%) and ASR-based systems (89.59%). Additionally, the CNN–PPG used a smaller model size comprising only 54% parameter numbers compared with the ASR-based system; hence, the proposed system could reduce implementation costs for users. These findings suggest that the CNN–PPG system could augment a communication device to help dysarthric patients control the mobile device via speech commands in the future.

Details

Language :
English
ISSN :
20763417
Volume :
11
Issue :
6
Database :
Directory of Open Access Journals
Journal :
Applied Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.94b39d62814018ba9bb7b7c77e58cc
Document Type :
article
Full Text :
https://doi.org/10.3390/app11062477