Author: "Elmar Nöth" / Publisher: springer science and business media llc - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Elmar Nöth"' showing total 8 results

Start Over Author "Elmar Nöth" Publisher springer science and business media llc

8 results on '"Elmar Nöth"'

1. Multi-channel spectrograms for speech processing applications using deep learning methods

Author: Juan Camilo Vásquez-Correa, Philipp Klumpp, Juan Rafael Orozco-Arroyave, Maria Schuster, Elmar Nöth, and Tomas Arias-Vergara
Subjects: Audio signal, business.industry, Computer science, Speech recognition, Deep learning, Speech processing, Convolutional neural network, Convolution, 030507 speech-language pathology & audiology, 03 medical and health sciences, 0302 clinical medicine, Recurrent neural network, Artificial Intelligence, ddc:000, Feature (machine learning), Spectrogram, Computer Vision and Pattern Recognition, Artificial intelligence, 030223 otorhinolaryngology, 0305 other medical science, business
Abstract: Time–frequency representations of the speech signals provide dynamic information about how the frequency component changes with time. In order to process this information, deep learning models with convolution layers can be used to obtain feature maps. In many speech processing applications, the time–frequency representations are obtained by applying the short-time Fourier transform and using single-channel input tensors to feed the models. However, this may limit the potential of convolutional networks to learn different representations of the audio signal. In this paper, we propose a methodology to combine three different time–frequency representations of the signals by computing continuous wavelet transform, Mel-spectrograms, and Gammatone spectrograms and combining then into 3D-channel spectrograms to analyze speech in two different applications: (1) automatic detection of speech deficits in cochlear implant users and (2) phoneme class recognition to extract phone-attribute features. For this, two different deep learning-based models are considered: convolutional neural networks and recurrent neural networks with convolution layers.
Published: 2020

2. Nonlinear dynamics and Poincaré sections to model gait impairments in different stages of Parkinson’s disease

Author: Paula Andrea Pérez-Toro, Elmar Nöth, Juan Rafael Orozco-Arroyave, Juan Camilo Vásquez-Correa, and Tomas Arias-Vergara
Subjects: Correlation dimension, medicine.medical_specialty, Parkinson's disease, Computer science, Quantitative Biology::Tissues and Organs, Physics::Medical Physics, Aerospace Engineering, Ocean Engineering, Lyapunov exponent, Kinematics, 01 natural sciences, Computer Science::Robotics, symbols.namesake, Gait (human), Physical medicine and rehabilitation, 0103 physical sciences, medicine, Electrical and Electronic Engineering, 010301 acoustics, Hurst exponent, Applied Mathematics, Mechanical Engineering, medicine.disease, Nonlinear system, Control and Systems Engineering, Gait analysis, symbols
Abstract: Parkinson’s disease is a progressive neurological disorder that affects the motor system and produces several problems to control muscles and limbs. One of the major manifestations of the disease appears in gait and typically causes disability in the most advanced stages of the disease. Gait assessment is suitable for the diagnosis and monitoring of the neurological state of the patients. Gait is mainly evaluated from signals collected with inertial sensors attached to the limbs, and kinematics features are commonly extracted. Besides the classical kinematic methods, there are nonlinear phenomena in the gait process that cannot be properly modeled with kinematic features. This study proposes the use of several nonlinear dynamics features to assess gait impairments of Parkinson’s patients. We consider classical nonlinear features including correlation dimension, largest Lyapunov exponent, Hurst exponent, and others. In addition, we propose a novel nonlinear analysis based on the construction of Gaussian mixture models to find clusters in Poincare sections. The nonlinear dynamics features proposed here are used to discriminate between Parkinson’s patients and healthy subjects, and to classify patients in different stages of the disease. To the best of our knowledge, this is the first study that considers nonlinear dynamics analysis including Poincare sections to assess gait impairments of patients with Parkinson’s disease.
Published: 2020

3. Perzeptive und maschinelle Stimm- und Sprechanalyse bei chronischer Laryngitis und T1-Stimmlippenkarzinom

Author: Michael Döllinger, Ulrich Eysholdt, Elmar Nöth, A. Ziethe, B. Bartke, S. Graf, and Tino Haderlein
Subjects: medicine.medical_specialty, VOCAL CORD CARCINOMA, business.industry, media_common.quotation_subject, Laryngitis, Vocal cord cancer, Sound Spectrography, Audiology, medicine.disease, Otorhinolaryngology, Perception, Speech Production Measurement, otorhinolaryngologic diseases, medicine, business, Chronic laryngitis, media_common
Abstract: Patients with chronic laryngitis and T1 vocal cord cancer were compared using perceptual and text-based objective voice and speech analyses in order to determine which group is more affected in its ability to communicate and whether a distinction between the two pathologies is possible.In all, 13 patients with histologically proven chronic laryngitis and 13 patients with T1 vocal cord cancer were compared perceptually by five speech therapists on the basis of seven criteria and objectively by a speech recognition system and prosodic analysis.Both, the data of the five speech therapists and the results of the automatic analysis revealed no significant differences between the two patient groups.A distinction between chronic laryngitis and T1 vocal cord carcinoma by mere voice and speech analysis is not possible, because the patient groups do not show significant differences in their voice quality.
Published: 2013

4. Automated postlaryngectomy telephone test

Author: Frank Rosanowski, Korbinian Riedhammer, Tino Haderlein, Andreas Maier, Ulrich Eysholdt, and Elmar Nöth
Subjects: Speech Recognition Software, medicine.medical_specialty, Otorhinolaryngology, business.industry, Head and neck surgery, medicine, Audiology, business
Abstract: Hintergrund In dieser Studie wird ein objektives Verfahren fur die Verstandlichkeitsmessung mit dem Postlaryngektomie-Telefontest (PLTT) mittels automatischer Spracherkennungstechnik beschrieben.
Published: 2009

5. Numeric quantification of intelligibility in schoolchildren with isolated and combined cleft palate

Author: Elmar Nöth, Emeka Nkenke, B. Vogt, Andreas Maier, Anton Batliner, Ulrich Eysholdt, and Maria Schuster
Subjects: Gynecology, medicine.medical_specialty, School age child, Otorhinolaryngology, business.industry, medicine, Head and neck surgery, Congenital cleft, ddc:004, Congenital disease, business
Abstract: Hintergrund Spaltfehlbildungen konnen trotz adaquater Behandlung funktionelle Beeintrachtigungen, z. B. Lautbildungsstorungen, verursachen. Diese variieren individuell stark. Typisch sind z. B. veranderte nasale Luftfuhrung und verlagerte Artikulation, die zu einer verminderten Verstandlichkeit fuhren. Ein Zusammenhang zwischen der Art der Spaltfehlbildung und der Verstandlichkeit konnte bisher nur durch kategoriale, subjektive Bewertungen beschrieben werden. In dieser Studie wird die spaltabhangige Verstandlichkeit erstmals objektiv und numerisch mittels automatischer Spracherkennungstechnik quantifiziert.
Published: 2007

6. MOBSY: Integration of vision and dialogue in service robots

Author: Elmar Nöth, Benno Heigl, Matthias Zobel, Joachim Denzler, Jochen Schmidt, Dietrich Paulus, and Georg Stemmer
Subjects: Focus (computing), Service (systems architecture), Research areas, business.industry, Computer science, Robotics, Computer Science Applications, Task (project management), Mobile service robot, Hardware and Architecture, Human–computer interaction, Pattern recognition (psychology), Robot, Computer vision, Computer Vision and Pattern Recognition, Artificial intelligence, business, Software
Abstract: This contribution introduces MOBSY, a fully integrated, autonomous mobile service robot system. It acts as an automatic dialogue-based receptionist for visitors to our institute. MOBSY incorporates many techniques from different research areas into one working stand-alone system. The techniques involved range from computer vision over speech understanding to classical robotics. Along with the two main aspects of vision and speech, we also focus on the integration aspect, both on the methodological and on the technical level. We describe the task and the techniques involved. Finally, we discuss the experiences that we gained with MOBSY during a live performance at our institute.
Published: 2003

7. Spracherkennung — Stand der Technik, Einsatzmöglichkeiten und Perspektiven

Author: Heinrich Niemann, Elmar Nöth, and Florian Gallwitz
Subjects: Engineering, Dictation, business.industry, Speech recognition, Pattern recognition (psychology), Subject (philosophy), business, Everyday life, Data science, Automation, Natural language, Information Systems
Abstract: Automatic speech recognition and understanding has been subject to intensive research for several decades. A few years ago, the first commercial applications for the technology resulting from these efforts began to emerge. It is now obvious that speech recognition technology will soon play an important role in everyday life and also in business. In this paper, we outline the basic principles of speech recognition technology and analyze its potentials and also its limitations We discuss current and possible future applications, ranging from simple command control systems and dictation systems to natural language dialogue and translation systems.
Published: 1999

8. Atypical Speech

Author: Elmar Nöth, Georg Stemmer, and Vijay Parsa
Subjects: Acoustics and Ultrasonics, Computer science, lcsh:Acoustics. Sound, lcsh:QC221-246, lcsh:Electronic computers. Computer science, Electrical and Electronic Engineering, lcsh:QA75.5-76.95
Published: 2010

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

8 results on '"Elmar Nöth"'

1. Multi-channel spectrograms for speech processing applications using deep learning methods

2. Nonlinear dynamics and Poincaré sections to model gait impairments in different stages of Parkinson’s disease

3. Perzeptive und maschinelle Stimm- und Sprechanalyse bei chronischer Laryngitis und T1-Stimmlippenkarzinom

4. Automated postlaryngectomy telephone test

5. Numeric quantification of intelligibility in schoolchildren with isolated and combined cleft palate

6. MOBSY: Integration of vision and dialogue in service robots

7. Spracherkennung — Stand der Technik, Einsatzmöglichkeiten und Perspektiven

8. Atypical Speech

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

8 results on '"Elmar Nöth"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources