Search

Showing total 263 results

Search Constraints

Start Over You searched for: Topic speech perception Remove constraint Topic: speech perception Journal speech communication Remove constraint Journal: speech communication Database Academic Search Index Remove constraint Database: Academic Search Index
263 results

Search Results

1. Analysis-by-synthesis based training target extraction of the DNN for noise masking.

2. Review of analysis methods for speech applications.

3. Analyzing the influence of different speech data corpora and speech features on speech emotion recognition: A review.

4. Native language identification for Indian-speakers by an ensemble of phoneme-specific, and text-independent convolutions.

5. Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring.

6. Controllable speech synthesis by learning discrete phoneme-level prosodic representations.

7. Accurate synthesis of dysarthric Speech for ASR data augmentation.

8. Decoupled structure for improved adaptability of end-to-end models.

9. Arabic Automatic Speech Recognition: Challenges and Progress.

10. A unified system for multilingual speech recognition and language identification.

11. Low-resource automatic speech recognition and error analyses of oral cancer speech.

12. Perceptual effects of interpolated Austrian and German standard varieties.

13. Automatic assessment of English proficiency for Japanese learners without reference sentences based on deep neural network acoustic models.

14. Uneven success: automatic speech recognition and ethnicity-related dialects.

15. Multilingual speech recognition for GlobalPhone languages.

16. Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion.

17. Unsupervised Automatic Speech Recognition: A review.

18. Neural speech-rate conversion with multispeaker WaveNet vocoder.

19. Pronunciation error detection model based on feature fusion.

20. A two-level Item Response Theory model to evaluate speech synthesis and recognition.

21. Data augmentation based non-parallel voice conversion with frame-level speaker disentangler.

22. Seamless equal accuracy ratio for inclusive CTC speech recognition.

23. Progress of machine learning based automatic phoneme recognition and its prospect.

24. A study on the perception of prosodic cues to focus by Egyptian listeners: Some make use of them, but most of them don't.

25. Data augmentation using generative adversarial networks for robust speech recognition.

26. The Hearing-Aid Speech Perception Index (HASPI) Version 2.

27. Factorized and progressive knowledge distillation for CTC-based ASR models.

28. Visual-articulatory cues facilitate children with CIs to better perceive Mandarin tones in sentences.

29. The effect of musical expertise on whistled vowel identification.

30. Language fusion via adapters for low-resource speech recognition.

31. Monaural multi-talker speech recognition using factorial speech processing models.

32. An Iterative Graph Spectral Subtraction Method for Speech Enhancement.

33. A review of multi-objective deep learning speech denoising methods.

34. Subspace Gaussian mixture based language modeling for large vocabulary continuous speech recognition.

35. Perceptual motivation for rhotics as a class.

36. Performance of single-channel speech enhancement algorithms on Mandarin listeners with different immersion conditions in New Zealand English.

37. Dithering techniques in automatic recognition of speech corrupted by MP3 compression: Analysis, solutions and experiments.

38. Speaker recognition using PCA-based feature transformation.

39. Advances in phase-aware signal processing in speech communication.

40. The Role of Auditory and Visual Cues in the Perception of Mandarin Emotional Speech in Male Drug Addicts.

41. Deep feature for text-dependent speaker verification.

42. Robust speech recognition in reverberant environments by using an optimal synthetic room impulse response model.

43. Fusion of bottleneck, spectral and modulation spectral features for improved speaker verification of neutral and whispered speech.

44. Sequence discriminative training for deep learning based acoustic keyword spotting.

45. Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space.

46. Automatic context window composition for distant speech recognition.

47. Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis.

48. Audiovisual perception of gemination and pharyngealization in Arabic.

49. Introduction to the special issue on auditory-visual expressive speech and gesture in humans and machines.

50. Speech excitation signal recovering based on a novel error mitigation scheme under erasure channel conditions.