Search

Your search keyword '"Dehak, Najim"' showing total 306 results

Search Constraints

Start Over You searched for: Author "Dehak, Najim" Remove constraint Author: "Dehak, Najim"
306 results on '"Dehak, Najim"'

Search Results

1. Noise-robust Speech Separation with Fast Generative Correction

2. Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification

3. Time Scale Network: A Shallow Neural Network For Time Series Data

4. DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction

5. Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning

6. DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model

7. Regularizing Contrastive Predictive Coding for Speech Applications

8. Stabilized training of joint energy-based models and their practical applications

9. Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition

10. Time-domain speech super-resolution with GAN based modeling for telephony speaker verification

11. Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech

12. Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations

13. Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser

14. AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification

15. Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification

16. Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition

17. Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition

18. Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding

19. The JHU submission to VoxSRC-21: Track 3

20. Beyond Isolated Utterances: Conversational Emotion Recognition

21. Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios

22. Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems

23. What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition

24. WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

25. Balanced End-to-End Monolingual pre-training for Low-Resourced Indic Languages Code-Switching Speech Recognition

26. Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation

27. Deep Feature CycleGANs: Speaker Identity Preserving Non-parallel Microphone-Telephone Domain Adaptation for Speaker Verification

29. Adversarial Attacks and Defenses for Speech Recognition Systems

30. Study of Pre-processing Defenses against Adversarial Attacks on State-of-the-art Speaker Recognition Systems

31. Focus on the present: a regularization method for the ASR source-target attention layer

32. CopyPaste: An Augmentation Method for Speech Emotion Recognition

33. Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models

34. How Phonotactics Affect Multilingual and Zero-shot ASR Performance

35. Learning Speaker Embedding from Text-to-Speech

36. Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery

37. Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild

38. That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages

39. Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings?

40. x-vectors meet emotions: A study on dependencies between emotion and speaker recognition

41. Analysis of Deep Feature Loss based Enhancement for Speaker Verification

42. Speaker detection in the wild: Lessons learned from JSALT 2019

43. Listen and Fill in the Missing Letters: Non-Autoregressive Transformer for Speech Recognition

44. Deep neural networks for emotion recognition combining audio and transcripts

45. Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs

46. Feature Enhancement with Deep Feature Losses for Speaker Verification

47. Unsupervised Feature Enhancement for speaker verification

48. Hierarchical Transformers for Long Document Classification

50. rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method

Catalog

Books, media, physical & digital resources