Search

Your search keyword '"Raj, Bhiksha"' showing total 925 results

Search Constraints

Start Over You searched for: Author "Raj, Bhiksha" Remove constraint Author: "Raj, Bhiksha"
925 results on '"Raj, Bhiksha"'

Search Results

1. Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video

2. Tessellated Linear Model for Age Prediction from Voice

3. SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

4. XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation

5. Perturbation Ontology based Graph Attention Networks

6. MACE: Leveraging Audio for Evaluating Audio Captioning Systems

7. FLAASH: Flow-Attention Adaptive Semantic Hierarchical Fusion for Multi-Modal Tobacco Content Analysis

8. On the Diversity of Synthetic Data and its Impact on Training Large Language Models

9. What Do Speech Foundation Models Not Learn About Speech?

10. Improving Speaker Representations Using Contrastive Losses on Multi-scale Features

11. RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement

12. Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection

13. ImageFolder: Autoregressive Image Generation with Folded Tokens

14. Revisiting Acoustic Features for Robust ASR

15. ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

16. DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing

17. PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification

18. Efficient Autoregressive Audio Modeling via Next-Scale Prediction

19. Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?

20. Audio Entailment: Assessing Deductive Reasoning for Audio Understanding

21. SELM: Enhancing Speech Emotion Recognition for Out-of-Domain Scenarios

22. Emergent Interpretable Symbols and Content-Style Disentanglement via Variance-Invariance Constraints

23. uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation in Low-Data Regimes

24. From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking

25. ControlVAR: Exploring Controllable Visual Autoregressive Modeling

26. ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models

27. EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding

28. Fashion Image Retrieval with Occlusion

29. R-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations

30. Slight Corruption in Pre-training Data Makes Better Diffusion Models

31. Synergistic Global-space Camera and Human Reconstruction from Videos

32. Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features

33. Learning with Noisy Foundation Models

34. Speech Robust Bench: A Robustness Benchmark For Speech Recognition

35. $\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations

36. AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition

37. Evaluating and Improving Continual Learning in Spoken Language Understanding

38. Domain Adaptation for Contrastive Audio-Language Models

39. Customizable Perturbation Synthesis for Robust SLAM Benchmarking

40. A General Framework for Learning from Weak Supervision

41. On Catastrophic Inheritance of Large Foundation Models

42. PAM: Prompting Audio-Language Models for Audio Quality Assessment

43. AugSumm: towards generalizable speech summarization using synthetic labels from large language model

44. FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding

45. Token Prediction as Implicit Classification to Identify LLM-Generated Text

46. Pairwise Similarity Learning is SimPLE

47. Privacy-oriented manipulation of speaker representations

48. Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms

49. Continual Contrastive Spoken Language Understanding

50. Prompting Audios Using Acoustic Properties For Emotion Representation

Catalog

Books, media, physical & digital resources