Search

Your search keyword '"Raj, Bhiksha"' showing total 758 results

Search Constraints

Start Over You searched for: Author "Raj, Bhiksha" Remove constraint Author: "Raj, Bhiksha"
758 results on '"Raj, Bhiksha"'

Search Results

1. Efficient Autoregressive Audio Modeling via Next-Scale Prediction

2. Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?

3. Audio Entailment: Assessing Deductive Reasoning for Audio Understanding

4. SELM: Enhancing Speech Emotion Recognition for Out-of-Domain Scenarios

5. Emergent Interpretable Symbols and Content-Style Disentanglement via Variance-Invariance Constraints

6. From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking

7. ControlVAR: Exploring Controllable Visual Autoregressive Modeling

8. ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models

9. EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding

10. Slight Corruption in Pre-training Data Makes Better Diffusion Models

11. Synergistic Global-space Camera and Human Reconstruction from Videos

12. Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features

13. Learning with Noisy Foundation Models

14. $\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations

15. AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition

16. Evaluating and Improving Continual Learning in Spoken Language Understanding

17. Domain Adaptation for Contrastive Audio-Language Models

18. Customizable Perturbation Synthesis for Robust SLAM Benchmarking

19. A General Framework for Learning from Weak Supervision

20. On Catastrophic Inheritance of Large Foundation Models

21. PAM: Prompting Audio-Language Models for Audio Quality Assessment

22. FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding

23. Token Prediction as Implicit Classification to Identify LLM-Generated Text

24. Pairwise Similarity Learning is SimPLE

25. Privacy-oriented manipulation of speaker representations

26. Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms

27. Continual Contrastive Spoken Language Understanding

28. Prompting Audios Using Acoustic Properties For Emotion Representation

29. LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model

30. uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models

31. Completing Visual Objects via Bridging Generation and Segmentation

32. Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech

33. QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition

34. Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

35. Importance of negative sampling in weak label learning

36. Training Audio Captioning Models without Audio

37. Fixed Inter-Neuron Covariability Induces Adversarial Robustness

38. Training on Foveated Images Improves Robustness to Adversarial Attacks

39. The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features

40. Rethinking Voice-Face Correlation: A Geometry View

41. BASS: Block-wise Adaptation for Speech Summarization

42. UTOPIA: Unconstrained Tracking Objects without Preliminary Examination via Cross-Domain Adaptation

43. PaintSeg: Training-free Segmentation via Painting

44. Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments

45. Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations

46. GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content

47. FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding

48. Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms

49. Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms

50. Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session

Catalog

Books, media, physical & digital resources