Search

Your search for author "Chung, Joon" returned 1,488 results.

Search Results

1. AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models

2. Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding

3. Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding

4. SpoofCeleb: Speech Deepfake Detection and SASV In The Wild

5. Text-To-Speech Synthesis In The Wild

6. The VoxCeleb Speaker Recognition Challenge: A Retrospective

7. Bridging the Gap between Audio and Text using Parallel-attention for User-defined Keyword Spotting

8. VoxSim: A perceptual voice similarity dataset

9. Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment

10. ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions

11. Disentangled Representation Learning for Environment-agnostic Speaker Recognition

12. Lightweight Audio Segmentation for Long-form Speech Translation

13. FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching

14. To what extent can ASV systems naturally defend against spoofing attacks?

15. Audio Mamba: Bidirectional State Space Model for Audio Representation Learning

16. Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

17. Towards Automated Movie Trailer Generation

18. Scaling Up Video Summarization Pretraining with Large Language Models

19. EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning

23. Can CLIP Help Sound Source Localization?

24. Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model

25. Rethinking Session Variability: Leveraging Session Embeddings for Session Robustness in Speaker Verification

26. VoiceLDM: Text-to-Speech with Environmental Context

27. TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning

28. SlowFast Network for Continuous Sign Language Recognition

29. Sound Source Localization is All about Cross-Modal Alignment

30. Let There Be Sound: Reconstructing High Quality Speech from Silent Videos

31. FlexiAST: Flexibility is What AST Needs

33. That's What I Said: Fully-Controllable Talking Face Generation

34. Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples

35. Self-Sufficient Framework for Continuous Sign Language Recognition

37. Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech

38. VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

39. MarginNCE: Robust Sound Localization with a Negative Margin

40. Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition

41. Metric Learning for User-defined Keyword Spotting

42. Disentangled representation learning for multilingual speaker recognition

43. In search of strong embedding extractors for speaker diarisation

44. Large-scale learning of generalised representations for speaker recognition

45. Method comparison and estimation of causal effects of insomnia on health outcomes in a survey sampled population

46. Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion

47. Curriculum learning for self-supervised speaker verification

48. Pushing the limits of raw waveform speaker recognition

49. Single-Nucleus RNA Sequencing Reveals Loss of Distal Convoluted Tubule 1 Renal Tubules in HIV Viral Protein R Transgenic Mice

50. VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge
