Author search: "Jung, Jee-weon" (196 results in total)


Search Results

1. ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

2. SpoofCeleb: Speech Deepfake Detection and SASV In The Wild

3. Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels

4. Text-To-Speech Synthesis In The Wild

5. The VoxCeleb Speaker Recognition Challenge: A Retrospective

6. ASVspoof 5: Crowdsourced Speech Data, Deepfakes, and Adversarial Attacks at Scale

7. Beyond Silence: Bias Analysis through Loss and Asymmetric Approach in Audio Anti-Spoofing

8. Disentangled Representation Learning for Environment-agnostic Speaker Recognition

9. On the Evaluation of Speech Foundation Models for Spoken Language Understanding

10. To what extent can ASV systems naturally defend against spoofing attacks?

11. Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement

12. a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification

13. TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

14. Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

15. ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models

16. OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

17. Improving Design of Input Condition Invariant Speech Enhancement

18. AugSumm: towards generalizable speech summarization using synthetic labels from large language model

19. Understanding Probe Behaviors through Variational Bounds of Mutual Information

20. UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions

21. One model to rule them all? Towards End-to-End Joint Speaker Diarization and Speech Recognition

22. Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation

23. Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data

24. VoxtLM: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks

25. Encoder-decoder multimodal speaker change detection

26. Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing

27. Towards single integrated spoofing-aware speaker verification embeddings

28. VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

29. Absolute decision corrupts absolutely: conservative online speaker diarisation

30. High-resolution embedding extractor for speaker diarisation

31. Disentangled representation learning for multilingual speaker recognition

32. In search of strong embedding extractors for speaker diarisation

33. Large-scale learning of generalised representations for speaker recognition

34. Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion

35. Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification

36. SASV 2022: The First Spoofing-Aware Speaker Verification Challenge

37. Curriculum learning for self-supervised speaker verification

38. Pushing the limits of raw waveform speaker recognition

39. Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

40. SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan

41. Advancing the dimensionality reduction of speaker embeddings for speaker diarisation: disentangling noise and informing speech activity

42. Multi-scale speaker embedding-based graph attention networks for speaker diarisation

43. AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks

44. End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection

45. Attentive max feature map and joint training for acoustic scene classification

46. Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System

47. Graph Attention Networks for Anti-Spoofing

48. Adapting Speaker Embeddings for Speaker Diarisation

49. Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural Network

50. Graph Attention Networks for Speaker Verification
