Search

Your search keyword '"Adi, Yossi"' showing total 175 results

Search Constraints

Start Over You searched for: Author "Adi, Yossi" Remove constraint Author: "Adi, Yossi"
175 results on '"Adi, Yossi"'

Search Results

1. LAST: Language Model Aware Speech Tokenization

2. Latent Watermarking of Audio Generative Models

3. Audio Enhancement from Multiple Crowdsourced Recordings: A Simple and Effective Baseline

4. The Llama 3 Herd of Models

5. Discrete Flow Matching

6. Audio Conditioning for Music Generation via Discrete Bottleneck Features

7. A Language Modeling Approach to Diacritic-Free Hebrew TTS

8. HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing

9. Improving Visual Commonsense in Language Models via Multiple Image Generation

10. NAST: Noise Aware Speech Tokenization for Speech Language Models

11. Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation

12. The Interspeech 2024 Challenge on Speech Processing Using Discrete Units

13. An Independence-promoting Loss for Music Generation with Language Models

14. The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

15. Transformers are Multi-State RNNs

16. Masked Audio Generation using a Single Non-Autoregressive Transformer

17. Generative Spoken Language Model based on continuous word-sized audio tokens

18. Low-Resource Self-Supervised Learning with SSL-Enhanced TTS

19. Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

20. Code Llama: Open Foundation Models for Code

21. EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis

22. From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion

23. Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

24. Simple and Controllable Music Generation

25. Scaling Speech Technology to 1,000+ Languages

26. AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

27. Textually Pretrained Speech Language Models

28. Layer Collaboration in the Forward-Forward Algorithm

29. A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation

30. Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling

31. ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement

32. Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units

33. AERO: Audio Super Resolution in the Spectral Domain

34. I Hear Your True Colors: Image Guided Audio Generation

35. Audio Language Modeling using Perceptually-Guided Discrete Representations

36. High Fidelity Neural Audio Compression

37. On the Importance of Gradient Norm in PAC-Bayesian Bounds

38. Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling

39. AudioGen: Textually Guided Audio Generation

40. Deep Audio Waveform Prior

41. Unsupervised Symbolic Music Segmentation using Ensemble Temporal Prediction Errors

42. STOP: A dataset for Spoken Task Oriented Semantic Parsing

43. A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement

44. Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies

45. Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation

46. Probing phoneme, language and speaker information in unsupervised speech representations

47. Generative Spoken Dialogue Language Modeling

48. RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing

49. textless-lib: a Library for Textless Spoken Language Processing

50. Textless Speech-to-Speech Translation on Real Data

Catalog

Books, media, physical & digital resources