Search

Your search for "Computer Science - Sound" returned 36,620 results

Search Results

1. Investigating Neural Audio Codecs for Speech Language Model-Based Speech Generation

2. Development of the Listening in Spatialized Noise-Sentences (LiSN-S) Test in Brazilian Portuguese: Presentation Software, Speech Stimuli, and Sentence Equivalence

3. Searching for Effective Preprocessing Method and CNN-based Architecture with Efficient Channel Attention on Speech Emotion Recognition

4. MetaBGM: Dynamic Soundtrack Transformation For Continuous Multi-Scene Experiences With Ambient Awareness And Personalization

5. LAST: Language Model Aware Speech Tokenization

6. Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Cord Paralysis

7. Raw Speech Enhancement with Deep State Space Modeling

8. Estimating Indoor Scene Depth Maps from Ultrasonic Echoes

9. FireRedTTS: A Foundation Text-To-Speech Framework for Industry-Level Generative Speech Applications

10. SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints

11. Latent Watermarking of Audio Generative Models

12. Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model

13. Effects of Recording Condition and Number of Monitored Days on Discriminative Power of the Daily Phonotrauma Index

14. USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction

15. An Analysis of Linear Complexity Attention Substitutes with BEST-RQ

16. Efficient Extraction of Noise-Robust Discrete Units from Self-Supervised Speech Models

17. Training Universal Vocoders with Feature Smoothing-Based Augmentation Methods for High-Quality TTS Systems

18. NeuroSpex: Neuro-Guided Speaker Extraction with Cross-Modal Attention

19. CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research

20. Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP

21. MusicMamba: A Dual-Feature Modeling Approach for Generating Chinese Traditional Music with Modal Precision

22. Enhancing Code-Switching Speech Recognition with LID-Based Collaborative Mixture of Experts Model

23. The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge

24. vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders

25. Applications and Advances of Artificial Intelligence in Music Generation: A Review

26. Clustering of Indonesian and Western Gamelan Orchestras through Machine Learning of Performance Parameters

27. Activity-Guided Industrial Anomalous Sound Detection against Interferences

28. The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?

29. Reassessing Noise Augmentation Methods in the Context of Adversarial Speech

30. Steered Response Power-Based Direction-of-Arrival Estimation Exploiting an Auxiliary Microphone

31. USTC-KXDIGIT System Description for ASVspoof5 Challenge

32. Pureformer-VC: Non-parallel One-Shot Voice Conversion with Pure Transformer Blocks and Triplet Discriminative Training

33. STAB: Speech Tokenizer Assessment Benchmark

34. Speech Foundation Model Ensembles for the Controlled Singing Voice Deepfake Detection (CtrSVDD) Challenge 2024

35. LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech Enhancement

36. FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation

37. Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR

38. Interpretable Convolutional SyncNet

39. A Framework for Synthetic Audio Conversations Generation using Large Language Models

40. SoCodec: A Semantic-Ordered Multi-Stream Speech Codec for Efficient Language Model Based Text-to-Speech Synthesis

41. VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka

42. Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation

43. Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR

44. Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement

45. A multilingual training strategy for low resource Text to Speech

46. Suppressing Noise Disparity in Training Data for Automatic Pathological Speech Detection

47. EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance

48. Expanding on EnCLAP with Auxiliary Retrieval Model for Automated Audio Captioning

49. MMT-BERT: Chord-aware Symbolic Music Generation Based on Multitrack Music Transformer and MusicBERT

50. Dissecting Temporal Understanding in Text-to-Audio Retrieval
