Search

Your search keyword '"Wang, Wenwu"' showing total 2,103 results

Search Constraints

Start Over You searched for: Author "Wang, Wenwu" Remove constraint Author: "Wang, Wenwu"
2,103 results on '"Wang, Wenwu"'

Search Results

1. Effect of Top Al$_2$O$_3$ Interlayer Thickness on Memory Window and Reliability of FeFETs With TiN/Al$_2$O$_3$/Hf$_{0.5}$Zr$_{0.5}$O$_2$/SiO$_x$/Si (MIFIS) Gate Structure

2. PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

3. Differentiable Interacting Multiple Model Particle Filtering

4. FlowSep: Language-Queried Sound Separation with Rectified Flow Matching

5. Efficient Audio Captioning with Encoder-Level Knowledge Distillation

6. Universal Sound Separation with Self-Supervised Audio Masked Autoencoder

7. A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining

8. Sound-VECaps: Improving Audio Generation with Visual Enhanced Captions

9. Learning Retrieval Augmentation for Personalized Dialogue Generation

10. Selective Prompting Tuning for Personalized Conversations with LLMs

11. Text-Queried Target Sound Event Localization

12. Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Review

13. Impact of the Top SiO2 Interlayer Thickness on Memory Window of Si Channel FeFET with TiN/SiO2/Hf0.5Zr0.5O2/SiOx/Si (MIFIS) Gate Structure

14. Zero-Shot Audio Captioning Using Soft and Hard Prompts

15. Soundscape Captioning using Sound Affective Quality Network and Large Language Model

16. Regime Learning for Differentiable Particle Filters

17. SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

18. ComposerX: Multi-Agent Symbolic Music Composition with LLMs

19. T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

20. Impact of Top SiO2 interlayer Thickness on Memory Window of Si Channel FeFET with TiN/SiO2/Hf0.5Zr0.5O2/SiOx/Si (MIFIS) Gate Structure

21. WavCraft: Audio Editing and Generation with Large Language Models

23. Enlargement of Memory Window of Si Channel FeFET by Inserting Al2O3 Interlayer on Ferroelectric Hf0.5Zr0.5O2

24. Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction

25. Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

27. Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities

28. Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

29. First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation

30. Transformer-based Autoencoder with ID Constraint for Unsupervised Anomalous Sound Detection

31. CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing

32. Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification

33. Audio Visual Speaker Localization from EgoCentric Views

34. Synth-AC: Enhancing Audio Captioning with Synthetic Supervision

35. Retrieval-Augmented Text-to-Audio Generation

36. Hierarchical Metadata Information Constrained Self-Supervised Learning for Anomalous Sound Detection Under Domain Shift

37. AudioSR: Versatile Audio Super-resolution at Scale

38. Multimodal Fish Feeding Intensity Assessment in Aquaculture

39. Sparks of Large Audio Models: A Survey and Outlook

40. Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning

41. META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection

42. AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

43. Separate Anything You Describe

44. WavJourney: Compositional Audio Creation with Large Language Models

45. Exploring the Potential of Integrated Optical Sensing and Communication (IOSAC) Systems with Si Waveguides for Future Networks

46. Text-Driven Foley Sound Generation With Latent Diffusion Model

47. Knowledge Distillation for Efficient Audio-Visual Video Captioning

Catalog

Books, media, physical & digital resources