Your search for "Yuexian Zou" returned 185 results

Search Results

1. Diffsound: Discrete Diffusion Model for Text-to-Sound Generation

4. RR-Net: Relation Reasoning for End-to-End Human-Object Interaction Detection

6. Improving Weakly Supervised Sound Event Detection with Causal Intervention

10. Deep Motion Prior for Weakly-Supervised Temporal Action Localization

11. DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention

12. Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter- and Intra-modality Attention

15. Federated Learning for Vision-and-Language Grounding Problems

16. Modeling Label Dependencies for Audio Tagging With Graph Convolutional Network

20. RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection

22. Learning Human-Object Interaction via Interactive Semantic Reasoning

23. SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification

25. Contextualized Attention-Based Knowledge Transfer for Spoken Conversational Question Answering

26. Self-Supervised Dialogue Learning for Spoken Conversational Question Answering

27. MRD-Net: Multi-Modal Residual Knowledge Distillation for Spoken Question Answering

28. Sentiment Injected Iteratively Co-Interactive Network for Spoken Language Understanding

29. Knowledge Distillation for Improved Accuracy in Spoken Question Answering

30. FWB-Net: Front White Balance Network for Color Shift Correction in Single Image Dehazing Via Atmospheric Light Estimation

31. Adaptive Bi-Directional Attention: Exploring Multi-Granularity Representations for Machine Reading Comprehension

32. Contrastive Self-Supervised Learning for Text-Independent Speaker Verification

33. CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning

34. Improved Blind Timing Skew Estimation Based on Spectrum Sparsity and ApFFT in Time-Interleaved ADCs

35. GISCA: Gradient-Inductive Segmentation Network With Contextual Attention for Scene Text Detection

36. Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning

37. PIN: A Novel Parallel Interactive Network for Spoken Language Understanding

38. RR-Net: Injecting Interactive Semantics in Human-Object Interaction Detection

39. Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation

40. SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification

41. All You Need is a Second Look: Towards Arbitrary-Shaped Text Detection

42. A Mutual learning framework for Few-shot Sound Event Detection

43. Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification

44. SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection

45. A Global-local Attention Framework for Weakly Labelled Audio Tagging

48. Bridging the Gap between Vision and Language Domains for Improved Image Captioning

49. Cluster Attention Contrast for Video Anomaly Detection

50. ABC-NET: Avoiding Blocking Effect & Color Shift Network for Single Image Dehazing Via Restraining Transmission Bias
