Search

Your search keyword '"Zhong, Yiran"' showing total 166 results

Search Constraints

Start Over You searched for: Author "Zhong, Yiran" Remove constraint Author: "Zhong, Yiran"
166 results on '"Zhong, Yiran"'

Search Results

1. Label-anticipated Event Disentanglement for Audio-Visual Video Parsing

2. Scaling Laws for Linear Complexity Language Models

3. Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling

4. You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet

5. Unlocking the Secrets of Linear Complexity Sequence Model from A Unified Perspective

6. Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention

7. TAVGBench: Benchmarking Text to Audible-Video Generation

8. HGRN2: Gated Linear RNNs with State Expansion

9. Linear Attention Sequence Parallelism

10. CO2: Efficient Distributed Training with Full Communication-Computation Overlap

11. Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

12. Accelerating Toeplitz Neural Network with Constant-time Inference Complexity

13. Hierarchically Gated Recurrent Neural Network for Sequence Modeling

14. Multimodal Variational Auto-encoder based Audio-Visual Segmentation

15. Improving Audio-Visual Segmentation with Bidirectional Generation

16. Image-based Geolocalization by Ground-to-2.5D Map Matching

17. All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation

18. Contrastive Conditional Latent Diffusion for Audio-visual Segmentation

19. TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer

20. Exploring Transformer Extrapolation

21. Linearized Relative Positional Encoding

22. Joint Salient Object Detection and Camouflaged Object Detection via Uncertainty-aware Learning

23. Toeplitz Neural Network for Sequence Modeling

24. An Alternative to WSSS? An Empirical Study of the Segment Anything Model (SAM) on Weakly-Supervised Semantic Segmentation Problems

25. Fine-grained Audible Video Description

26. Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning

27. Improving Audio-Visual Video Parsing with Pseudo Visual Labels

28. Audio-Visual Segmentation with Semantics

29. The Devil in Linear Transformer

30. Linear Video Transformer with Feature Fixation

31. Neural Architecture Search on Efficient Transformers and Beyond

32. AMF: Adaptable Weighting Fusion with Multiple Fine-tuning for Image Classification

33. Deep Laparoscopic Stereo Matching with Transformers

34. Audio-Visual Segmentation

35. Vicinity Vision Transformer

36. Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective

37. Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition

38. Implicit Motion Handling for Video Camouflaged Object Detection

39. cosFormer: Rethinking Softmax in Attention

40. Transcribing Natural Languages for The Deaf via Neural Editing Programs

41. GETAM: Gradient-weighted Element-wise Transformer Attention Map for Weakly-supervised Semantic segmentation

42. MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation

43. Dense Uncertainty Estimation via an Ensemble-based Conditional Latent Variable Model

45. RGB-D Saliency Detection via Cascaded Mutual Information Minimization

46. Memory-Free Generative Replay For Class-Incremental Learning

47. Exploring Depth Contribution for Camouflaged Object Detection

48. Invertible Attention

49. Blind Motion Deblurring Super-Resolution: When Dynamic Spatio-Temporal Learning Meets Static Image Understanding

50. Deep Two-View Structure-from-Motion Revisited

Catalog

Books, media, physical & digital resources