Search for author "He, Zhenyu": 1,012 results

Search Results

1. LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

2. Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS

3. Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model

4. Exploiting Pre-trained Models for Drug Target Affinity Prediction with Nearest Neighbors

5. GRAPE: Generalizable and Robust Multi-view Facial Capture

6. Learning Spatial-Semantic Features for Robust Video Object Segmentation

7. Let the Code LLM Edit Itself When You Edit the Code

8. PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

9. 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation

10. Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models

11. Spatial-Temporal Multi-level Association for Video Object Segmentation

12. RTracker: Recoverable Tracking via PN Tree Structured Memory

13. Do Efficient Transformers Really Save Computation?

14. Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

15. REST: Retrieval-Based Speculative Decoding

17. Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models

18. Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation

19. Cross-Modality Proposal-guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection

20. CiteTracker: Correlating Image and Text for Visual Tracking

21. Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

22. ZeroPose: CAD-Model-based Zero-Shot Pose Estimation

24. Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation

25. Joint Visual Grounding and Tracking with Natural Language Specification

26. Audio2Gestures: Generating Diverse Gestures from Audio

27. Geo6D: Geometric Constraints Learning for 6D Pose Estimation

28. How Image Generation Helps Visible-to-Infrared Person Re-Identification?

29. An Interpretable Model With Forgetting Matrix For Deep Knowledge Tracing

30. Multi-Granularity Graph Pooling for Video-based Person Re-Identification

31. Pose-Aided Video-based Person Re-Identification via Recurrent Graph Convolutional Network

32. Towards Complete-View and High-Level Pose-based Gait Recognition

33. SSORN: Self-Supervised Outlier Removal Network for Robust Homography Estimation

34. Two-Stage Neural Contextual Bandits for Personalised News Recommendation

35. Global Tracking via Ensemble of Local Trackers

36. Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs

39. GuidedMix-Net: Semi-supervised Semantic Segmentation by Using Labeled Images as Reference

40. Active Learning for Deep Visual Tracking

41. Audio2Gestures: Generating Diverse Gestures from Speech Audio with Conditional Variational Autoencoders

42. Saliency-Associated Object Tracking

46. Self-Supervised Tracking via Target-Aware Data Synthesis

47. SiamCorners: Siamese Corner Networks for Visual Tracking
