Search

Your search keyword '"Xie, Weidi"' showing total 365 results

Search Constraints

Start Over You searched for: Author "Xie, Weidi" Remove constraint Author: "Xie, Weidi"
365 results on '"Xie, Weidi"'

Search Results

1. LoRKD: Low-Rank Knowledge Decomposition for Medical Foundation Models

2. Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos

3. Can Visual Foundation Models Achieve Long-term Point Tracking?

4. Towards Evaluating and Building Versatile Large Language Models for Medicine

5. AutoRG-Brain: Grounded Report Generation for Brain MRI

6. AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description

7. EchoSight: Advancing Visual-Language Models with Wiki Knowledge

8. VISA: Reasoning Video Object Segmentation via Large Language Models

9. A Sanity Check for AI-generated Image Detection

10. MatchTime: Towards Automatic Soccer Game Commentary Generation

11. RaTEScore: A Metric for Radiology Report Generation

12. Improving Segment Anything on the Fly: Auxiliary Online Learning and Adaptive Fusion for Medical Image Segmentation

13. Made to Order: Discovering monotonic temporal changes via self-supervised video ordering

14. RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis

15. AutoAD III: The Prequel -- Back to the Pixels

16. Moving Object Segmentation: All You Need Is SAM (and Flow)

17. Knowledge-enhanced Visual-Language Pretraining for Computational Pathology

20. Towards Building Multilingual Language Model for Medicine

21. InstaGen: Enhancing Object Detection by Training on Synthetic Dataset

22. Synchformer: Efficient Synchronization from Sparse Cues

23. Retrieval-Augmented Egocentric Video Captioning

24. Amodal Ground Truth and Completion in the Wild

25. One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts

26. Large-scale Long-tailed Disease Diagnosis on Radiology Images

27. Multi-Sentence Grounding for Long-term Instructional Video

28. Appearance-Based Refinement for Object-Centric Motion Segmentation

29. Grounded Question-Answering in Long Egocentric Videos

30. VISA: Reasoning Video Object Segmentation via Large Language Models

31. Made to Order: Discovering Monotonic Temporal Changes via Self-supervised Video Ordering

32. Appearance-Based Refinement for Object-Centric Motion Segmentation

33. Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical Diagnosis

34. Self-supervised Object-Centric Learning for Videos

35. AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description

36. A General Protocol to Probe Large Vision Models for 3D Physical Understanding

37. Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning

38. UniBrain: Universal Brain MRI Diagnosis with Hierarchical Knowledge-enhanced Pre-training

39. The Making and Breaking of Camouflage

40. Diagnosing Human-object Interaction Detectors

41. Joint-Relation Transformer for Multi-Person Motion Prediction

42. Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data

43. Boost Video Frame Interpolation via Motion Adaptation

44. arXiVeri: Automatic table verification with GPT

45. Zero-shot Composed Text-Image Retrieval

46. Multi-Modal Classifiers for Open-Vocabulary Object Detection

47. Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models

49. Annotation-free Audio-Visual Segmentation

50. PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering

Catalog

Books, media, physical & digital resources