Search

Your search keyword '"Liu,Ting"' showing total 123 results

Search Constraints

Start Over You searched for: Author "Liu,Ting" Remove constraint Author: "Liu,Ting" Topic computer science - computer vision and pattern recognition Remove constraint Topic: computer science - computer vision and pattern recognition
123 results on '"Liu,Ting"'

Search Results

1. The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

2. Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation

3. From Elements to Design: A Layered Approach for Automatic Graphic Design Composition

4. Memory Efficient Matting with Adaptive Token Routing

5. Video Creation by Demonstration

6. Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model

7. Understanding Robustness of Parameter-Efficient Tuning for Image Classification

8. $\epsilon$-VAE: Denoising as Visual Decoding

9. Accelerating Diffusion Transformers with Token-wise Feature Caching

10. MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios

11. MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension

12. Reprojection Errors as Prompts for Efficient Scene Coordinate Regression

13. Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching

14. SAM-REF: Rethinking Image-Prompt Synergy for Refinement in Segment Anything

15. Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?

16. TextMastero: Mastering High-Quality Scene Text Editing in Diverse Languages and Styles

17. CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

18. Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

19. M$^2$IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension

20. On Discrete Prompt Optimization for Diffusion Models

21. PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

22. 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation

23. Semantic Segmentation on VSPW Dataset through Masked Video Consistency

24. 3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation

25. Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference

26. Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels

27. DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding

28. iDAT: inverse Distillation Adapter-Tuning

29. VideoPrism: A Foundational Visual Encoder for Video Understanding

30. Distilling Vision-Language Models on Millions of Videos

31. Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition

32. LAMM: Label Alignment for Multi-Modal Prompt Learning

33. GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction

34. DAP: Domain-aware Prompt Learning for Vision-and-Language Navigation

35. Efficient Cloud Pipelines for Neural Radiance Fields

36. Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation

37. Deep Deformable Models: Learning 3D Shape Abstractions with Part Consistency

38. A Parse-Then-Place Approach for Generating Graphic Layouts from Textual Descriptions

39. Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition

40. HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation

41. Fine-grained building roof instance segmentation based on domain adapted pretraining and composite dual-backbone

42. EGE-UNet: an Efficient Group Enhanced UNet for skin lesion segmentation

43. VideoGLUE: Video General Understanding Evaluation of Foundation Models

44. Lyapunov-Stable Deep Equilibrium Models

45. Learning Robust Visual-Semantic Embedding for Generalizable Person Re-identification

46. Monte Carlo Linear Clustering with Single-Point Supervision is Enough for Infrared Small Target Detection

47. Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding

48. Steering Prototypes with Prompt-tuning for Rehearsal-free Continual Learning

49. Unified Visual Relationship Detection with Vision and Language Models

50. Virtual Guidance as a Mid-level Representation for Navigation

Catalog

Books, media, physical & digital resources