Search

Your search keyword '"Song, Yibing"' showing total 373 results

Search Constraints

Start Over You searched for: Author "Song, Yibing" Remove constraint Author: "Song, Yibing"
373 results on '"Song, Yibing"'

Search Results

1. LLaVA-o1: Let Vision Language Models Reason Step-by-Step

2. Aligning Audio-Visual Joint Representations with an Agentic Workflow

3. LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization

4. Dynamic Diffusion Transformer

5. Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

6. A Causal Inspired Early-Branching Structure for Domain Generalization

8. HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation

9. InstructDET: Diversifying Referring Object Detection with Generalized Instructions

10. Speed Co-Augmentation for Unsupervised Audio-Visual Pre-training

11. Domain Generalization via Rationale Invariance

12. Advancing Visual Grounding with Scene Knowledge: Benchmark and Method

13. Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation

14. Evolving Semantic Prototype Improves Generative Zero-Shot Learning

15. Efficient Video Action Detection with Token Dropout and Context Refinement

16. Improved Test-Time Adaptation for Domain Generalization

17. Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning

18. CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection

19. Human MotionFormer: Transferring Human Motions with Vision Transformers

20. Image Inpainting via Iteratively Decoupled Probabilistic Modeling

21. Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint

22. DiffusionDet: Diffusion Model for Object Detection

23. One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations

24. AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition

25. VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

26. Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection

27. Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations

28. DynaMixer: A Vision MLP Architecture with Dynamic Mixing

29. MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning

30. TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning

31. Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning

34. PD-GAN: Probabilistic Diverse GAN for Image Inpainting

35. ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows

36. IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking

37. DeFLOCNet: Deep Image Editing via Flexible Low-level Controls

38. Disentangled Cycle Consistency for Highly-realistic Virtual Try-On

39. VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples

40. Stabilized Medical Image Attacks

41. Parser-Free Virtual Try-on via Distilling Appearance Flows

43. Rethinking Image Deraining via Rain Streaks and Vapors

44. Unsupervised Deep Representation Learning for Real-Time Tracking

45. Robust Tracking against Adversarial Attacks

46. Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations

50. Self-supervised Learning of Detailed 3D Face Reconstruction

Catalog

Books, media, physical & digital resources