Search

Your search for author "Wang, Xintao" returned 789 results.

Search Results

1. NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model

2. Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos

3. CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities

4. Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models

5. MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions

6. MINDECHO: Role-Playing Language Agents for Key Opinion Leaders

7. Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation

8. Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language Models by Learning from Knowledge Graphs

9. Capturing Minds, Not Just Words: Enhancing Role-Playing Language Models with Personality-Indicative Data

10. Image Conductor: Precision Control for Interactive Video Synthesis

11. Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models

12. Teaching Large Language Models to Express Knowledge Boundary from Their Own Signals

13. VideoTetris: Towards Compositional Text-to-Video Generation

14. MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model

15. ToonCrafter: Generative Cartoon Interpolation

16. ReVideo: Remake a Video with Motion and Content Control

17. From Persona to Personalization: A Survey on Role-Playing Language Agents

18. Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works

19. Character is Destiny: Can Role-Playing Language Agents Make Persona-Driven Decisions?

20. InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

21. SurveyAgent: A Conversational System for Personalized and Efficient Research Survey

24. SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model

25. BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion

26. Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

27. Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

28. DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing

29. Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild

30. ConcEPT: Concept-Enhanced Pre-Training for Language Models

31. DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment

32. Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

33. DynamiCrafter: Animating Open-Domain Images with Video Diffusion Priors

34. SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

35. PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

36. AnimateZero: Video Diffusion Models are Zero-Shot Image Animators

37. MotionCtrl: A Unified and Flexible Motion Controller for Video Generation

38. X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

39. StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter

40. Source Prompt: Coordinated Pre-training of Language Models on Diverse Corpora from Multiple Sources

41. VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

42. CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models

43. InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews

44. New Boolean satisfiability problem heuristic strategy: Minimal Positive Negative Product Strategy

45. FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling

46. DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

47. EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

48. Unifying Image Processing as Visual Prompting Question Answering

49. ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

50. Making LLaMA SEE and Draw with SEED Tokenizer
