Search

Your search keyword '"Wang, Zhangyang"' showing total 1,035 results

Search Constraints

Author: "Wang, Zhangyang"
Search Limiters: Available in Library Collection

Search Results

1. Know Where You're Uncertain When Planning with Multimodal Foundation Models: A Formal Framework

2. Chasing Better Deep Image Priors between Over- and Under-parameterization

3. Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design

4. Large Spatial Model: End-to-end Unposed Images to Semantic 3D

5. Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

6. AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models

7. Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis

8. Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild

9. On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability

10. LLM-PBE: Assessing Data Privacy in Large Language Models

11. All Against Some: Efficient Integration of Large Language Models for Message Passing in Graph Neural Networks

12. From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

13. Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

14. Expressive Gaussian Human Avatars from Monocular RGB Video

15. 4K4DGen: Panoramic 4D Generation at 4K Resolution

16. Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses

17. Flextron: Many-in-One Flexible Large Language Model

18. LoCoCo: Dropping In Convolutions for Long Context Compression

19. Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis

20. CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation

21. Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models

22. Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling

23. OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

24. DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting

25. MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements

26. InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds

27. Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D

28. Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study

29. Comp4D: LLM-Guided Compositional 4D Scene Generation

30. StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

31. Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

32. Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk

33. GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

34. Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding

35. Principled Architecture-aware Scaling of Hyperparameters

36. Take the Bull by the Horns: Hard Sample-Reweighted Continual Training Improves LLM Generalization

37. Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

38. Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community

39. Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference

40. LLaGA: Large Language and Graph Assistant

41. QuantumSEA: In-Time Sparse Exploration for Noise Adaptive Quantum Circuits

42. AGG: Amortized Generative 3D Gaussians for Single Image to 3D

43. VASE: Object-Centric Appearance and Shape Manipulation of Real Videos

44. SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity

45. Taming Mode Collapse in Score Distillation for Text-to-3D Generation

46. 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency

47. HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models

48. The Counterattack of CNNs in Self-Supervised Learning: Larger Kernel Size might be All You Need

49. Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields

50. Meta ControlNet: Enhancing Task Adaptation via Meta Learning
