Search

Your search keyword '"He, Xuehai"' showing total 43 results

Search Constraints

Start Over You searched for: Author "He, Xuehai" Remove constraint Author: "He, Xuehai"
43 results on '"He, Xuehai"'

Search Results

1. Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation

2. Mojito: Motion Trajectory and Intensity Control for Video Generation

3. EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

4. MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

5. Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA

6. FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation

7. Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning

8. MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens

9. LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

10. Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners

11. Multimodal Graph Transformer for Multimodal Question Answering

12. Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis

13. ComCLIP: Training-Free Compositional Image and Text Matching

14. CPL: Counterfactual Prompt Learning for Vision and Language Models

15. JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents

16. Improve the performance of CT-based pneumonia classification via source data reweighting

17. Parameter-efficient Model Adaptation for Vision Transformers

18. Learning by Ignoring, with Application to Domain Adaptation

19. Pathological Visual Question Answering

20. Transfer Learning or Self-supervised Learning? A Tale of Two Pretraining Paradigms

21. On the Generation of Medical Dialogues for COVID-19

22. MedDialog: Two Large-scale Medical Dialogue Datasets

23. COVID-CT-Dataset: A CT Scan Dataset about COVID-19

24. PathVQA: 30000+ Questions for Medical Visual Question Answering

25. Spatio-temporal Evolution of Interactive Coercing Relationship between Tourism Resource Development and Landscape Ecological Security: A Case Study of the Guizhou Section of Chishui River Basin.

27. Discriminative Diffusion Models as Few-shot Vision and Language Learners

33. On the Generation of Medical Dialogs for COVID-19

37. On the Generation of Medical Dialogues for COVID-19

38. On the Generation of Medical Dialogues for COVID19

42. Comprehensive model for managing water resources in the Baotou City, Inner Mongolia

Catalog

Books, media, physical & digital resources