Search

Your search keyword '"Yan, Ming"' showing total 17,169 results

Search Constraints

Start Over You searched for: Author "Yan, Ming" Remove constraint Author: "Yan, Ming"
17,169 results on '"Yan, Ming"'

Search Results

1. Decoupling Layout from Glyph in Online Chinese Handwriting Generation

2. Trustworthy Hate Speech Detection Through Visual Augmentation

3. SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing

4. mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding

5. MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model

6. ProFuser: Progressive Fusion of Large Language Models

7. mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

8. MIBench: Evaluating Multimodal Large Language Models over Multiple Images

9. Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models

10. DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation

11. Modeling Comparative Logical Relation with Contrastive Learning for Text Generation

12. Text-like Encoding of Collaborative Information in Large Language Models for Recommendation

13. Initialization-enhanced Physics-Informed Neural Network with Domain Decomposition (IDPINN)

14. Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration

15. Time-Varying Graph Signal Recovery Using High-Order Smoothness and Adaptive Low-rankness

16. Temperature and light reverse the fertility of rice P/TGMS line ostms19 via reactive oxygen species homeostasis.

17. TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning

18. Understanding Emotional Hijacking in Metaverse

19. Protecting Human Users Against Cognitive Attacks in Immersive Environments

20. Safeguarding People's Financial Health in Metaverse with Emotionally Intelligent Virtual Buddy

21. Adaptive Feature Fusion Neural Network for Glaucoma Segmentation on Unseen Fundus Images

22. Shortcuts Arising from Contrast: Effective and Covert Clean-Label Attacks in Prompt-Based Learning

23. Collaborative Knowledge Infusion for Low-resource Stance Detection

24. RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method

25. ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy

26. SocialBench: Sociality Evaluation of Role-Playing Conversational Agents

27. mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

28. Efficient sparse probability measures recovery via Bregman gradient

29. Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training

30. Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval

31. Budget-Constrained Tool Learning with Planning

32. Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models

33. PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs

34. Model Composition for Multimodal Large Language Models

35. Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion

36. Enabling Weak LLMs to Judge Response Reliability via Meta Ranking

42. The Effect of the COVID-19 Pandemic on Health Care Workers’ Anxiety Levels: Protocol for a Meta-Analysis

43. Are Generative AI systems Capable of Supporting Information Needs of Patients?

44. Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

45. Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

46. Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection

47. LARP: Language-Agent Role Play for Open-World Games

48. TiMix: Text-aware Image Mixing for Effective Vision-Language Pre-training

49. Hallucination Augmented Contrastive Learning for Multimodal Large Language Model

50. mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model

Catalog

Books, media, physical & digital resources