Search

Your search keyword '"Yan, Ming"' showing total 189 results

Search Constraints

Start Over You searched for: Author "Yan, Ming" Remove constraint Author: "Yan, Ming" Publication Type Reports Remove constraint Publication Type: Reports
189 results on '"Yan, Ming"'

Search Results

1. Decoupling Layout from Glyph in Online Chinese Handwriting Generation

2. Trustworthy Hate Speech Detection Through Visual Augmentation

3. SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing

4. mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding

5. MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model

6. ProFuser: Progressive Fusion of Large Language Models

7. mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

8. MIBench: Evaluating Multimodal Large Language Models over Multiple Images

9. Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models

10. DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation

11. Modeling Comparative Logical Relation with Contrastive Learning for Text Generation

12. Text-like Encoding of Collaborative Information in Large Language Models for Recommendation

13. Initialization-enhanced Physics-Informed Neural Network with Domain Decomposition (IDPINN)

14. Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration

15. Time-Varying Graph Signal Recovery Using High-Order Smoothness and Adaptive Low-rankness

16. TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning

17. Understanding Emotional Hijacking in Metaverse

18. Protecting Human Users Against Cognitive Attacks in Immersive Environments

19. Safeguarding People's Financial Health in Metaverse with Emotionally Intelligent Virtual Buddy

20. Adaptive Feature Fusion Neural Network for Glaucoma Segmentation on Unseen Fundus Images

21. Shortcuts Arising from Contrast: Effective and Covert Clean-Label Attacks in Prompt-Based Learning

22. Collaborative Knowledge Infusion for Low-resource Stance Detection

23. RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method

24. ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy

25. SocialBench: Sociality Evaluation of Role-Playing Conversational Agents

26. mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

27. Efficient sparse probability measures recovery via Bregman gradient

28. Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training

29. Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval

30. Budget-Constrained Tool Learning with Planning

31. Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models

32. PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs

33. Model Composition for Multimodal Large Language Models

34. Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion

35. Enabling Weak LLMs to Judge Response Reliability via Meta Ranking

36. Are Generative AI systems Capable of Supporting Information Needs of Patients?

37. Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

38. Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

39. Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection

40. LARP: Language-Agent Role Play for Open-World Games

41. TiMix: Text-aware Image Mixing for Effective Vision-Language Pre-training

42. Hallucination Augmented Contrastive Learning for Multimodal Large Language Model

43. mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model

44. AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation

45. mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

46. MCC-KD: Multi-CoT Consistent Knowledge Distillation

47. Physical Information Neural Networks for Solving High-index Differential-algebraic Equation Systems Based on Radau Methods

48. A numerical investigation of quasi-static magnetoconvection with an imposed horizontal magnetic field

49. UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model

50. ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models

Catalog

Books, media, physical & digital resources