Search

Your search for "Zhao Dongyan" returned 137 results in total.

Search Constraints

Author: "Zhao Dongyan"
Database: arXiv

Search Results

1. ReasVQA: Advancing VideoQA with Imperfect Reasoning Process

2. Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding

3. VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

4. FIRP: Faster LLM inference via future intermediate representation prediction

5. Understanding Multimodal Hallucination with Parameter-Free Representation Alignment

6. ReMamba: Equip Mamba with Effective Long-Sequence Modeling

7. Evidence-Enhanced Triplet Generation Framework for Hallucination Alleviation in Generative Question Answering

8. In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting

9. Internal and External Knowledge Interactive Refinement Framework for Knowledge-Intensive Question Answering

10. Graph-Structured Speculative Decoding

11. End-to-End Video Question Answering with Frame Scoring Mechanisms and Adaptive Sampling

12. Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules

13. Unlocking the Potential of Model Merging for Low-Resource Languages

14. VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models

15. Efficient Continual Pre-training by Mitigating the Stability Gap

16. E-Bench: Towards Evaluating the Ease-of-Use of Large Language Models

17. Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework

18. Leveraging 2D Information for Long-term Time Series Forecasting with Vanilla Transformers

19. xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

20. Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration

21. StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation

22. HawkEye: Training Video-Text LLMs for Grounding Text in Videos

23. What Makes Quantization for Large Language Models Hard? An Empirical Study from the Lens of Perturbation

24. PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion

25. Probing Multimodal Large Language Models for Global and Local Semantic Representations

26. Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering

27. Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge

28. STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering

29. Multi-Granularity Information Interaction Framework for Incomplete Utterance Rewriting

30. Relation-Aware Question Answering for Heterogeneous Knowledge Graphs

31. A Step Closer to Comprehensive Answers: Constrained Multi-Stage Question Decomposition with Large Language Models

32. Language Models can be Logical Solvers

33. PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

34. Improving Input-label Mapping with Demonstration Replay for In-context Learning

35. From Simple to Complex: A Progressive Framework for Document-level Informative Argument Extraction

36. Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression

37. SCALE: Synergized Collaboration of Asymmetric Language Translation Engines

38. Teaching Text-to-Image Models to Communicate in Dialog

39. CharacterChat: Learning towards Conversational AI with Personalized Social Support

40. Leveraging Denoised Abstract Meaning Representation for Grammatical Error Correction

41. Class-Incremental Learning based on Label Generation

42. From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate Generation

43. MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning

44. How Many Answers Should I Give? An Empirical Study of Multi-Answer Reading Comprehension

45. Shuo Wen Jie Zi: Rethinking Dictionaries and Glyphs for Chinese Language Pre-training

46. VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions

47. The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code

48. PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models

49. More than Classification: A Unified Framework for Event Temporal Relation Extraction

50. RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank
