Search

Your search keyword '"Wang Jingang"' showing total 58 results

Search Constraints

Start Over You searched for: Author "Wang Jingang" Remove constraint Author: "Wang Jingang" Database arXiv Remove constraint Database: arXiv
58 results on '"Wang Jingang"'

Search Results

1. Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern

2. Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

3. Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

4. Multi-Programming Language Sandbox for LLMs

5. FIRP: Faster LLM inference via future intermediate representation prediction

6. Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning

7. Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models

8. Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-context Models

9. Length Desensitization in Direct Preference Optimization

10. How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

11. ReMamba: Equip Mamba with Effective Long-Sequence Modeling

12. SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models

13. Graph-Structured Speculative Decoding

14. What's Wrong with Your Code Generated by Large Language Models? An Extensive Study

15. Rethinking LLM-based Preference Evaluation

16. EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction

17. Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism

18. Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration

19. What Makes Quantization for Large Language Models Hard? An Empirical Study from the Lens of Perturbation

20. Beyond the Known: Investigating LLMs Performance on Out-of-Domain Intent Detection

21. C-ICL: Contrastive In-context Learning for Information Extraction

22. DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning

23. Sibyl: Empowering Empathetic Dialogue Generation in Large Language Models via Sensible and Visionary Commonsense Inference

24. Improving Input-label Mapping with Demonstration Replay for In-context Learning

25. Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression

26. APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection

27. Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT

28. mCL-NER: Cross-Lingual Named Entity Recognition via Multi-view Contrastive Learning

29. Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation

30. FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue

31. GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

32. PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models

33. Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery

34. RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank

35. Task-agnostic Distillation of Encoder-Decoder Language Models

36. Lifting the Curse of Capacity Gap in Distilling Language Models

37. Multi-task Transformer with Relation-attention and Type-attention for Named Entity Recognition

38. Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration

39. UniNL: Aligning Representation Learning with Scoring Function for OOD Detection via Unified Neighborhood Learning

40. Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning Framework for OOD Intent Discovery

41. Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems

42. XPrompt: Exploring the Extreme of Prompt Tuning

43. Generalized Intent Discovery: Learning from Open World Dialogue System

44. Structural Bias for Aspect Sentiment Triplet Extraction

45. Unified Knowledge Prompt Pre-training for Customer Service Dialogues

46. CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations

47. MiniDisc: Minimal Distillation Schedule for Language Model Compression

48. Making Pretrained Language Models Good Long-tailed Learners

49. GNN-encoder: Learning a Dual-encoder Architecture via Graph Neural Networks for Dense Passage Retrieval

50. Deep Partial Multiplex Network Embedding

Catalog

Books, media, physical & digital resources