Search

Author search for "Wu, Tongshuang": 172 results in total

Search Results

1. Orbit: A Framework for Designing and Evaluating Multi-objective Rankers

2. HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation

3. What Is Wrong with My Model? Identifying Systematic Problems with Semantic Data Slicing

4. What You Say = What You Want? Teaching Humans to Articulate Requirements for LLMs

5. SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning

6. Synthetic Multimodal Question Generation

7. WebCanvas: Benchmarking Web Agents in Online Environments

8. Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness

9. Better Synthetic Data by Retrieving and Transforming Existing Datasets

10. Evaluating Mathematical Reasoning Beyond Accuracy

11. Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models

12. Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia

13. A large-scale audit of dataset licensing and attribution in AI

14. Measuring Adversarial Datasets

15. The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

16. Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using LLMs

17. How to Teach Programming in the AI Era? Using LLMs as a Teachable Agent for Debugging

18. From Nuisance to News Sense: Augmenting the News with Cross-Document Evidence and Context

19. Selenite: Scaffolding Online Sensemaking with Comprehensive Overviews Elicited from Large Language Models

20. Prompt2Model: Generating Deployable Models from Natural Language Instructions

21. LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs

22. Large Language Models Enable Few-Shot Clustering

23. Is AI the better programming partner? Human-Human Pair Programming vs. Human-AI pAIr Programming

24. Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses

25. DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions

26. BiasX: 'Thinking Slow' in Toxic Content Moderation with Explanations of Implied Social Biases

27. ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing

28. Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

29. Tool Learning with Foundation Models

30. Parachute: Evaluating Interactive Human-LM Co-writing Systems

31. ScatterShot: Interactive In-context Example Curation for Text Transformation

32. Decisions that Explain Themselves: A User-Centric Deep Reinforcement Learning Explanation System

33. Capabilities for Better ML Engineering

34. Towards Natural Language-Based Visualization Authoring

35. Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic Dataset for Narrative Comprehension

36. Are Shortest Rationales the Best Explanations for Human Understanding?

37. PromptChainer: Chaining Large Language Model Prompts through Visual Programming

38. StoryBuddy: A Human-AI Collaborative Chatbot for Parent-Child Interactive Storytelling with Flexible Parental Involvement

39. Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages

40. NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

41. AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts

42. It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books

43. DeHumor: Visual Analytics for Decomposing Humor

44. Tailor: Generating and Perturbing Text with Semantic Controls

45. Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models

46. Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance

47. Beyond Accuracy: Behavioral Testing of NLP models with CheckList

48. Technology-Enabled Disinformation: Summary, Lessons, and Recommendations
