Search

Your search for author "Jurafsky, Dan" returned 563 results

Search Results

1. Belief in the Machine: Investigating Epistemological Blind Spots of Language Models

2. Bayesian scaling laws for in-context learning

3. Can Unconfident LLM Annotations Be Used for Confident Conclusions?

4. A layer-wise analysis of Mandarin and English suprasegmentals in SSL speech models

5. h4rm3l: A Dynamic Benchmark of Composable Jailbreak Attacks for LLM Safety Assessment

6. Data Checklist: On Unit-Testing Datasets with Usable Information

7. Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance

8. ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets

9. ReFT: Representation Finetuning for Language Models

10. NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps

11. Dialect prejudice predicts AI decisions about people's character, employability, and criminality

13. CausalGym: Benchmarking causal interpretability methods on linguistic tasks

14. How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis

15. Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens

16. AnthroScore: A Computational Linguistic Measure of Anthropomorphism

17. KTO: Model Alignment as Prospect Theoretic Optimization

18. Use large language models to promote equity

19. Grounding Gaps in Language Model Generations

20. A Benchmark for Learning to Translate a New Language from One Grammar Book

21. Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

22. Learning the meanings of function words from grounded language using a visual question answering model

23. Othering and low status framing of immigrant cuisines in US restaurant reviews and large language models

24. Ecosystem-level Analysis of Deployed Machine Learning Reveals Homogeneous Outcomes

25. Developing Speech Processing Pipelines for Police Accountability

26. Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models

27. Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation

28. string2string: A Modern Python Library for String-to-String Algorithms

29. Injecting structural hints: Using language models to study inductive biases in language learning

30. Foundation Models and Fair Use

31. Navigating the Grey Area: How Expressions of Uncertainty and Overconfidence Affect Language Models

32. Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions

34. Can Automated Feedback Improve Teachers' Uptake of Student Ideas? Evidence from a Randomized Controlled Trial in a Large-Scale Online Course. EdWorkingPaper No. 21-483

35. Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models

36. Picking on the Same Person: Does Algorithmic Monoculture lead to Outcome Homogenization?

37. Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding

38. Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

39. Multilingual BERT has an accent: Evaluating English influences on fluency in multilingual models

40. When and why vision-language models behave like bags-of-words, and what to do about it?

41. Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions. EdWorkingPaper No. 21-424

42. Computationally Identifying Funneling and Focusing Questions in Classroom Discourse

43. Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset

44. The Authenticity Gap in Human Evaluation

45. Prompt-and-Rerank: A Method for Zero-Shot and Few-Shot Arbitrary Textual Style Transfer with Small Language Models

46. Richer Countries and Richer Representations

47. Problems with Cosine as a Measure of Embedding Similarity for High Frequency Words

48. Modular Domain Adaptation

49. Automated speech tools for helping communities process restricted-access corpora for language revival efforts
