Back to Search
Start Over
AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts
- Source :
- EMNLP (1)
- Publication Year :
- 2020
-
Abstract
- The remarkable success of pretrained language models has motivated the study of what kinds of knowledge these models learn during pretraining. Reformulating tasks as fill-in-the-blanks problems (e.g., cloze tests) is a natural approach for gauging such knowledge, however, its usage is limited by the manual effort and guesswork required to write suitable prompts. To address this, we develop AutoPrompt, an automated method to create prompts for a diverse set of tasks, based on a gradient-guided search. Using AutoPrompt, we show that masked language models (MLMs) have an inherent capability to perform sentiment analysis and natural language inference without additional parameters or finetuning, sometimes achieving performance on par with recent state-of-the-art supervised models. We also show that our prompts elicit more accurate factual knowledge from MLMs than the manually created prompts on the LAMA benchmark, and that MLMs can be used as relation extractors more effectively than supervised relation extraction models. These results demonstrate that automatically generated prompts are a viable parameter-free alternative to existing probing methods, and as pretrained LMs become more sophisticated and capable, potentially a replacement for finetuning.<br />v2: Fixed error in Figure 2
- Subjects :
- FOS: Computer and information sciences
050101 languages & linguistics
Computer Science - Machine Learning
Computer Science - Computation and Language
Relation (database)
Computer science
business.industry
05 social sciences
Sentiment analysis
02 engineering and technology
computer.software_genre
Relationship extraction
Machine Learning (cs.LG)
Set (abstract data type)
0202 electrical engineering, electronic engineering, information engineering
Benchmark (computing)
020201 artificial intelligence & image processing
0501 psychology and cognitive sciences
Language model
Artificial intelligence
Natural approach
business
computer
Computation and Language (cs.CL)
Natural language processing
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- EMNLP (1)
- Accession number :
- edsair.doi.dedup.....9dc84d9415f70183e2f937a864d861e6