1. Getting Closer to AI Complete Question Answering: A Set of Prerequisite Real Tasks
- Author
-
Anna Rogers, Olga Kovaleva, Matthew Downey, and Anna Rumshisky
- Subjects
Computer science ,business.industry ,Factoid ,AI-complete ,Commonsense reasoning ,02 engineering and technology ,General Medicine ,computer.software_genre ,Task (project management) ,03 medical and health sciences ,Annotation ,0302 clinical medicine ,Reading comprehension ,030221 ophthalmology & optometry ,0202 electrical engineering, electronic engineering, information engineering ,Question answering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Set (psychology) ,business ,computer ,Natural language processing - Abstract
The recent explosion in question answering research produced a wealth of both factoid reading comprehension (RC) and commonsense reasoning datasets. Combining them presents a different kind of task: deciding not simply whether information is present in the text, but also whether a confident guess could be made for the missing information. We present QuAIL, the first RC dataset to combine text-based, world knowledge and unanswerable questions, and to provide question type annotation that would enable diagnostics of the reasoning strategies by a given QA system. QuAIL contains 15K multi-choice questions for 800 texts in 4 domains. Crucially, it offers both general and text-specific questions, unlikely to be found in pretraining data. We show that QuAIL poses substantial challenges to the current state-of-the-art systems, with a 30% drop in accuracy compared to the most similar existing dataset.
- Published
- 2020
- Full Text
- View/download PDF