Back to Search
Start Over
Computational learning of construction grammars
- Source :
- Language and Cognition. 9:254-292
- Publication Year :
- 2016
- Publisher :
- Cambridge University Press (CUP), 2016.
-
Abstract
- This paper presents an algorithm for learning the construction grammar of a language from a large corpus. This grammar induction algorithm has two goals: first, to show that construction grammars are learnable without highly specified innate structure; second, to develop a model of which units do or do not constitute constructions in a given dataset. The basic task of construction grammar induction is to identify the minimum set of constructions that represents the language in question with maximum descriptive adequacy. These constructions must (1) generalize across an unspecified number of units while (2) containing mixed levels of representation internally (e.g., both item-specific and schematized representations), and (3) allowing for unfilled and partially filled slots. Additionally, these constructions may (4) contain recursive structure within a given slot that needs to be reduced in order to produce a sufficiently schematic representation. In other words, these constructions are multi-length, multi-level, possibly discontinuous co-occurrences which generalize across internal recursive structures. These co-occurrences are modeled using frequency and the ΔP measure of association, expanded in novel ways to cover multi-unit sequences. This work provides important new evidence for the learnability of construction grammars as well as a tool for the automated corpus analysis of constructions.
- Subjects :
- 060201 languages & linguistics
Linguistics and Language
business.industry
Computer science
Learnability
Experimental and Cognitive Psychology
06 humanities and the arts
02 engineering and technology
Construction grammar
Context-free grammar
computer.software_genre
Language and Linguistics
Grammar induction
Computational learning theory
0602 languages and literature
0202 electrical engineering, electronic engineering, information engineering
Stochastic context-free grammar
020201 artificial intelligence & image processing
Artificial intelligence
L-attributed grammar
business
computer
Poverty of the stimulus
Natural language processing
Subjects
Details
- ISSN :
- 18669859 and 18669808
- Volume :
- 9
- Database :
- OpenAIRE
- Journal :
- Language and Cognition
- Accession number :
- edsair.doi...........6084b410783989cda94bce238f4542a3