Back to Search
Start Over
Linguistic constraints on statistical word segmentation: The role of consonants in Arabic and English
- Source :
- Kastner, I & Adriaans, F 2018, ' Linguistic constraints on statistical word segmentation : The role of consonants in Arabic and English ', Cognitive Science, vol. 42, pp. 494-518 . https://doi.org/10.1111/cogs.12521, Cognitive Science, 42(S2), 494. Wiley-Blackwell
- Publication Year :
- 2018
-
Abstract
- Statistical learning is often taken to lie at the heart of many cognitive tasks, including the acquisition of language. One particular task in which probabilistic models have achieved considerable success is the segmentation of speech into words. However, these models have mostly been tested against English data, and as a result little is known about how a statistical learning mechanism copes with input regularities that arise from the structural properties of different languages. This study focuses on statistical word segmentation in Arabic, a Semitic language in which words are built around consonantal roots. We hypothesize that segmentation in such languages is facilitated by tracking consonant distributions independently from intervening vowels. Previous studies have shown that human learners can track consonant probabilities across intervening vowels in artificial languages, but it is unknown to what extent this ability would be beneficial in the segmentation of natural language. We assessed the performance of a Bayesian segmentation model on English and Arabic, comparing consonant-only representations with full representations. In addition, we examined to what extent structurally different proto-lexicons reflect adult language. The results suggest that for a child learning a Semitic language, separating consonants from vowels is beneficial for segmentation. These findings indicate that probabilistic models require appropriate linguistic representations in order to effectively meet the challenges of language acquisition.
- Subjects :
- Consonant
Morphology
Computer science
Cognitive Neuroscience
Speech recognition
Experimental and Cognitive Psychology
computer.software_genre
Language Development
050105 experimental psychology
Speech segmentation
Cognition
Artificial Intelligence
Phonetics
morphology
word segmentation
Humans
Learning
Speech
0501 psychology and cognitive sciences
Language
Models, Statistical
Arabic
business.industry
Arab World
05 social sciences
Text segmentation
Bayes Theorem
Language acquisition
Semitic languages
Linguistics
Statistical learning
Constructed language
statistical learning
language acquisition
Word segmentation
Speech Perception
Artificial intelligence
Computational linguistics
business
computer
Natural language
Natural language processing
050104 developmental & child psychology
Subjects
Details
- Language :
- English
- ISSN :
- 03640213
- Database :
- OpenAIRE
- Journal :
- Kastner, I & Adriaans, F 2018, ' Linguistic constraints on statistical word segmentation : The role of consonants in Arabic and English ', Cognitive Science, vol. 42, pp. 494-518 . https://doi.org/10.1111/cogs.12521, Cognitive Science, 42(S2), 494. Wiley-Blackwell
- Accession number :
- edsair.doi.dedup.....210531e516c4aaad23823de173d6b0c6
- Full Text :
- https://doi.org/10.1111/cogs.12521