Measurement of Lexical Diversity in Children’s Spoken Language: Computational and Conceptual Considerations

Authors :
Ji Seung Yang
Carly Rosvold
Nan Bernstein Ratner
Source :
Frontiers in Psychology, Vol 13 (2022)
Publication Year :
2022
Publisher :
Frontiers Media S.A., 2022.

Abstract

Background: Type-Token Ratio (TTR), given its relatively simple hand computation, is one of the few language sample analysis (LSA) measures calculated by clinicians in everyday practice. However, it has significant, well-documented shortcomings; these include instability as a function of sample size and the absence of clear developmental profiles over early childhood. A variety of alternative measures of lexical diversity have been proposed; some, such as Number of Different Words/100 (NDW), can also be computed by hand. Others, such as Vocabulary Diversity (VocD) and the Moving Average Type Token Ratio (MATTR), rely on complex resampling algorithms that cannot be carried out by hand. To date, no large-scale study of all four measures has evaluated how well any of them captures typical developmental trends over early childhood, or whether any reliably distinguishes typical from atypical profiles of expressive child language ability.

Materials and Methods: We conducted linear and non-linear regression analyses for TTR, NDW, VocD, and MATTR scores for samples taken from 946 corpora from typically developing preschool children (ages 2–6 years), engaged in adult-child toy play, from the Child Language Data Exchange System (CHILDES). These were contrasted with 504 samples from children known to have delayed expressive language skills (total n = 1,454 samples). We also conducted a separate sub-analysis which examined possible contextual effects of sampling environment on lexical diversity.

Results: Only VocD showed significantly different mean scores between the typically developing group and the group with delayed expressive language. Using TTR would actually misdiagnose typical children and miss children with known language impairment. However, the effect of toy interactions on VocD was significant, which emerges as a further caution in the use of lexical diversity as a valid proxy index of children’s expressive vocabulary skill.

Discussion: This large-scale statistical comparison of computer-implemented algorithms for expressive lexical profiles in young children with traditional, hand-calculated measures showed that only VocD met criteria for evidence-based use in LSA. However, VocD was affected by sample elicitation context, suggesting that non-linguistic factors, such as engagement with elicitation props, contaminate estimates of spoken lexical skill in young children. Implications and suggested directions are discussed.
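
To make the contrast between TTR and a windowed measure concrete, the following is a minimal Python sketch, not the authors' implementation or the CHILDES tooling. It assumes the transcript has already been split into word tokens; the function names `ttr` and `mattr` and the default window length are illustrative choices rather than values taken from the study.

```python
def ttr(tokens):
    """Type-token ratio: number of distinct word forms / number of tokens."""
    if not tokens:
        raise ValueError("need at least one token")
    return len(set(tokens)) / len(tokens)


def mattr(tokens, window=10):
    """Moving-average TTR: mean TTR over every contiguous window.

    The window length is an assumption for illustration only. If the
    sample is no longer than one window, this falls back to plain TTR.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    if len(tokens) <= window:
        return ttr(tokens)
    ratios = [ttr(tokens[i:i + window])
              for i in range(len(tokens) - window + 1)]
    return sum(ratios) / len(ratios)


if __name__ == "__main__":
    sample = "the dog saw the cat and the cat saw the dog".split()
    longer = sample * 2  # same vocabulary, twice as many tokens
    # TTR falls as the sample grows even though the vocabulary is unchanged.
    print(ttr(sample), ttr(longer))
    # MATTR uses a fixed window, so it is far less sensitive to sample length.
    print(mattr(sample, window=5), mattr(longer, window=5))
```

Because every window has the same length, the moving average avoids the dependence on total sample size that the abstract identifies as a weakness of TTR. VocD relies on a different resampling and curve-fitting procedure and is not sketched here.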

Details

Language :
English
ISSN :
1664-1078
Volume :
13
Database :
Directory of Open Access Journals
Journal :
Frontiers in Psychology
Publication Type :
Academic Journal
Accession number :
edsdoj.0a0506c2d86486787201132cc212780
Document Type :
article
Full Text :
https://doi.org/10.3389/fpsyg.2022.905789