Back to Search Start Over

Measuring Lexical Diversity in Texts: The Twofold Length Problem

Authors :
Yves Bestgen
Source :
Language Learning. 2024 74(3):638-671.
Publication Year :
2024

Abstract

The impact of text length on the estimation of lexical diversity has captured the attention of the scientific community for more than a century. Numerous indices have been proposed, and many studies have been conducted to evaluate them, but the problem remains. This methodological review provides a critical analysis not only of the most commonly used indices in language learning studies, but also of the length problem itself, as well as of the methodology for evaluating the proposed solutions. Analysis of three data sets of texts produced by English language learners revealed that indices that reduce all texts to the same length using a probabilistic or an algorithmic approach solve the length-dependency problem; however, all these indices failed to address the second problem, which is their sensitivity to the parameter that determines the length to which the texts are reduced. The paper concludes with recommendations for optimizing lexical diversity analysis.

Details

Language :
English
ISSN :
0023-8333 and 1467-9922
Volume :
74
Issue :
3
Database :
ERIC
Journal :
Language Learning
Notes :
https://oasis-database.org
Publication Type :
Academic Journal
Accession number :
EJ1435073
Document Type :
Journal Articles<br />Reports - Research
Full Text :
https://doi.org/10.1111/lang.12630