Back to Search Start Over

Error bars for lexicostatistical estimates, with a case study comparing the diversity of Chinese and Romance

Authors :
Alexander Maxwell
Louise McMillan
Source :
Linguistica Brunensia, Vol 72, Iss 1 (2024)
Publication Year :
2024
Publisher :
Masaryk University, 2024.

Abstract

This paper applies statistical techniques for measuring sampling error to lexicostatistics, a field in which error has often been discussed, but only rarely measured. We specifically calculate a margin of error for lexicostatistical comparisons based on Swadesh-type vocabulary lists, and use chi-squared tests to estimate a minimum threshold for when two lexicostatistical measurements will be statistically significantly different from one another. The article includes charts which mathematically unsophisticated scholars can easily use to check margins or error. We use margin of error calculations to test the claim that the relative internal diversity of Romance “languages” and Chinese “dialects” is equivalent, finding that no result is possible with extant lexicostatistical studies. We end by suggesting that lexicostatistical dendrograms depict uncertainty with “fat branches,” that is, branches whose width corresponds to statistical uncertainty.

Subjects

Subjects :
Philology. Linguistics
P1-1091

Details

Language :
Czech, English, Russian
ISSN :
18037410 and 23364440
Volume :
72
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Linguistica Brunensia
Publication Type :
Academic Journal
Accession number :
edsdoj.77f567d61ed34f759170ea73c11aa8c4
Document Type :
article
Full Text :
https://doi.org/10.5817/LB2024-37185