1. Spelling performance on the web and in the lab
- Author
Chloe Olivier, Arnaud Rey, Jean-Luc Manguin, Sébastien Pacton, Pierre Courrieu
Affiliations: Laboratoire de psychologie cognitive (LPC), Centre National de la Recherche Scientifique (CNRS)-Aix Marseille Université (AMU); Equipe Hultech, Laboratoire GREYC (UMR6072), Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen (GREYC), CNRS-École Nationale Supérieure d'Ingénieurs de Caen (ENSICAEN)-Université de Caen Normandie (UNICAEN), Normandie Université (NU); Laboratoire de Psychologie et Neuropsychologie Cognitives (LPNCog FRE 3292), CNRS-Institut National de la Santé et de la Recherche Médicale (INSERM)-Université Paris Descartes - Paris 5 (UPD5); Institut des Sciences de la Terre de Paris (iSTeP), Institut national des sciences de l'Univers (INSU - CNRS)-Sorbonne Université (SU)-CNRS; Laboratoire Cognition et Comportement (FRE 2987)
Funding: ANR-17-CE28-0013, CHUNKED, Chunking: a study of the critical role of information compression in cognition (2017); ANR-11-IDEX-0001, Amidex, INITIATIVE D'EXCELLENCE AIX MARSEILLE UNIVERSITE (2011); ANR-16-CONV-0002, ILCB, Institute of Language Communication and the Brain (2016; also listed as 2017); ANR-11-IDEX-0005, USPC, Université Sorbonne Paris Cité (2011); ANR-11-IDEX-0001-02/11-LABX-0036, BLRI, Brain and Language Research Institute (2011)
- Subjects
Male, Vocabulary, Computer science, Writing, Social Sciences, Psycholinguistics, [SCCO]Cognitive science, Database and Informatics Methods, 0302 clinical medicine, Mathematical and Statistical Techniques, Psychology, Grammar, Multidisciplinary, 05 social sciences, Statistics, Phonology, Experimental Psychology, Spelling, Semantics, [SCCO.PSYC]Cognitive science/Psychology, Physical Sciences, Information Retrieval, Medicine, Regression Analysis, Female, Word Processing, Research Article, Science, Sample (statistics), Research and Analysis Methods, 050105 experimental psychology, World Wide Web, 03 medical and health sciences, Young Adult, Literacy, Humans, 0501 psychology and cognitive sciences, Statistical Methods, Lexicons, Phonemes, Biology and Life Sciences, Linguistics, 030217 neurology & neurosurgery, Mathematics
- Abstract
Several dictionary websites give access to semantic, synonym, or spelling information about a given word. Over nine years, we systematically recorded every letter sequence entered into a French web dictionary. This yielded a total of 200 million orthographic forms, allowing us to create a large-scale database of spelling errors that could inform psychological theories of spelling processes. To check the reliability of this big data methodology, we selected a sample of 100 frequently misspelled words from the database, and a group of 100 French university students performed a spelling-to-dictation test on this word list. The results showed a strong correlation between the two data sets in the frequencies of the spellings produced (r = 0.82). Although the distributions of spelling errors were relatively consistent across the two databases, the proportions of correct responses differed significantly. Regression analyses allowed us to generate possible explanations for these differences in terms of task-dependent factors. We argue that comparing the results of such large-scale databases with those of standard, controlled experimental paradigms is a good way to determine the conditions under which this big data methodology can adequately inform psychological theories.
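The reliability check described in the abstract compares how often each spelling is produced in the web logs and in the dictation test. The abstract does not state the exact unit of analysis, so the following Python sketch shows only one plausible reading: for each target word, every produced spelling is converted to a proportion of responses in each source, and a Pearson correlation is computed over all (word, spelling) pairs. The function names and the toy data are illustrative assumptions, not the authors' code or data.

```python
from collections import Counter
from math import sqrt


def spelling_proportions(responses):
    """Map each produced spelling to its share of all responses for one word."""
    counts = Counter(responses)
    total = sum(counts.values())
    return {form: n / total for form, n in counts.items()}


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def compare_sources(web, lab):
    """Correlate spelling proportions across two sources.

    `web` and `lab` map a target word to the list of spellings produced for it.
    A spelling missing from one source counts as a proportion of 0.0 there.
    """
    xs, ys = [], []
    for word in sorted(web.keys() & lab.keys()):
        web_p = spelling_proportions(web[word])
        lab_p = spelling_proportions(lab[word])
        for form in sorted(web_p.keys() | lab_p.keys()):
            xs.append(web_p.get(form, 0.0))
            ys.append(lab_p.get(form, 0.0))
    return pearson_r(xs, ys)


# Toy data, invented for illustration only (not taken from the study).
web = {
    "accueil": ["accueil"] * 6 + ["acceuil"] * 4,
    "dilemme": ["dilemme"] * 5 + ["dilemne"] * 5,
}
lab = {
    "accueil": ["accueil"] * 8 + ["acceuil"] * 2,
    "dilemme": ["dilemme"] * 7 + ["dilemne"] * 3,
}

if __name__ == "__main__":
    print(f"r = {compare_sources(web, lab):.2f}")
```

Using proportions rather than raw counts keeps the two sources comparable even though a web log of millions of entries and a 100-participant test differ greatly in size.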
- Published
- 2019