Aims The current study aims to investigate whether the Ghostwriter Method improves the veracity judgments of laypeople. In order to do so, each participant will read one transcript derived from a previous study (How was your trip? Investigating cues to truth and deceit). Participants will then have to decide whether their transcript includes a truthful or deceitful statement. Primary Objective The primary objective of the current study is to examine whether laypeople more accurately identify a statement as true or false when the original interview was conducted using the Ghostwriter (enhanced) Method (vs. no Ghostwriter condition [control group]). This study will also examine whether summarising the statement before making a veracity judgment influences the deception detection accuracy. Secondary Objective(s) The secondary objective of the current study is to investigate what cues participants use to determine their veracity judgment, and whether these cues differ from those usually coded in deception detection research (e.g. common knowledge details, verifiable sources, plausibility, etc.). Furthermore, this study will examine whether other factors such as time spent reading the statement, or the participants’ first language influence the accuracy of the veracity judgment. Hypotheses Accuracy • How accurate are participants in their veracity judgement? • Does the condition (C vs GW vs GW-E) influence the accuracy? o Truthful statements tend to have more details (Vrij, 2008b, Leal et al., 2019) o The more details the more likely a statement is perceived as truthful (Bell & Loftus, 1989) o Fictitious statements are perceived as less plausible (Leal et al., 2019; Vrij, 2008) o In the first-ever conducted study, the Ghostwriter condition resulted in more details than the two control conditions, and truth-tellers stories sounded more plausible (Leal et al., 2019) • HYPOTHESES o The GW(-E) condition will lead to greater differences between truth-tellers and liars (e.g. in terms of number of details and plausibility) and therefore participants who read a script from the GW(-E) instruction will obtain higher accuracy than participants who read a transcript from the control condition Task to summarise • How does summarising work? (Bower, 1976; van Dijk, 1979; van Dijk & Kintsch, 1978) o We follow a general framework, i.e. a super-structure or schemata when we build or recall stories (e.g. setting, plot, characters, resolution, etc) o This framework is filled with macro- and micro-structures which are related to the actual content of the story Macro-structures: general facts, e.g. occupation of a character, a birthplace, a city where the story takes place (Bower, 1976) Micro-structures: specific facts, e.g. the character was a bus driver, she was born in Melbourne, the story takes place in London (Bower, 1976) o When we recall stories, certain things get omitted in order to summarise it, i.e. redundant information gets deleted, Location descriptions are deleted or integrated, propositions denoting emotional states are deleted For a more in-depth list, see (van Dijk, 1976) • Are the summaries of truthful statements more detailed? o Operationalisation: amount of details o Truthful statements tend to contain a higher amount of details (Vrij, 2014) The more details the more likely a statement is perceived as truthful (Bell & Loftus, 1989) o Truthful statements sound more plausible, especially in GW condition (Leal et al., 2019) While the GW method encourages both truth-tellers and liars to say more, liars seem to struggle with coming up with additional details that also sound plausible (Leal et al., 2019) Fictitious statements are perceived as less plausible (Leal et al., 2019; Vrij, 2008) o The more plausible a story the easier it is to remember it (Black et al., 1986) • HYPOTHESES: o Statements from the GW(-E) condition will be summarised in greater detail compared to statements from the Control condition o truthful statements will be summarised in greater detail than fictitious statements Exploratory Questions Accuracy • What factors potentially influence the accuracy? o Predictors used: the first language of the reader, time spent reading, self-perceived lie-detection skills, confidence in their own judgement and whether they have summarised the statement Subjective cues / coded variables • Subjective cues = subjective cues participants base their judgement on • Coded variables = cues we actually code for, e.g. complications, common-knowledge details, etc. • What cues do participants use to make their judgement? o Open question where participants can list what made them think the interviewee was a truth-teller/liar o Literature: (Im)plausibility, Amount of details, Consistency, Contradictions, (Lack of) emotions (Masip et al., 2012) Verifiability, number of details, pronouns, plausibility, unnecessary details, feelings/opinions, statement length, overcomplicated information, specificity of information (Vernham et al., 2020) • What type of details do participants highlight (e.g., filler words, (unnecessary) details, etc.)? • Is there an overlap between subjective cues participants highlight and what we code for (coded variables)? • Is there a relationship between the types of details participants highlight and their veracity judgement? Participants. Using WebPower (Zhang et al., 2018) a priori calculations indicated that recruiting 206 participants would result in a power of .90. This result is for a study that has an alpha level of .05 and a medium effect size of f = .25. Participants will be recruited via Prolific. To make use of their representative sample option (UK), at least 300 participants have to be recruited. This will allow us to generalise our findings for a broader population (for a sample breakdown from Prolific, see below). This sample size is also congruent with the amount of participants recruited in previous research in the area of deception detection (e.g., Leal et al., 2015; Vrij & Granhag, 2012; Vrij, Leal, Mann, & Granhag, 2011). A sample of 300 will result in 25 participants per condition (2 (Veracity) x 3 (Interview Instructions) x 2 (Summary y/n). The main statistical analysis will be a three-way ANOVA. Sample breakdown - United Kingdom Factors: Sex, Age, Ethnicity (Simplified GB Census) Sex Age Ethnicity Female 153 18-27 53 Asian 29 Male 147 28-37 53 Black 20 38-47 56 Mixed 13 48-57 50 Other 10 58+ 88 White 228 Design The experiment will use a 2 (Veracity: truth-tellers vs. liars) x 3 (Instruction: control vs. ghostwriter vs. ghostwriter enhanced) x 2 (Summarising statement: yes vs. no) between-subjects design. The dependent variable for the main calculation will be the accuracy rate. Furthermore, other variables will be investigated to explore their influence on veracity accuracy, such as the amount of time spent reading the statements, the participant's first language, the participant’s self-perceived lie detection skills, and the participant’s confidence in their own judgements. In addition, participants will indicate which words and/or passages they used to make their veracity judgements. These will be compared to cues that have been commonly used in previous research, e.g., complications, common knowledge details, plausibility, etc (Leal et al., 2019; Leal, Vrij, Deeb, et al., 2018; Leal, Vrij, Vernham, et al., 2018). Additionally, an open question provides participants with the option to freely express what they based their decision on. Material The statements used in the proposed study originate from a previous experiment (data collection Feb – Mar 2020). Here, participants were asked to talk about a city trip they have made in the last 12 months. Participants were either truthful or deceptive, in which case they have never been to the city they talked about. Interviewees were randomly assigned to one of three interview instructions (Control vs. Ghostwriter (GW) vs. Ghostwriter-enhanced (GWE)). Unfortunately, data collection for that experiment was interrupted by COVID-19 restrictions in March 2020. Up to that point the study had produced 99 transcripts. However, the transcripts in the interrupted study were not equally distributed over all conditions, ranging between 12 and 20 interviews per cell (see below). In the proposed online study participants will be exposed to an equal number of transcripts (i.e. 12 transcripts per group [3 (interview instruction) x 2 (veracity)]), therefore only 72 of the 99 transcripts will be used. The transcripts within each group were chosen at random. Given the target sample size of 300, 4-5 participants will judge the same transcript. Number of participants per cell in the previous study collected before the interruption due to COVID-19: Truthteller Control (C) 19 Ghostwriter (GW) 18 Ghostwriter Enhanced (GW-E) 20 Liar Control (C) 14 Ghostwriter (GW) 12 Ghostwriter Enhanced (GW-E) 16 Procedure When a potential participant clicks on the link, they will be taken to Qualtrics where the study was designed. The first page will include the participant information sheet and the consent form. This will inform participants about the content of the study and how long the study will take. A pilot study with 24 participants indicated that it will take about 20min, however, this may vary depending on the length of the transcript. They will also be informed that the study involves reading a statement from a person who discusses a city trip and that their task will be to judge whether that person actually made that trip or not (i.e., told the truth or not). On the next page, they will be asked to give their consent. Participants will then be pseudo-randomly assigned to a condition, i.e., to one of the twelve possible groups (veracity: lie/truth + condition: control/ghostwriter/ghostwriter enhanced + summary: yes/no). To prevent biasing the judgement, participants will be asked to indicate which places they have been to before. A list of cities that match the ones discussed in the transcripts will be provided to the participants. Participants will only be assigned to one transcript that contains a place they have not previously been to themselves. For example, if the participant has been to Mallorca and London, but not to Amsterdam or Munich, they will be randomly assigned to a transcript that discusses a city trip to Amsterdam or Munich. In the unlikely case that a participant has been to every city that is included in the transcripts, the survey will end. Each participant will only judge the veracity of one transcript to circumvent comparisons. Each participant will therefore see one out of six possible types of transcripts, depending on the original interview instruction (control vs. GW vs. GW-enhanced) and whether the interviewee lied or told the truth (veracity). Without the participant’s knowledge, Qualtrics measures the amount of time a participant spends on this page and will be interpreted as “time spent reading”. Not only will this be used as a variable to determine whether the time spent reading the statement has an influence on the accuracy, but also as a control variable. We cannot control whether a participant actually reads continuously or just walks away from their computer since this is an online study. Therefore, we will look at the average time the other 3-4 ppts needed for the same transcript, and if someone exceeds that by more than +2SD, that participant will be excluded. Depending on the condition the participant is randomly assigned to, they will either immediately make their veracity judgement (no summary condition) or will be asked to summarise key points of the statement in bullet points first and make their judgement afterwards (summary condition). After that, they will answer a couple of questions regarding how confident they are in their judgement and how they assess their own deception detection skills compared to others. Participants will then have the option to highlight words and/or passages that indicated to them that the interviewee was lying or telling the truth. This will also include an open-ended question (“What made you think the interviewee was a liar or a truth-teller?”) where participants can write down the cues they used to determine deceit. As this is quite an unfamiliar task for an online study, a short test question will provide the participants with the necessary practice to familiarise themselves with highlighting passages. Besides socio-demographic questions, participants will be asked whether English is their first language, and if it is not, how fluent their English is based on the CEFR. To get a better understanding of how fluent a participant is, participants will also indicate how often they use English in their everyday life (e.g. daily, at least once a week, etc.). This way we want to get a more well-rounded picture of how versed someone is. Finally, participants will indicate how they perceive their own lie detection skills compared to others, and how convinced they are that their own veracity judgement is correct. On completion of the study, the participant will be fully debriefed, and paid £2.35/h (for an estimated completion time of 20 minutes). Planned Data Analyses The main analysis will be a three-way between-subject ANOVA: 2 (Veracity: truth-tellers vs. liars) x 3 (Instruction: control vs. ghostwriter vs. ghostwriter enhanced) x 2 (Summarising statement: yes vs. no) with accuracy rate as the dependent variable. In regard to the secondary objective, a (binary) logistic regression will be conducted with accuracy as a dependent variable. The predictors will be the first language of the readers, time spent reading, self-perceived lie-detection skills, confidence in their own judgement and whether they have summarised the statement. On an exploratory level, the extent to which participants focus on the same markers when it comes to verbal lie detection (e.g. what words they highlighted) will be examined. Additionally, various correlations and other statistical methods will be used to investigate the exploratory questions outlined above. References Bell, B. E., & Loftus, E. F. (1989). Trivial Persuasion in the Courtroom: The Power of (a Few) Minor Details. Journal of Personality and Social Psychology, 56(5), 669–679. https://doi.org/10.1037/0022-3514.56.5.669 Black, A., Freeman, P., & Johnson-Laird, P. N. (1986). Plausibility and the comprehension of text. British Journal of Psychology, 77(1), 51–62. https://doi.org/10.1111/J.2044-8295.1986.TB01980.X Bower, G. H. (1976). Experiments on Story Understanding and Recall. Quarterly Journal of Experimental Psychology, 28(4), 511–534. https://doi.org/10.1080/14640747608400579 Leal, S., Vrij, A., Deeb, H., & Kamermans, K. (2019). Encouraging interviewees to say more and deception: The ghostwriter method. Legal and Criminological Psychology, 24(2), 273–287. https://doi.org/10.1111/lcrp.12152 Masip, J., Bethencourt, M., Lucas, G., Segundo, M. S. S., & Herrero, C. (2012). Deception detection from written accounts. Scandinavian Journal of Psychology, 53(2), 103–111. https://doi.org/10.1111/j.1467-9450.2011.00931.x van Dijk, T. A. (1976). Narrative macro-structures: logical and cognitive foundations. PTL: Journal for Descriptive Poetics and Theory of Literature, 1, 547–568. van Dijk, T. A. (1979). Recalling and Summarizing Complex Discourse. In Text Processing / Textverarbeitung (pp. 49–118). https://doi.org/10.1515/9783110837537-004 van Dijk, T. A., & Kintsch, W. (1978). Cognitive Psychology and Discourse: Recalling and Summarizing Stories. In W. U. Dressler (Ed.), Current Trends in Textlinguistics (pp. 61–80). Walter De Gruyter Inc. https://doi.org/10.1515/9783110853759.61 Vernham, Z., Vrij, A., Nahari, G., Leal, S., Mann, S., Satchell, L., & Orthey, R. (2020). Applying the verifiability approach to deception detection in alibi witness situations. Acta Psychologica, 204, 103020. https://doi.org/10.1016/j.actpsy.2020.103020 Vrij, A. (2008). Individual Verbal Cues to Deception. In G. Davies & R. Bull (Eds.), Detecting lies and deceit: pitfalls and opportunities (2nd ed., pp. 101–114). John Wiley & Sons. Vrij, A. (2014). Statement Validity Assessment. In G. Davies & R. Bull (Eds.), Encyclopedia of Deception (2nd ed., pp. 201–260). John Wiley & Sons. https://doi.org/10.4135/9781483306902.n334